This is a great data set. I only scraped back about a year because the slowdown on the LetsRun servers was obvious when I was making requests.
Could you post the data somewhere?
I always liked a kind of 'h-index' for thread starters:
Who has the largest N such that they have started at least N threads with at least N replies each?
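For anyone who wants to try it, here is a rough sketch of that calculation in Python. It assumes the scraped data can be boiled down to (thread starter, reply count) pairs; the function names and the sample rows are made up for illustration and are not from the real data set.

```python
from collections import defaultdict


def h_index(reply_counts):
    """Largest N such that at least N of the counts are >= N."""
    ranked = sorted(reply_counts, reverse=True)
    h = 0
    for rank, replies in enumerate(ranked, start=1):
        if replies >= rank:
            h = rank
        else:
            break
    return h


def h_index_by_starter(threads):
    """threads: iterable of (starter, reply_count) pairs (assumed layout)."""
    by_starter = defaultdict(list)
    for starter, replies in threads:
        by_starter[starter].append(replies)
    return {starter: h_index(counts) for starter, counts in by_starter.items()}


# Made-up sample rows, not real LetsRun data.
threads = [
    ("alice", 10), ("alice", 3), ("alice", 2), ("alice", 0),
    ("bob", 50), ("bob", 1),
]
scores = h_index_by_starter(threads)
print(scores)
```

Note that one huge thread does not help much here: the made-up "bob" has a 50-reply thread but still scores lower than "alice", which is the whole point of the metric.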
With the BroJos' permission, I would post the data set. I have been trying to get the data onto Google BigQuery, which would let me (or potentially anyone, if it were made public) query the dataset on Google's servers. That would allow for extremely fast processing, and users can process 1 TB of data per month for free.
That's an interesting question to pose to the data. It would probably be a more interesting metric than seeing who has the most posts.
It would take forever to pull that off LetsRun's servers, but once you get it, I have a pretty kickass computer that should be able to rip through that kind of stuff, if you were interested.
I already have all the data pulled from LetsRun and stored locally - and yes, it did take forever to scrape it all.