There have been a few posts (one being mine) around concerning fears about "fluff" posts taking over. Solutions are ranging from allowing downmods, adjusting weighting algorithms, and even just blacklisting Reddit posts.
One thing I'd like to see before considering adjustments like this would be more detailed statistics about how people vote, how leaders vote, how karma is distributed, &c.
I feel like interesting data sets would be:
Across all users
Karma
Number of votes
Number of submissions
Times downmodded
...
Across all posts
Karma
Comments
Flag for whether it's considered "fluffy" (mod's discretion)
Flag for Reddit submission
...
Really, it could be a pretty big problem. It does seem like a potential playground of hacks and at least some of those statistics are certainly not difficult to mine (probably Arc one-liners).
So, if it seems interesting and there's not any issue with privacy (strip out usernames and it'd be hard to correlate anything beyond the leaderboard) is there any chance of seeing a hn-stats tarball?