Hacker News new | past | comments | ask | show | jobs | submit login
Y Combinator Dataset Of Users Version 1.1
19 points by xirium on May 7, 2008 | hide | past | favorite | 8 comments
A 0.7MB archive of Y Combinator user profiles is available by accessing http://www.rushy.com/ycombinator-news-profile20080507.tar.gz

More to follow in the next day or so.




I see you have interest in search, I do too. Why don’t we get in touch we may share stuff.

edit: please ping me at okeumeni (at) intelliverb (dot) com


start publishing the same data for other sites and you'd have a pretty good business going ;)


how did you extract them all?


Uh, yeah. So, you've wondered why the site is slow sometimes...


Scraping in Perl. That's to follow in the next day or so.


how complete is this dataset? i count 7,164 users. is that all there is?


It includes all users who posted before Wed 7 May 2008. It doesn't have lurkers. Some profiles may be two weeks old. It is a more complete version of the previous version ( http://news.ycombinator.com/item?id=173045 ), which mostly excludes accounts which had only been used to post one or two items.


thank you.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: