
I'm pretty sure all the Twitter datasets violate Twitter's ToS.


On a quick pass of the Twitter datasets, they all seem to conform to Twitter's developer Terms.


Like the requirement that you delete tweets from your dataset once they have been deleted on Twitter?


As far as I could tell, none of them actually contain tweets (e.g. any JSON), just IDs, and mostly user IDs at that.



