Hacker News new | past | comments | ask | show | jobs | submit login

Is there actual case law that prohibits the use of copyrighted material for corpora and other training data?

Sure distribution can have issues, but do you have any references for simple possession as training and test data?




If you're gathering data for your own business, it would be difficult for anyone to know, sure.

But if the data/analysis is published, then the data source would need to be disclosed.

See more on the OKCupid case I mentioned above: http://www.vox.com/platform/amp/2016/5/12/11666116/70000-okc...




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: