Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I'm thinking, what if someone would add in their blog's terms of use something like "the content on this blog is not for machine learning purposes"? Could they later sue Google if their site gets scraped by Google for LLM training?


A contract requires an agreement between two parties. I don't see how writing "the content on this blog is not for machine learning purposes" alone shows Google has agreed to your terms.


Next you're gonna tell me the copy-paste Facebook statuses that say Facebook doesn't have rights to your photos don't do anything!


If those terms of use were machine readable, this would have been equivalent to something like robots.txt, but for AI.

https://news.ycombinator.com/item?id=34324208

https://news.ycombinator.com/item?id=35886288


The US Supreme Court has already said no, see the case of LinkedIn vs. HiQ.

https://www.fbm.com/publications/what-recent-rulings-in-hiq-...




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: