Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

For any book popular enough to have be put into GPT training data, you can get a summary without uploading it. The idea that these big, corporate models (their "intellectual property" if you will) have been trained on data they downloaded from ebook pirating sites... it's a real head shaker.

https://aicopyright.substack.com/p/has-your-book-been-used-t...



If these were scrappy start-ups looking to survive one can sort-of understand why they might bend the rules about copyright to train their models.

But we're talking about behemoth companies here, one of which deigns to make a pitch worth 7 Trillion dollars (10% of global GDP).

So... they're trying to rake in investment at levels that are unheard of in human history. Surpassing Apollo, the Manhattan project, the great pyramids of Egypt, the Great Wall of China, and anything else one might think of.

And they're not paying for the books ???




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: