The problem are "viral" licences. Must the code generated by an AI trained with ...

spwa4 · 2025-12-28T16:23:33 1766939013

> Also, can an AI be trained with the leaked source of Windows(R)(C)(TM)?

I think you mean to ask the question "what are the consequences of such extreme and gross violations of copyright?"

Because they've already done it. The question is now only ... what is the punishment, if any? The GPL requires that all materials used to produce a derivative work that is published, made available, performed, etc. is made available at cost.

Does anyone who has a patch in the Linux kernel and can get ChatGPT to reproduce their patch (ie. every linux kernel contributor) get access to all of OpenAIs training materials? Ditto for Anthropic, Alphabet, ...

As people keep pointing out when defending copyright here: these AI training companies consciously chose to include that data, at the cost of respecting the "contract" that is the license.

And if they don't have to respect licenses, then if I run old Disney movies through a matrix and publish the results (let's say the identity matrix)? How about 3 matrices with some nonlinearities? Where is the limit?

Since copyright law cannot be retroactively changed, any update congress makes to copyright wouldn't affect the outcome for at least a year ...