CallFromMargin t1_j6hvui0 wrote on January 30, 2023 at 1:20 PM

Reply to comment by Ronny_Jotten in Microsoft, GitHub, and OpenAI ask court to throw out AI copyright lawsuit by Tooskee

The computer is not storing a copy of original work in trained model. It looks at picture, it learns stuff from it and it stores only what it learns.

Your argument is based either on fundamental misconception on your part, or a flat out lie from you. Neither one casts you in good light

Ronny_Jotten t1_j6hzcpu wrote on January 30, 2023 at 1:50 PM

> The computer is not storing a copy of original work in trained model. It looks at picture, it learns stuff from it and it stores only what it learns.

Just because you anthropomorphize the computer as "looking at" and "learning stuff", doesn't mean it's not digitally copying and storing enough of the original work in a highly compressed form within the neural network to violate copyright by producing something "substantially similar": Image-generating AI can copy and paste from training data, raising IP concerns | TechCrunch

But regardless of whether it produces a "substantially similar" work as output, making a copy of the original copyrighted work into the computer in the first place is a required step in training the AI network. Doing so is only legally allowed if it's fair use. That was the question in the Google books case - it was found that the scanning of books was fair use, because Google didn't use it to create new books or otherwise economically damage the authors or the market for the original books. But that's not necessarily the case with all instances of making digital copies of copyrighted works.

> Your argument is based either on fundamental misconception on your part, or a flat out lie from you. Neither one casts you in good light

Well, you can fuck off with that, dude. There's no call for that kind of personal attack.

CallFromMargin t1_j6i4o2a wrote on January 30, 2023 at 2:31 PM

No, the fact that it's mathematically impossible to store that many images, and if done, this compression algorithm would violate laws of physics, means that it is not storing images.

It is impossible to compress 380tb of data to 0.04tb of data.

Ronny_Jotten t1_j6i68gn wrote on January 30, 2023 at 2:43 PM

And yet, the citation I gave shows Stable Diffusion obviously replicating copyrighted images from the LAION training set, despite your musings about thermodynamics. It may not store reproducible representations of all images, I don't know - but it unquestionably does store some.

In any case, it doesn't change the fact that copying images into the computer in the first place, in order to train the model, would need to come under a fair use exemption. For example, research generally does - but not in every case, especially if it causes economic damage to the original authors. In many countries, authors also have moral rights, to attribution, to preservation of the integrity of their work against alteration that damages their reputation, etc., which may come into play.

[deleted] t1_j6ibove wrote on January 30, 2023 at 3:21 PM

[deleted]