Why Nostr? What is Njump?
2023-01-07 02:59:37
in reply to

cody on Nostr: Yeah it's not a settled question. I'm not familiar with how Midjourney et al. scraped ...

Yeah it's not a settled question. I'm not familiar with how Midjourney et al. scraped all the images to use as training data. If they got an image from a website where scraping/saving/using a copyrighted work violated the terms of use, that's a clearer line, but I'd hope they had the foresight to know that would be an issue. For other more "open" images, it can depend on things like how the model is tuned, what the training actually extracts from the pictures, how the extracted info is stored, and how many images are in the model.

E.g., if I just trained a McDonald's logo generator and scraped 1,000 variations of the golden arches, that starts to looks like copyright infringement. But if those 1,000 images are part of a 100,000,000 image data set, it would look more like fair use. Mind you the question of whether a copyright violation exists for the person generating an image from a dataset is a separate analysis. This is all from the US-perspective though.
Author Public Key
npub1hhnmlmx6ttcwpjglfrsnglfa0tpczgr26mk5vzl9kwqs9h3sdhkq9fwm0z