this post was submitted on 19 Sep 2023

Police investigation remains open. The photo of one of the minors included a fly; that is the logo of Clothoff, the application that is presumably being used to create the images, which promotes its services with the slogan: “Undress anybody with our free service!”

[–] [email protected] 2 points 2 years ago (1 children)

The crux of course is that your average Instagram image looks not much different

And I believe that is the reason why these AI images look like this: the models were trained on Instagram and Instagram-like social media images, so that's what they can reproduce.

[–] [email protected] 2 points 2 years ago

It's part of the reason. Another big issue with current models is that their language understanding is very primitive. They can handle something like "person standing", but "one person with a red hat sitting and another standing behind them with a blue shirt" already fails. There will be red and blue things in the image, but they'll be spread fairly arbitrarily across it rather than attached to the person the prompt assigned them to.

That said, this won't last for long. With Segment Anything we have AI that has a very good understanding of what is in an image, which should make training on arbitrary images much easier, as well as editing. We also have lots of research going into video, and plenty of more powerful language models that just haven't been integrated into image generation yet. Even just ControlNet and some inpainting can overcome most of those issues; it just takes a bit more manual work than a text prompt. There is also DragGAN, which is an incredibly powerful drag-and-drop approach to AI image editing, but because it uses a completely different approach from Stable Diffusion, it is not yet properly integrated into other tools.
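
To make the "bit more manual work" part concrete, here is a toy sketch of the masking idea behind inpainting: someone marks a region by hand, and only that region is replaced with newly generated content. This is an illustration under simplifying assumptions, not how Stable Diffusion or any real tool implements it. The function name `composite` and the tiny nested-list "images" are hypothetical stand-ins.

```python
# Toy illustration only: real inpainting pipelines work on image tensors or
# latents, not nested lists. All names here are hypothetical.

def composite(original, generated, mask):
    """Keep original pixels where mask is 0, take generated pixels where mask is 1."""
    return [
        [g if m else o for o, g, m in zip(o_row, g_row, m_row)]
        for o_row, g_row, m_row in zip(original, generated, mask)
    ]

# A 2x3 "image" and a stand-in for freshly generated content of the same size.
original = [[10, 10, 10],
            [10, 10, 10]]
generated = [[99, 99, 99],
             [99, 99, 99]]

# The mask is the manual step: a person marks which pixels may change.
mask = [[0, 1, 1],
        [0, 0, 1]]

result = composite(original, generated, mask)
print(result)
```

The point of the sketch is that the mask, not the text prompt, decides where changes land, which is why this route sidesteps the problem of attributes ending up on the wrong person.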