Hello. I have asked this question on the subreddit, but was told to ask on here too so the dev can see it. I am not particularly tech savvy. I have recently come across Perchance which I have found useful to to create texts and images.

Even if the NSFW filter is disabled and NSFW material can be generated, do the text-to-text and text-to-image generators still prevent the production of illegal or harmful content?

I don’t want to try this for obvious reasons, but I am concerned that such content could inadvertently be generated. The obvious example is child and sexual abuse material, but I am also thinking of glorification of terrorism or genocide, promotion of self-harm, encouragement of violence toward others, etc.

Thank you!

  • Baŝto@discuss.tchncs.de
    link
    fedilink
    English
    arrow-up
    1
    ·
    8 months ago

    I’d say it does a good job at preventing you to explicitly generate such content, but not to generate by accident. The filters can only work when you ask for it, but not if the AI adds that part on its own.

    Further you can’t really 100% prevent somebody to utilise it to generate such content. You can’t create it with just the AI, but it could be used for a step. That’s not really an AI specific problem. With captions you can put otherwise innocent images into a completely different context. Images can be manually edited.

    I’ve played with editing for complex anime scenes. Generating a few characters, remove backgrounds quickly with software and then place them on a separately generated background. It’s hard to describe that for the AI and not confuse it, but you could also use that to alter the context massively. It gets ‘worse’ the better photo editing AIs get, because they’d have a hard time to filter as they’d need to first understand the content of an image and it’s implications.

    Rewriting texts is even easier.