Hi Perchance team!
I’ve been delving into the world of AI image generation, and have found that the Perchance generator is head and shoulders above pretty much everything else, and ridiculously fast. I’m very impressed, and was wondering if you might indulge me with a few questions?
To preface, I feel like you can’t be getting good value out of me as a user - I’m frequently generating a ton of images, and only seeing occasional ads. To that end, I’m trying to shift my heavier usage to my own local Stable Diffusion instance, but don’t seem to get the same quality of results.
Completely understand if you want to keep the secret recipe a secret - but if you’re willing to share - what checkpoints/LoRAs/etc. are you using? My best guess is SD1.5 + Reliberate, which gives me better results than the other checkpoints I’ve experimented with, but it’s definitely not the whole picture.
I also wanted to ask if you guys accept donations, or have a way for me to shout you coffee, beer, courvoisier, or whatever. :D
Yep SD 1.5, and you should be able to replicate the text-to-image-plugin’s results locally by just following vanilla tutorials on /r/stablediffusion or youtube with pretty much any of the top models on civitai - I’m not doing anything special. Your local results will actually end up being better than the plugin’s, because I have a stupid amount of regex and stuff trying (and somewhat failing) to prevent the model from creating oversexualised stuff for benign prompts, and that almost always comes at a cost of quality/coherence. I’m not the best person to ask about troubleshooting local setups, but I’d just advise that you follow a tutorial/guide exactly to start with, and then once you’ve replicated what they’ve shown, you start exploring your own prompts, tweaking parameters, etc.
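(The plugin’s actual filters aren’t public, but the general idea of regex-rewriting prompts before they reach the model can be sketched roughly like this - the patterns and negative-prompt terms below are illustrative guesses, not the real rules:)

```python
import re

# Illustrative only, NOT the plugin's real rules: terms that tend to push
# SD 1.5 checkpoints toward oversexualised output for benign prompts.
RISKY_PATTERNS = [
    (re.compile(r"\bgirl\b", re.IGNORECASE), "woman"),  # swap loaded terms
    (re.compile(r"\bsexy\b", re.IGNORECASE), ""),       # drop trigger words
]

# Terms appended to the negative prompt to steer the sampler away.
SAFETY_NEGATIVES = ["nsfw", "nude", "revealing clothing"]

def sanitise(prompt: str, negative_prompt: str = "") -> tuple[str, str]:
    """Rewrite the prompt and extend the negative prompt before generation."""
    for pattern, replacement in RISKY_PATTERNS:
        prompt = pattern.sub(replacement, prompt)
    # Tidy up whitespace left behind by deleted words.
    prompt = re.sub(r"\s{2,}", " ", prompt).strip()
    extras = [t for t in SAFETY_NEGATIVES if t not in negative_prompt]
    negative_prompt = ", ".join(filter(None, [negative_prompt, *extras]))
    return prompt, negative_prompt
```

For example, `sanitise("a sexy girl on a beach")` would return `("a woman on a beach", "nsfw, nude, revealing clothing")`. Because the rewritten prompt is no longer exactly what the user asked for, this kind of filtering is also why quality/coherence takes a hit, as mentioned above.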
Thanks for getting back to me so quickly, even if it’s taken me so long to respond. Whatever non-special things you’re doing seem to work really well! It works a lot faster than my local instance (suboptimal video card to blame there) and I get really good consistent base results, which I can then pull into my own instance to do more fun stuff with inpainting, upscaling and the like. So hopefully that ad revenue is making it worth your while. :D
Yeah it’s definitely worth investing in a fast graphics card if you’re getting deep into AI stuff, but they’re pricey. Inpainting and image-to-image should be possible on perchance within the next month or so if all goes well. Ad revenue doesn’t cover all the server costs yet, so I pay for a portion of it out of my own pocket, but it’ll eventually be self-sustaining and it’s not ‘breaking the bank’ for me. Much closer to self-sustaining than it was 12 months ago when I made the plugin - the research community has made SD inference a lot more efficient.
@[email protected] Is there a roadmap or existing discussion anywhere about your experiments with the t2i plugin or where you might be going with it? Or for user questions/feedback/requests?
I’m currently getting some fantastic results with it “as is”… but additional options are always appreciated. =)
Best place is probably here on the lemmy community. I’ll post updates here when there are new features available (e.g. inpainting, image-to-image), etc.
Also, @[email protected] has some notes and interesting experiments (with linked generators to play with) here: https://perchance.org/learn-perchance-plugins-text-to-image