Thread #9105298 | Image & Video Expansion | Click to Play
HomeIndexCatalogAll ThreadsNew ThreadReply
H
Previous thread >>9083124

> Chroma
https://civitai.com/models/1330309/chroma
> Z-Image
https://civitai.com/models/2168935/z-image-turbo
> XL models
https://civitai.com/models/575395/big-lust
https://civitai.com/models/573152/lustify-sdxl-nsfw-checkpoint
> ComfyUI
https://github.com/Comfy-Org/ComfyUI?tab=readme-ov-file#get-started
https://comfyanonymous.github.io/ComfyUI_examples/
> Wan UI
https://github.com/deepbeepmeep/Wan2GP

>Related threads
>>>/r/realistic
>>>/gif/vdg
>>>/g/sdg
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/trash/sdg
>>>/aco/futasdg
>>>/b/ai
+Showing all 237 replies.
>>
>>
>>
>>
>>
>>
>>
>>
>>9105339
>>
File: img.png (2.5 MB)
2.5 MB
2.5 MB PNG
>>9105367
>>
>>9105298
Why was the OP gutted down to useless civitai links?
>>
>>9105373
1. no one cares
2. people will read a short OP
3. you or whoever added outdated links with "might hold value", why even bother at that point

you can add your OP back if you want next month, i'm not your mom.
>>
>>9105373
the last thread is still available. furthermore, few people actually learn about ai tools. most are lazy and simply beg for images or videos. just check out other ai channels
>>
>>
File: klein9b.jpg (337 KB)
337 KB
337 KB JPG
>>
File: .png (2 MB)
2 MB
2 MB PNG
Don't get in the way of science.
>>
>>
>>
>>
>>9105377
>1. no one cares
silly to assume that
it's for new readers, not old
>>
>>9105829
what's this?
>>
>>
>>9106118
just a 5090 with 1 lora, low res, and https://github.com/shootthesound/comfyUI-LongLook
>>
>>9106130
wow
>Longlook
there's several ways to extend wan videos yeah (e.g. SVI Pro loras)?
this seemed like the sanest/minimalist method from what I saw, don't know about the results tho.
>>
File: klein.jpg (2.6 MB)
2.6 MB
2.6 MB JPG
>>9105317
the 4b model does a better job here for some reason
>>
>>9106123 >>9106130
This took ~10 seconds to gen? Damn.
>>
>>9106177
Anon...
>>
File: 0001.png (804.9 KB)
804.9 KB
804.9 KB PNG
>>9103733
It got worse.
>>
>>9103733
I'm too slow to understand.
Reverse search implying that they're not making AI art, they're posting "real 3D art"?
A model's face not coming up? Why wouldn't you NOT get sloppa if you're searching sloppa?
>>
>>9106194
It means that newer image gen models have more complex compositions and higher levels of detail,
so something like Yande* brings up real porn or 3D art because there's not enough slop that looks like that. This isn't true for SFW images.

But it also means even if you use a real image, if it was mass imitated by simpler slop models then your search results are already polluted.
>>
>>9106189
If you want to know what kind of models and prompts people are using to generate their images, the only thing you can do is ask the poster to upload a version of the image with metadata to catbox. Reverse searching isn't going to give you shit.
>>
>>9106200
>>
>>9106251
didn't mean to quote but alright
you can imitate anime images too if you wanted
>>
File: .png (2.4 MB)
2.4 MB
2.4 MB PNG
>>9105587
>>
>>
>>9105587
>>9106263
are these quen-edit?
>>
Not born
SHIT into existence
>>
>>
I've been recreating my old workflow in comfy and I'm thinking about testing newer models as refiners (still gonna use pony base for the xxx) - any recommendations? I'd rather not download triple digit GBs of models if there's one or two that outshine the others - I'm looking for a good refiner (if there is such a thing, I know back in the day it didn't work because other models wouldn't know the poses and/or genitalia details) and a good face detailer - I can live with just finding a good face changer that improves over ponyface without resorting to loras that 99/100 times are blurry due to source data being blurry.
>>
>>9106491
It'll be the new Flux 2 Klein model, probably the 9B but maybe the 4B. You'll want the one just named Flux 2 Klein which is the turbo distill for 4-steps (better with 6-8 though), not the "-Base" named one which isn't really intended for inference use. Very good (multi-)image editing and a very good t2i model too, it's a bit fucked around genitals but it's reasonably compliant around posing, breasts, and simple genitalia (though all still a bit dubious quality until we get nudity loras or a finetune).
>>
>>9106649
Your shit looks the same as the stuff posted 1 000 000 times here and everywhere else for years now.
That's what I hate in A.I.: People like YOU.
>>
>>9106731
What sort of improvement do you have in mind?
>>
>>9106744
Probably just the usual complainer >>9088883
He's gonna tell you to use some shitty micro-scale finetune that makes everyone look like the final fantasy movie.
>>
>>9106655
any idea why I keep getting "Expected size for first two dimensions of batch2 tensor to be: [64, 128] but got: [64, 32]."? Tried both base and distilled, using correct encoder/VAE/latent image, error occurs in 5 different workflows at KSampler step whether default or advanced, image size irrelevant. only relevant search result seemed to finger a recent comfyui update - is your flux 2 klein workflow working right now?
>>
>>9106649
First recommendation would be to stop using Pony, move to illustrious, noobAI or Chroma. But suppose you can use those as refiner too if you really wanted. In the case of Chroma, it's a different architecture so can't do the true refiner flow just img2img of a finished pic.

Faces are a lot easier, unless it's a blowjob or something you're not limited to just nsfw models. That would mean z-image, qwen, flux2. Cannot say which is best for this use case, I mostly do cosplay shit where they all suck.
>>
File: 1.jpg (107.9 KB)
107.9 KB
107.9 KB JPG
>>
>>
File: mona.png (2.6 MB)
2.6 MB
2.6 MB PNG
>>
>>
>>9106788
> update_comfyui.bat
It works with the workflow from the template browser.
>>
>>9105577
>>9106189
Enjoy not being able to tell if anything is an edit ever again.
>>
>>
>>
>>9106140
didn't jolie wear fake tits for the old tomb raider movie? she's going to need them.
>>
>>9107956
>>
>>
>>
>>9106788
in case anyone else is hunting for this when trying to run Flux 2 Klein, it's one specific thing: in settings for VHS custom nodes turn off "Display animated previews when sampling". Restart and enjoy.
I haven't even started testing integration into other workflows or upscaling/refining but it's impressive image modification
>>
>>
>>9108500
your shit is fried dawg, that's a distilled model, low steps low cfg
>>
File: mona1.png (1.3 MB)
1.3 MB
1.3 MB PNG
>>9107094
>>
File: mona2.png (1.5 MB)
1.5 MB
1.5 MB PNG
>>
>>
Maybe I'm finally starting to figure out Chroma
>>
>>
>>9105298
>4 years in and they still can't get her outfit right because the cosplayers it's stealing from never do either
And people really want to use this shit for professional commercial purposes.
>>
>>9108777
I'm guessing the source is probably cartoon/anime fanart and not irl cosplay. But either way, her having 3+ canon costumes with the same color theme and overlapping features doesn't help. That always confuses AI.
>>
This will get me branded and mocked, but what are some models close in quality to what Grok spews out? I'm trying to go artisanal since they are killing even bikini pictures now. Thanks, kings.
>>
>>9108777
>professional commercial purposes.
Oh fear not, ad companies have more than enough contempt and a prostitute's dignity, so they already got started yesterday (see: the Christmas Co*a C*la ad).
>>
>>9108852
I'm not sure what exactly you want, "quality" looks a lot different in AI than it does in art or photography.
>>
>>9108862
Dude what are you talking about. He means Grok, the equivalent online slop service.

Anyway, go to /g/ if you want to look at what the latest local models can do since images are more varied there.
>>
>>9108879
But he mentioned artisanal and quality, not one of grok's actual advantages.
>>
File: smut.jpg (504.2 KB)
504.2 KB
504.2 KB JPG
>>9108862
Sorry, should've given an example. Something like this I guess.
>>
>>9108892
there's nothing more artisanal than getting a girlfriend
>>
Imagine getting dragnet range banned in 2026 with these captchas. Couldn't be me.
>>
File: 1.jpg (162.1 KB)
162.1 KB
162.1 KB JPG
>>
>>9108901
Already there friend, I'm just generating images for fun, that's all. I was just hoping anyone could point me to some models close or better in quality than that. I've played around with ZIT and I had some success, but I don't think it's quite like what I'm after. Same prompt.
>>
>>9108993
Same prompt, far lower resolution.
>>
>>9109009
I'm still new, just doing quick tests before upping the resolution and/or passing it through an upscaler.
>>
>>9108993
Might as well try Chroma while you're at it, and gen actual nudity instead of bikinis.
>>
>>9109049
uh no as someone warned >>9096235
zit is the best of what's available for speed and simplicity. XL is also better than chroma for a [degenerate] beginner.
>>
>>
>>9109023
keep the prompt simple, do small batches in lower resolution, then pick the better ones and pipe them through img2img.
that's when you enhance your prompt by small details like facial expressions, pubic hair, etc. since you don't want to confuse the model with these beforehand.
>>
>>9109057
But if he's coming from grok and z-image, chroma at least pretends to understand natural language. SDXL is either braindead or entirely dependent on using Danbooru tags, based on which one you pick.
>>
>>
>>
>>9109065
Eh, get better results with a semi-natural syntax on SDXL than with pure tags.
>>
>>9109532
Concept-wise, there's no way. But they do lean more towards anime visually and anatomically, if you use too many tags at once.
>>
i *would* prefer my pics a bit less glossy, but my potato can't run the better models or filters.
>>
>>9109559
Judging by the scores you're still using Pony, consider moving to illustrious or noobai. Same hardware requirements, similar prompting style.

Could also try dropping some or all of the score tags. The way they improve "quality" is mainly through style bias, and you have a different style in mind than what score_8_up wants to look like on the Pony base.
>>
>>9109574
yeaah... the relevant word is "potato". it can handle euler ancestral at max 25 iterations, and the illustrious-based models i looked at tell me to use karras at 40, which would triple the computing time for me.
didn't have a closer look at noobAi yet, at first glance it seemed anime only.

oh, and by potato i mean: i render on cpu.
>>
>>9109657
That's just people being retarded. With the exception of lightning distills, speed loras and similar; samplers and schedulers work the same regardless of model. Whether it's SD1.5, or XL, or Flux, or Qwen, whatever, you can run it at 20-step euler with no issue.
>>
>>9109574
i don't know... i removed the scores, and now she looks at me like she has an icepick hidden somewhere.
>>
File: _.webm (2.9 MB)
2.9 MB
2.9 MB WEBM
>>9108259
get a room llamas
>>9109657
>i render on cpu.
are you brazilian or something anon?
a used 3060 ti or whatever isn't horrendously expensive in burgerland.
>>
>>9109845
when i assembled my pc 8 years ago, before AI, i didn't have gaming in mind... i didn't just skip the big graphics card, i also dimensioned the power supply accordingly, i.e. too small for adding one later... and now i consider it too much of a hassle. *shrugs*
>>
>>9109855
Well if all you're genning are latex maids standing around, you could even go back to SD1.5. Get a 5x speedup if not more.
>>
File: .gif (2.6 MB)
2.6 MB
2.6 MB GIF
>>
>>
>>
>>
>>9110293
>>
>>9110294
>>
>>9110293
>>9110294
>>9110295
Now that's the good shit
>>
>>
>>9110294
hot, what model did you use for this?
>>
>>
>>
>>
>>
>>
oh well, on with the latex maids ^^
i remember sd1.5, looked like crap compared to pony. *but* i found a lcm lora that *seems* to work so far.
>>
>>9110462
ARAZ
I don't think it is on civitai.
>>
>>9110486
"If you don't stop I'm going to write a song about this!"
>>
>>9106140
What are you doing to make these? Can you go over the steps please? They look great.
>>
>>
>>9109855
>8 years ago
anon you survived the gpu/sdd cryptoslop crunch and the gpu ai slop crunch and now it's the apocalypse.
>>
Did grok change the ability to do full body bikini from images or the like just like one days ago or am I crazy?
Some happening I missed?
>>
>>9111065
>>9110822
>>
File: nigel.jpg (146.2 KB)
146.2 KB
146.2 KB JPG
>>9110914
if you mean the face swap, it's just the klein edit models.
you can find workflows in comfyui templates browser if you've updated.
>>
>>
File: 1.jpg (130.3 KB)
130.3 KB
130.3 KB JPG
>>
>>
>>
>>
>>
>>
>>
>>
comic pages are a goldmine but i need to set up regional prompting so I can prompt details of each panel separately
>>
>>9112660
>>9112660 (You)
>>9112767
Nice, but its always the hands/fingers that get fucked up.
>>9112789
Maybe inpaint would work better?
>>
>>9112796
yeah that chunli hand was about the best out of a few attempts, the age-old struggle. inpainting is fine but I might as well just start with inpainting each panel and reassembling if i'm gonna do that
>>
File: img.jpg (962 KB)
962 KB
962 KB JPG
>>
File: img2.jpg (317.7 KB)
317.7 KB
317.7 KB JPG
>>
>>
for the next OP:
> Flux 2 Klein https://docs.comfy.org/tutorials/flux/flux-2-klein
>>
>>9112979
oo these are way too slopped up.
it's very good at certain kinds of edits, but "make real" and other altering prompts is not one of them.
>>
>>
With local models and the jazz y'all are running, can you get this sort of detail with simple prompts like you can with Grok? Or is it a lot more buy-in and time to get this sort of result
>>
>>
Are there any boorus for uploading and viewing realistic image content? For mass tagging and the like
>>
>>9113162
>boorus
Don't think so.
There's a smattering of random porn websites where users upload their slop by the dozens of galleries every day though.
>>
>>9113034
>this sort of detail
The amount of detail was not an issue three years ago, you can always prompt more stuff just for the sake of it, or slap on a detail lora that will make all textures more complex and add shit like dust particles or dripping fluids or whatever.

Takes some figuring out how to get the scene you want because the models don't really understand language, only short phrases or tags. But I'm guessing it's no more effort than figuring out how to skirt around the censors online.

Maybe rephrase the question?

>>9113162
https://realbooru.com
Better than nothing, but it's shit. There's no tag wiki or other guidance on how a specific tag is supposed to be used, and no moderation/enforcement either.

If one person tags "huge breasts" when they're big for anime standards and another person when they're bigger than his ex-gf had, then the model ends up jumping between c-cups and k-cups and it's all pointless. Not to mention people making up random tags like "performing_fellatio_on_male_lying_on_back_while_bent_over_beside_him" which would only serve to confuse the model.
>>
>>9113164
The purpose is tag standardization, which would make training these small local models easier. It's why people keep merging in Pony or illustrious, even though those were only meant for cartoons and anime. Because at least they understand camera angles, composition, tons of clothing and hairstyles, two dozen sex poses, etc.
>>
>>9113176
What do you mean? There are already sophisticated VLMs for tagging in booru styles or natural language. Maybe they're not entirely consistent, but that doesn't appear to be the bottleneck.

The issues come from the actual challenge of training the models and also how realism datasets appear to cause unique problems compared to only 2D images.
>>
>>9113188
You say that as if's ever been attempted. afaik Chroma is the only realism tune that listens to tags natively without merging. And all of that comes from the danbooru portion, judging by how it starts to struggle with realism the more tags you use.
>>
>>9113205
>attempted
Yes. Have you used bigASP? Did you use the models that came before it? The difference is apparent.
>>
>>9113208
Not since v1. Thanks, I'll take a look.
>>
>>9113208
BigAsp v2, I tried the simplest test I could think of:
>score_9, 1girl, 1boy, pov, sex, cowgirl position
Success rate is less than 20%, giving reverse cowgirl position or third person view, usually both.
>>
I think this is the best place to ask: are there any base models which approach dalle3 capability in doing soles of feet? Chroma is alright but nowhere near dalle3. Other models I've tried are just plain atrocious.
>>
>>
>>9113344
Chroma is a finetune, the base model is Flux
but no

Since it's technically SFW you can try the other big bases, z-image, Qwen image, Flux2
>>
>>9113361
Chroma is so different from flux that it acts like a different base model. Flux is the same as all censored models in that it cannot do soles ever in any scenario. Chroma can do them but not great. I tried qwen once and it was somewhat promising so maybe that's the way.
>>
>>9113334
oh, just omit the 1boy so the model doesn't try to show him. pov should suffice.
>>
>>9113334
If you want to prevent reverse cowgirl position, you should add nipples, navel and some facial expression like open mouth.
>>
File: qwen.png (1.8 MB)
1.8 MB
1.8 MB PNG
>>9113173
>the amount of detail was not an issue three years ago
uuuh lol. come on.
picrel isn't even upscaled.
people are posting 4k jpgs >>9105307
>>
>>9113344
i looked at a couple of old dall-e /aco/ threads, just long enough before my brain leaked out my ears.
in fairness, like I read someone say, a lot of AI use seems like self-inflicted brain damage, but it's more understandable if you view it as people puffing magic smoke repeatedly with no open window.

anyway old dall-e 3 is still a better footfag model. and it's still a massive image model compared to anything available for local, maybe with the exception of Flux 2?
and the training material they used at the time probably had less filtering.
>>
>>9113475
I'm talking about the amount of detail, not quality. If you just want a ton of crap in the image instead of flat surfaces. SD1.5 had an issue with it even, too much detail that often made no sense. And if you upscaled it to 2K, or 4K, it would only get worse as it added more.
>>
File: 00.jpg (1.4 MB)
1.4 MB
1.4 MB JPG
>>9113475
half the dall-e images posted back then looked like this
>>
File: wan.jpg (416.7 KB)
416.7 KB
416.7 KB JPG
>>9113536
>I'm talking about the amount of detail, not quality.
yeah dude, only you know what this means.
detail ≠ noise/artifacts
>>
File: 87201.jpg (1.4 MB)
1.4 MB
1.4 MB JPG
>>9106140
>klein
yes, this is impressive, but why the hell can't i rip off vermeer or monet accurately? everything is folded into generic style tags. what, stealing from the long dead is as unethical as doing it to mickey mouse or what?
>>
Anyone have advice on adding cum and condoms onto a girl? This is for ComfyUI and Z-Image Turbo.
The output I have is really just a bit of water.
>>
>>
>>9113685
Could try running the inpaint with a different model, z-image is censored.
>>
>>9113780
>z-image is censored
Huh? Since when?
>>
>>
File: .jpg (487.6 KB)
487.6 KB
487.6 KB JPG
>>
>>
File: eileen.jpg (412 KB)
412 KB
412 KB JPG
>>9112990
>certain kinds of edits
>>
File: img.jpg (171.1 KB)
171.1 KB
171.1 KB JPG
>>
>>9113358
Model? Or can you share it with workflow?
>>
>>9109057 >>9096235
Non-turbo Z-Image getting the prompt here.
>>
>>9105302
Getting real heavy Animorphs vibes from this.
>>
>>
File: z-image.png (3.9 MB)
3.9 MB
3.9 MB PNG
>>
>>
>>
>>
>>
>>
>>
>>
>>9116420
Box? How did you glitch out?
>>
>>9110293 >>9116528
there's no "prompt" to catbox, just use natural language.
>>
Klein 4B fails to edit the orb in this image after several attempts. 9B gets it.
>>
File: original.jpg (53.5 KB)
53.5 KB
53.5 KB JPG
>>9116955
>>
>>
>>9116865
I mean what model, prompt for other things like style, is this i2i and so on. With Klein and natural language just one word can change or break output.
>>
>>9117001
>I mean what model
flash chroma with a character lora. the style is just from the lora and basic tags like "glossy" or "realistic 3d".
klein 9b changed almost the entire image
https://litter.catbox.moe/n8arabu1zfsmbzud.jpg
>With Klein and natural language just one word can change or break output.
it seems that klein needs specific resolutions or it can shift and warp the image. idk if it has to be divisible by 32 or whatever. if res is too low and maybe too high it can also shift colors.

assuming none of that is a problem, if it changes the entire image it's because your prompt is too vague, too general, poorly described or it just doesn't understand. and it's ultimately RNG.
it also matters if you're using the 4b model that will struggle more.
>>
>>9109845
>>
Anyone knows what AI adds a diamond watermark?
Saw it twice now.
>>9117652
>>
>>9117658
Nano Banana.
>>
>>
>>9117618
showing this to the next guy who asks why the cheap laptop he bought came with 2GB of RAM and no screen.
>>
>>
>>9114321
Computer disengage safety protocols and disable all power limits.
Generate nude pregnant Aerith sequence for the next 24 hours and forward all my calls.
>>
>Adult mediocre diffusion.
>>
>>9118075
Let's see your gens and something that resembles a joke next time, big girl.
>>
>>
>>9118078
You didn't say I was wrong.
>>
>>9118083
>>
>>9118085
You forgot to post your Adult not-mediocre diffusion image.
>>
>>9118086
>>
>>9118089
I am a "lurking" kind of guy.
But there isn't much to see.
>>
>>9118091
Seems like you're also a "not posting anything worth seeing" guy.
>>
>>9118083
And she was never seen again.
>>
>>9118094
Sometimes I remember that dolphin-on-human sex is part of recorded US history.
>>
File: img.jpg (311.9 KB)
311.9 KB
311.9 KB JPG
>>
File: _.png (1.2 MB)
1.2 MB
1.2 MB PNG
>>9118044
these edit models just partly solved the image layer problem. didn't see that coming at all.
>>
>>
File: .gif (3.4 MB)
3.4 MB
3.4 MB GIF
>>
>>9118231
Bro she's got dem michigan j. frog legs.
>>
File: br.png (1.6 MB)
1.6 MB
1.6 MB PNG
>>9105298
>>
>>9118546
noice
>>
File: 342043393.jpg (437.6 KB)
437.6 KB
437.6 KB JPG
>>
File: .gif (2.5 MB)
2.5 MB
2.5 MB GIF
>>
>>9117232
Thanks!
>>
>>
>>
File: klein_001.jpg (414.1 KB)
414.1 KB
414.1 KB JPG
>>
>>
>>
>>
is there another board I can share realistic gens? I feel everyone is focused on drawings...
>>
>>
>>
>>9120382
Would you be able to do a version, the same woman as exactly as possible, where she has a gown, maybe transparent, over the leotard?
>>
>>9120206
There isn't, unless you want to hang out with lolicons on /b/. /s/ and /hc/ don't accept AI gens because they classify them as fakes.
>>
>>9120441
That's too bad. I have outgrown /b/, I think. I guess CivitAi it is for me...
>>
>>9120481
>I have outgrown /b/
>CivitAi it is for me
>>
File: .gif (698 KB)
698 KB
698 KB GIF
>>
>>9112869
Has anyone tried genning stereoscopy like picrel

Reply to Thread #9105298


Supported: JPG, PNG, GIF, WebP, WebM, MP4, MP3 (max 4MB)