Thread #108053187 | Image & Video Expansion | Click to Play


Showing all 288 replies.
>>
Anonymous
02/03/26(Tue)21:33:05 No.108053206 >>108053185
I'm not from St. Petersburg and you are replying to an anonymous 4chan post.
This proves one fact: most ocd trolls are 17 year old virgins.
>>
Anonymous
02/03/26(Tue)21:33:36 No.108053210 >>108053187
ty 4 bake
>>
Anonymous
02/03/26(Tue)21:34:16 No.108053214 >>108053206
did you just admit to being underage while posting on 4chan?
>>
Anonymous
02/03/26(Tue)21:34:17 No.108053215 >>108053187
> Do not engage with posts mentioning "debo", "ani" or "ran". These are troll posts.
Based.
>>
Anonymous
02/03/26(Tue)21:35:54 No.108053227 is klein 4B supposed to be this shit? can't even generate a proper selfie
>>
Anonymous
02/03/26(Tue)21:38:06 No.108053245 reminder, dont use sageattention in starting args if you try acestep, I was getting jibberish till I removed --use-sage-attention.
now it is VERY clear, and almost Suno quality. using comfy workflow and the non-AIO one.
>>
Anonymous
02/03/26(Tue)21:38:16 No.108053246 >>108053210
no problem
>>
Anonymous
02/03/26(Tue)21:38:37 No.108053253 >>108053214
You don't have anything compared to the original schizo. You are just a little baby.
>>
Anonymous
02/03/26(Tue)21:38:47 No.108053255 >>108053245
doesnt come even close to suno loser retard
>>
Anonymous
02/03/26(Tue)21:42:10 No.108053284 >>108053245
example: bill gates epstein song, used grok to generate lyrics. left the kpop description the same in the top node.
https://voca.ro/12VaksFm9iMn
>>
Anonymous
02/03/26(Tue)21:42:32 No.108053288 https://voca.ro/1mNw5x3sY7Ak
It's definitely not as good as Suno, but it's cool that it exists.
>>
Anonymous
02/03/26(Tue)21:44:50 No.108053303 >>108053284
same lyrics with "a slow rock ballad" in the top node.
https://voca.ro/17EKFTdU0qDi
>>
Anonymous
02/03/26(Tue)21:47:16 No.108053328 >>108053245
is sage just done for at this point?
>>
Anonymous
02/03/26(Tue)21:47:32 No.108053333 okay dont use randomize for duration unless youre a retard like me. was wondering why it was taking longer than last time and the duration was set to 862
>>
Anonymous
02/03/26(Tue)21:47:44 No.108053338 Does batch size work for you?
>>
Anonymous
02/03/26(Tue)21:48:23 No.108053342 My first song ever... I kind gave up until 1:00 tho
https://voca.ro/14gqzALZkd3z
>>
Anonymous
02/03/26(Tue)21:48:41 No.108053345 >>108053328
Sage only has major gains for video. For image fast fp16 is miles better. Idk how is it with audio.
>>
Anonymous
02/03/26(Tue)21:49:28 No.108053354 Consider using the front end the ace step team built for the model. It can do covers and turn vocal tracks into polished songs.
>>
Anonymous
02/03/26(Tue)21:49:59 No.108053357 >>108053354
does it have inpainting?
>>
Anonymous
02/03/26(Tue)21:50:02 No.108053360 cumrag...
>>
Anonymous
02/03/26(Tue)21:50:09 No.108053362 >>108053354
link?
>>
Anonymous
02/03/26(Tue)21:50:21 No.108053366 >>
Anonymous
02/03/26(Tue)21:52:02 No.108053380 >>108053354
I used it. It works, but I wouldn't recommend it.
>>
Anonymous
02/03/26(Tue)21:52:27 No.108053384 >>108053354
can this be done in comfy?
>>
Anonymous
02/03/26(Tue)21:52:29 No.108053386 I hate 'thunar' it has nothing compared to explorer.
>>
Anonymous
02/03/26(Tue)21:53:23 No.108053395 >>108053386
just use a real file manager like doublecmd (totalcmd clone)
>>
Anonymous
02/03/26(Tue)21:53:35 No.108053396 >>108053386
Have you tried using a real DE like Plasma with Dolphin?
>>
Anonymous
02/03/26(Tue)21:54:19 No.108053403 >>108053227
you look like shit
>>
Anonymous
02/03/26(Tue)21:54:49 No.108053407 >>108053284
>used grok to generate lyrics.
vramlet or retard?
>>
Anonymous
02/03/26(Tue)21:54:57 No.108053408 >>108053396
I knew that the solution is always to change the distro. You are a techlet.
>>108053395
I don't care that much. File transfer logistics are an issue.
When it happens I want to kill someone.
I can use terminal and this fine but who the fuck wants to type all the time.
>>
Anonymous
02/03/26(Tue)21:55:32 No.108053413 so can ace step do speech and stuff or only music and singing?
>>
Anonymous
02/03/26(Tue)21:55:52 No.108053418 >>108053408
>changing DE means changing distro
it is you who is the techlet
>>
Anonymous
02/03/26(Tue)21:55:59 No.108053420 kek it can do synthwave, here is a song about bill gates and epstein. also, use grok to get prompts for song styles + lyrics.
https://voca.ro/15wCyq253sQe
>>
Anonymous
02/03/26(Tue)21:56:07 No.108053422 >>108053354
I already made all of those features work in comfy, and no I won't share fuck you
>>
Anonymous
02/03/26(Tue)21:56:48 No.108053428 >>108053396
Dolphin is actually worse than Thunar.
What also bothers me is the software's name. Who the fuck names a file manager as 'thunar'.
>>108053418
Oh sorry I didn't know you were supposed to wipe your pants.
>>
Anonymous
02/03/26(Tue)21:56:52 No.108053429 >kek it can do [blank]
i swear you are an organic bot
>>
Anonymous
02/03/26(Tue)21:57:35 No.108053441 our minds are like an ai model.
just as we write a prompt, to generate an image,
we think, say, feel, and do, to generate our lives.
what will your prompt be?
>>
Anonymous
02/03/26(Tue)21:57:53 No.108053448 so, is ace fun? can i make some perverted techno? i already have the lyrics. "s my ice cream..."
>>
Anonymous
02/03/26(Tue)21:58:08 No.108053450 >>108053441
tldr
>>
Anonymous
02/03/26(Tue)21:58:25 No.108053453 >>108053366
Now do Misaka and Uiharu and Accel
>>
Anonymous
02/03/26(Tue)21:58:39 No.108053458 >>108053448
dafuq is peverted techno?
>>
Anonymous
02/03/26(Tue)21:59:39 No.108053466 >>108053453
i haven't watched the anime
>>
Anonymous
02/03/26(Tue)22:03:49 No.108053495 https://github.com/ace-step/ACE-Step-1.5?tab=readme-ov-file#-installation
works well, gonna try their frontend for the inpainting/edit stuff
>>
Anonymous
02/03/26(Tue)22:04:48 No.108053504 >>108053284
>>108053303
kek. nice. why doesn't it finish the songs though?
>>
Anonymous
02/03/26(Tue)22:06:38 No.108053519 >>108053342
Based
>>
Anonymous
02/03/26(Tue)22:07:06 No.108053521 https://github.com/ace-step/ACE-Step-1.5/blob/main/docs/en/GRADIO_GUIDE.md
hmm looks like for now comfy's implementation is simple t2i
gradios (bleah) UI has inpainting, reference audio, cover mode and with BASE it has instrument add/segmentation.
but I hate gradio
also comfy can offload across gpu/cpu, their implementation doesnt.
and finally the 4b model isnt actually implemented in comfy.
SAD.
>>
Anonymous
02/03/26(Tue)22:09:48 No.108053533 Is MultiGPU broken with Klein for anyone else? Trying to offload the text encoder winds up partially loading the UNET and giving me an OOM the moment my gens finish, even though half my vram is free.
>>
Anonymous
02/03/26(Tue)22:09:48 No.108053534 >>108053521
seems like that might be better for my 4gb card though since it disables the LLM which takes about 18 minutes on my system
>>
Anonymous
02/03/26(Tue)22:11:36 No.108053541 >>108053521
Comfy did the absolute minimum for this lmao
>>
Anonymous
02/03/26(Tue)22:11:51 No.108053543 >>108053504
hey can you stop being creepy?
>>
Anonymous
02/03/26(Tue)22:12:11 No.108053545 Ace, everything works but the audio is empty.
Fuck this python shit I'm not going to trouble shoot why the audio file is empty. It actually calculates.
>>
Anonymous
02/03/26(Tue)22:14:08 No.108053562 >>
Anonymous
02/03/26(Tue)22:14:11 No.108053564 It is running on 4GB of vram:
>Warning: Ran out of memory when regular VAE decoding, retrying with tiled VAE decoding. Prompt executed in 69.03 seconds
>>
Anonymous
02/03/26(Tue)22:14:52 No.108053568 Anima is absolutely insane for something so lightweight. I only hope we will get it to prompt in higher resolution because upscaling nukes it.
>>
Anonymous
02/03/26(Tue)22:14:56 No.108053570 Nah, Ace sucks. Suno 0.5 tier.
>>
Anonymous
02/03/26(Tue)22:15:11 No.108053576 >>108053564
I mean I have 12GB of vram.
>>
Anonymous
02/03/26(Tue)22:16:32 No.108053586 >>108053545
my acestep was shit till I removed sageattention from my startup args.
use grok to make a song template or song + lyrics, ie:
(Verse 1)
Flying high on a private jet, secrets in the air,
Bill Gates, the corrupt globalist, doesn't seem to care.
Island breeze and hidden deals, under tropic skies,
Bill Gates, the corrupt globalist, wearing his disguise.
(Chorus)
Oh, he's off to Epstein's island, chasing what he sees,
Bill Gates, the corrupt globalist, living on his knees.
Girls are dancing in the shadows, whispers in the night,
Bill Gates, the corrupt globalist, everything's not right.
>>
Anonymous
02/03/26(Tue)22:17:22 No.108053592 >>
Anonymous
02/03/26(Tue)22:18:00 No.108053598 >>108053568
Some SDXL tunes are better and you can still use multiple characters.
Anima has lack of perspective and it doesn't understand backgrounds.
Test it with a random booru tag prompts and you will see what I mean.
And yet SDXL gen is <30 seconds compared to this even on low end hardware.
>>
Anonymous
02/03/26(Tue)22:19:14 No.108053606 >>108053586
It seems to calculate them but vae decode spits out an empty file.
Must be one of those ComfyUI(tm) Quirks.
>>
Anonymous
02/03/26(Tue)22:20:57 No.108053615 >>108053598
No SDXL finetune can handle multiple cutstom subjects unless they are characters. Not a single one of them.
And why the fuck would you booru prompt a natural language model?
>>
Anonymous
02/03/26(Tue)22:21:29 No.108053622 >>108053568
sonic, masterpiece, safe
>>
Anonymous
02/03/26(Tue)22:21:45 No.108053624 >>108053598
Lol aah reminds me all the cope I used to listen to why SD1.5 was so much better, nostalgic
>>
Anonymous
02/03/26(Tue)22:22:23 No.108053631 >>108053533
MultiGPU updates slow as fuck. It's was broken for a month, patched, then broken for another month.
>>
Anonymous
02/03/26(Tue)22:22:24 No.108053632 >>108053615
Anima is designed for tags. Maybe read the huggingface page. In any case, Qwen 0.6B is not that helpful lol.
>>
Anonymous
02/03/26(Tue)22:23:00 No.108053640 Many of you can't prompt for shit so I need to actually use the model before making a judgment
>>
Anonymous
02/03/26(Tue)22:24:37 No.108053647 my Mom said I'm a great prompter, I'll have you know
>>
Anonymous
02/03/26(Tue)22:24:39 No.108053649 >>108053632
>The model is trained on Danbooru-style tags, natural language captions, and combinations of tags and captions.
Oh really now?
>>
Anonymous
02/03/26(Tue)22:25:07 No.108053655 >>108053543
have I truly been so abominable?
>>
Anonymous
02/03/26(Tue)22:26:02 No.108053660 Klein is way too good at upscaling old XL gens
>>
Anonymous
02/03/26(Tue)22:26:12 No.108053662 >>108053586
It began to write the files after a reboot. Need to do a bigger test run. Maybe it was python cache or something.
>>
Anonymous
02/03/26(Tue)22:26:43 No.108053669 >>108053568
that skin color doesnt exist
>>
Anonymous
02/03/26(Tue)22:26:58 No.108053671 >>108053649
so far ive been doing tags combined with comma separated short phrases myself. pretty much what i did for illustrious but anima handles the short non-tag phrases way better
>>
Anonymous
02/03/26(Tue)22:27:06 No.108053672 >>108053640
>Anima
Good
>Acestep
Good
Thats my review right now and they can only get better from here
>>
Anonymous
02/03/26(Tue)22:27:14 No.108053673 >>108053671
bro that's crazy
>>
Anonymous
02/03/26(Tue)22:27:44 No.108053677 >>108053672
fuck you poor you are ruining it for everyone
>>
Anonymous
02/03/26(Tue)22:28:13 No.108053681 >>108053660
I don't think this is even SDXL. SDXL was more graceful. You are upscaling a chatgpt gen I suppose.
>>
Anonymous
02/03/26(Tue)22:29:20 No.108053686 >>108053681
its sd 1.5 retard
>>
Anonymous
02/03/26(Tue)22:29:24 No.108053687 kek
grok prompt: make a song about George Floyd doing too much fent that he got high and overdosed, with rhyming lyrics.
output for a slow, melodic synthwave song:
https://voca.ro/1g9wrs64VKPo
>>
Anonymous
02/03/26(Tue)22:29:38 No.108053691 >>
Anonymous
02/03/26(Tue)22:30:21 No.108053699 >>108053660
I've been using klein to cobble together character art by using blurry as fuck screenshots as character reference, the model is insane.
>>
Anonymous
02/03/26(Tue)22:30:48 No.108053702 >>108053686
I don't think so. Paid service image.
>>
Anonymous
02/03/26(Tue)22:31:05 No.108053706 >>108053681
No
>>
Anonymous
02/03/26(Tue)22:32:08 No.108053712 >>108053706
Ok, I believe. You used caps at least.
>>
Anonymous
02/03/26(Tue)22:33:05 No.108053721 >>108053687
thanks to acestep, I call this fentwave:
https://voca.ro/1gV4SnoxVvsA
>>
Anonymous
02/03/26(Tue)22:33:09 No.108053722 >>108053687
What does this tell you about yourself? You are a teenager <18 and spend your time on youtube listening to trash.
Jesus go home and tell your parents something, faggot.
You are the reason why 4chan sucks.
>>
Anonymous
02/03/26(Tue)22:33:32 No.108053729 >>108053671
Anima has strong full length boomerprompt adherence, IDK why anyone is claiming otherwise
>>
Anonymous
02/03/26(Tue)22:34:43 No.108053735 >>108053712
It was literally base SDXL with like 4 loras
>>
Anonymous
02/03/26(Tue)22:35:12 No.108053741 Pretty funny.
https://voca.ro/1kzK4q82Hkh2
>>
Anonymous
02/03/26(Tue)22:35:26 No.108053746 >>108053729
i tried a few full natural language prompts but only a short paragraph or so. im just not used to multi-paragraph huge prompts since i pretty much only use tag models
>>
Anonymous
02/03/26(Tue)22:35:26 No.108053748 >>108053722
it's worse because he is 30+ years old
>>
Anonymous
02/03/26(Tue)22:35:39 No.108053749 Anima needs to work on forge neo soon. On a unrelated note how close is to to Z in regards to prompt adherence?
>>
Anonymous
02/03/26(Tue)22:35:51 No.108053753 >>108053721
top node: a slow, driving synthwave song. ok, now THIS is fentwave.
https://voca.ro/11u9EFCq9j69
>>
Anonymous
02/03/26(Tue)22:36:01 No.108053757 >>108053729
*bleeds your prompt*
heh nothing personnel kid
>>
Anonymous
02/03/26(Tue)22:38:18 No.108053777 >>108053757
Legit skill issue, I don't think you have the hardware to run the model
>>
Anonymous
02/03/26(Tue)22:38:21 No.108053778 >>108053753
It has been calibrated for lack of balls music, trained on dataset without testosterone.
>>
Anonymous
02/03/26(Tue)22:39:27 No.108053783 >>108053749
fucking loving your gens for all the wrong reasons, my coom instinct is fully triggered
>>
Anonymous
02/03/26(Tue)22:39:36 No.108053785 says the shill crying about zit being slow
>>
Anonymous
02/03/26(Tue)22:42:33 No.108053801 bumped steps to 20 since it's already super fast.
a heavy metal song with drums and guitars.
https://voca.ro/1inqaeYxA5YI
>>
Anonymous
02/03/26(Tue)22:43:10 No.108053807 >>108053801
This thread is image related, why do you always talk music here?
>>
Anonymous
02/03/26(Tue)22:44:42 No.108053821 >>108053807
>Discussion of Free and Open Source Diffusion Models
>>
Anonymous
02/03/26(Tue)22:45:30 No.108053827 >>108053749
In my experience not as good, but still capable of parsing fairly complex prompts.
>>
Anonymous
02/03/26(Tue)22:45:42 No.108053831 >>108053366
i NEED the anon who made the y2k ZIT lora to come back and remake it for klein
>>
Anonymous
02/03/26(Tue)22:46:03 No.108053832 >>108053807
LOCAL DIFFUSION general. not local image general.
anyway. an uptempo anime song from the opening of an anime show with synths and guitars.
https://voca.ro/1amYCIdp6r44
>>
Anonymous
02/03/26(Tue)22:46:06 No.108053834 >>108053699
>>
Anonymous
02/03/26(Tue)22:46:39 No.108053839 >>108053801
Much better than what I had.
It still sounds like Midi convertions of metal songs of early 2000s
>>
Anonymous
02/03/26(Tue)22:46:54 No.108053841 >>108053807
Better than the other brain damage samefagging shit i have to scroll through here usually
>>
Anonymous
02/03/26(Tue)22:47:07 No.108053845 holy shit the gradio ui for acestep is so fucking CANCER
>>
Anonymous
02/03/26(Tue)22:50:40 No.108053867 They will call us lucky for having been here.
>>
Anonymous
02/03/26(Tue)22:51:04 No.108053872 >>108053832
Can this model generate made-up lyrics like in Nier songs?
>>
Anonymous
02/03/26(Tue)22:51:42 No.108053883 >>108053867
Thank you.
>>
Anonymous
02/03/26(Tue)22:52:11 No.108053887 >>
Anonymous
02/03/26(Tue)22:52:34 No.108053892 so you can apparently use acestep to mod songs
"It's just a process similar to img2img. Vae encode a song and use the latent to render with low Denoise around 0.25, I also increased the cfg a bit."
toss this into latent image instead of the default. set denoise to 0.25. seems to work.
>>
Anonymous
02/03/26(Tue)22:52:54 No.108053896 >>108053749
why is ani white
>>
Anonymous
02/03/26(Tue)22:53:08 No.108053901 is rannigger really going we wuz kangz mode?
>>
Anonymous
02/03/26(Tue)22:53:36 No.108053902 >>108053892
fucking stupid website captchas
anyways, do this.
>>
Anonymous
02/03/26(Tue)22:54:24 No.108053908 >>108053896
He is white, why would I not keep the character in line?
Refining the prompt
>>108053901
I don't think I'm a king but you seem to worship me so why not
>>
Anonymous
02/03/26(Tue)22:54:53 No.108053911 >>108053901
I'm his best friend and protector now.
>>
Anonymous
02/03/26(Tue)22:55:16 No.108053917 >>108053908
DAS RITE
>>
Anonymous
02/03/26(Tue)22:55:21 No.108053918 >>108053908
make him blue instead
>>
Anonymous
02/03/26(Tue)22:55:43 No.108053922 >>
Anonymous
02/03/26(Tue)22:55:54 No.108053923 >>108053908
Well said my friend. Let's concentrate on creating new images.
>>
Anonymous
02/03/26(Tue)22:56:37 No.108053928 >>108053832
>https://voca.ro/1amYCIdp6r44
ok nigga you got me
>>
Anonymous
02/03/26(Tue)22:57:50 No.108053936 >>108053908
then why is ran brown?
>>
Anonymous
02/03/26(Tue)22:57:52 No.108053937 https://vocaroo.com/17FBr0Hqcimc
>Heavy metal, slow.
>[Intro - Robotic Metal Riff]
>[Instrumental - oriental solo]
Yeah this model is a joke. Unless you have yodling japanese voices and drumbeat it doesn't do anything.
Thing is most people are as blind as they are musical which equals to none.
>>
Anonymous
02/03/26(Tue)22:59:01 No.108053943 >>108053918
I will make him a blue space slave at a later date, trying to think of a good setting
>>
Anonymous
02/03/26(Tue)22:59:33 No.108053948 >>108053902
and, if you put in M@GICALCURE! LOVE SHOT! by SAWTOWNE feat.Hatsune Miku
you will get Miku singing about Floyd using fent. I need to tweak the denoise and stuff. but you can edit songs with this/change lyrics.
https://voca.ro/15ugOpa1EuuZ
>>
Anonymous
02/03/26(Tue)22:59:45 No.108053950 >>108053943
keep up the good work king
>>
Anonymous
02/03/26(Tue)22:59:57 No.108053954 >>108052364
>Qwen Image 2512 fp8_e4m3fn
>4 seconds/step and 4 minutes for the whole image
>27gb of vram used during inference
I get 23.4GB VRAM usage when running qwen in comfy+linux, on my 7900 XTX, and that's with the desktop and other software in VRAM too. maybe more of the workflow like the text encoder is also being loaded in VRAM for you since you have more. My perf:
>100%|| 8/8 [00:22<00:00, 2.78s/it]
pretty good showing from the 9700, but I think the driver support needs improvement. the architecture of that card should allow it to perform better than it does.
>nice to know I'm not missing out by not being able to run larger models on my 16gb vram card at home
qwen image is capable of rendering things that other models simply can't handle. complex geometry, architecture, crowds, interactions, etc. and it's actually very fast with this lora:
https://huggingface.co/lightx2v/Qwen-Image-2512-Lightning/blob/main/Qwen-Image-2512-Lightning-8steps-V1.0-bf16.safetensors
>>
Anonymous
02/03/26(Tue)23:00:23 No.108053959 Repaint feature really needs to get here. I'm finding music variety rivalling Suno (given good prompts and all). On good seeds it rivals Udio. If you don't agree you haven't been around long enough.
https://files.catbox.moe/blln37.mp3
>>
Anonymous
02/03/26(Tue)23:00:59 No.108053963 >>108053908
>why would I not keep the character in line?
Why is your self insert white then?
>>
Anonymous
02/03/26(Tue)23:01:53 No.108053972 >>108053954
>hormone pills
kek
>>
Anonymous
02/03/26(Tue)23:01:55 No.108053973 >>108053959
Still cacophony without any music theory or even composition.
No wonder you are a chronic masturbator.
>>
Anonymous
02/03/26(Tue)23:02:06 No.108053975 >>108053943
fuck it turns me on insanely
>>
Anonymous
02/03/26(Tue)23:02:10 No.108053976 >>108053963
You seem to be of low IQ but high in rage
>>
Anonymous
02/03/26(Tue)23:03:30 No.108053991 >>108053976
High sub 70s?
>>
Anonymous
02/03/26(Tue)23:03:39 No.108053993 wait is ran brown irl?
>>
Anonymous
02/03/26(Tue)23:04:36 No.108054001 >>108053991
now we are being racist
>>
Anonymous
02/03/26(Tue)23:04:37 No.108054003 >>108053948
holy shit kek
8 steps, set cfg to 3.0. now skip to 30s to see how it swaps the lyrics. need to tweak more but it's funny.
https://voca.ro/1I7ddfbHLz5L
>>
Anonymous
02/03/26(Tue)23:05:07 No.108054005 >outs himself by using the same nickname he's assigned.
Low IQ and can't even troll correctly. Can't even make a gen because you lack the skill too
>>
Anonymous
02/03/26(Tue)23:06:51 No.108054016 >>108053993
somalian
>>
Anonymous
02/03/26(Tue)23:09:46 No.108054038 >>108053757
IDK what you're referring to
>>
Anonymous
02/03/26(Tue)23:09:49 No.108054040 >>108054005
fuck you be respectful
>>
Anonymous
02/03/26(Tue)23:21:25 No.108054130 Please help, never used comfy but I want to try AceStep1.5, it keeps hanging at VAEDecodeAudio "GET was unable to find an engine to execute this computation"
--lowvram works but takes 5minutes for 30 seconds on 16GB 4080
>>
Anonymous
02/03/26(Tue)23:22:35 No.108054138 >>108054130
okay
>>
Anonymous
02/03/26(Tue)23:24:12 No.108054146 >>108054130
https://github.com/ace-step/ACE-Step-1.5?tab=readme-ov-file#-installation Try this, it works on 12gb rtx 3060.
>>
Anonymous
02/03/26(Tue)23:25:25 No.108054160 >Years later
>Same group of failures mad that they can't match me
I even take multi month breaks and you losers can't make anything new or interesting. I'm not even going to waste my time to look at the /sdg/ miscarriages.
>>
Anonymous
02/03/26(Tue)23:28:33 No.108054184 >>108054160
your loss
>>
Anonymous
02/03/26(Tue)23:28:59 No.108054189 >>108054130
restart and try again, it is most likely a pychache issue.
New nodes don't refresh even if you click 'refresh' inside cumui.
>>
Anonymous
02/03/26(Tue)23:29:15 No.108054190 >>108053993
>be black irl
>generate pictures of yourself as a white slave owner
>portray your enemies as cotton pickers
its quite a display of psychological trauma,
>>
Anonymous
02/03/26(Tue)23:29:38 No.108054192 >>108053541
Bordering on absurdity compared to the functionality their own front end has.
>>
Anonymous
02/03/26(Tue)23:29:43 No.108054193 a 1980s pop song with synths:
https://voca.ro/19p6qt3QHU18
>>
Anonymous
02/03/26(Tue)23:30:12 No.108054196 I really didn't think ran was real, I thought he was some schizos hallucination, this is crazy
>>
Anonymous
02/03/26(Tue)23:30:53 No.108054200 ran won btw
>>
Anonymous
02/03/26(Tue)23:31:03 No.108054202 >>108054196
So, he made few images and now some off-site psycho is crying about it.
>>
Anonymous
02/03/26(Tue)23:31:09 No.108054203 >>108054160
I think I'm going to have to bust a nut to the next one just to worship your gens
>>
Anonymous
02/03/26(Tue)23:31:23 No.108054205 >>108054146
Already tried it but the gradio interface is dogshit and after doing "initialize service", "generate sample" and "generate music" the buttons for the "results" literally do nothing, on multiple browsers with no extensions etc.
>>108054189
I restarted multiple times and tried on the manual and portable installations of comfy, doesn't explain why it worked with --lowvram
>>
Anonymous
02/03/26(Tue)23:32:12 No.108054212 >>108054130
I get 60 seconds for a 2 minute song on 10gb
You are fucking something up
>>
Anonymous
02/03/26(Tue)23:32:49 No.108054214 >>108054205
Remove that --lowvram from your script.
This is mine
#!/bin/bash
python3 ./main.py --disable-manager --disable-manager-ui --disable-api-nodes --preview-method auto
>>
Anonymous
02/03/26(Tue)23:32:59 No.108054217 >>108054212
with --lowvram? The point is it doesn't work at all unless I do --lowvram which obviously is wrong
>>
Anonymous
02/03/26(Tue)23:34:13 No.108054225 I see links to songs and I ask myself. If this really worth a bare minimum two minutes of my time to appreciate? And the answer is usually no.
Your songs needs to be incredibly catchy and or funny to get my interest and I'm not going to open that link unless you sell it to me first.
>>
Anonymous
02/03/26(Tue)23:35:00 No.108054228 >>108054217
No extra settings it all loads into vram normally, downloaded the split files from comfy huggingface
Using the basic template they uploaded today
>>
Anonymous
02/03/26(Tue)23:35:00 No.108054229 if you do this you can clone/mess with existing audio but im trying to figure out the best settings. start at 0.25 denoise.
>>
Anonymous
02/03/26(Tue)23:38:10 No.108054246 >>108054225
Yeah basically this is worse than Microsoft's VibeVoice 7b. It was able to clone voices and be realistic.
This is just pomp pomp japan japan. It's not a model.
>>
Anonymous
02/03/26(Tue)23:38:13 No.108054247 >>108054203
based hornyposter
>>
Anonymous
02/03/26(Tue)23:38:20 No.108054248 >>108054205
These are the settings that worked for me.
>>
Anonymous
02/03/26(Tue)23:40:00 No.108054256 >>108054248
is that anistudio?
>>
Anonymous
02/03/26(Tue)23:40:55 No.108054265 >>108054256
kek
>>
Anonymous
02/03/26(Tue)23:41:53 No.108054276 >>108053973
It absolutely does have insane composition matching instructions on my lyrics. It's following the prompt precisely, just needs a repaint for 100% lyrics match.
https://files.catbox.moe/3i5itm.mp3
It helps that I can quickly iterate on songs based on a system prompt I made based on songs from their demo, here's what I'm giving Gemini
https://pastebin.com/Xt551MqD
Works like a charm.
>>
Anonymous
02/03/26(Tue)23:42:32 No.108054281 >>108054256
No it's my original character Amispoodeo.
>>
Anonymous
02/03/26(Tue)23:42:48 No.108054283 bloody nodes
>>
Anonymous
02/03/26(Tue)23:47:14 No.108054312 >>108054214
Launching Comfy with this worked.
1st gen: 120s audio in 105s. 2nd was 60s audio in 29s. 3rd 120s in 50s.
Thanks everyone
>>
Anonymous
02/03/26(Tue)23:50:03 No.108054322 >>108054246
There's nothing wrong with the quality of the music. It's just all so generic and dependent on the tastes of the individual that I fail to really care.
>>
Anonymous
02/03/26(Tue)23:53:34 No.108054339 >>108054283
jej
>>
Anonymous
02/03/26(Tue)23:53:54 No.108054341 >>108054283
unironically kino
>>
Anonymous
02/03/26(Tue)23:55:01 No.108054349 >>108054283
saar i have broduced most beutiful skin for the comyui
>>
Anonymous
02/03/26(Tue)23:57:07 No.108054360 >basically already here
>just a completely different model
>>
Anonymous
02/03/26(Tue)23:57:11 No.108054361 >>108054283
I kind of want this.
>>
Anonymous
02/03/26(Tue)23:58:26 No.108054371 >>108054283
I want this but with a sensible non-poojeet aesthetic
>>
Anonymous
02/03/26(Tue)23:58:43 No.108054375 >>108054361
if you build it sar they will bloody come
>>
Anonymous
02/04/26(Wed)00:02:10 No.108054400 >>108054276
No, sorry, it's all shit.
Sort of solves the normie question- you are as blind as you are amusical.
>>
Anonymous
02/04/26(Wed)00:04:25 No.108054414 are 60% of threads ran talking to herself?
>>
Anonymous
02/04/26(Wed)00:04:36 No.108054416 >ricing your workflow
>>
Anonymous
02/04/26(Wed)00:05:16 No.108054419 >>
Anonymous
02/04/26(Wed)00:05:24 No.108054420 >>108054414
https://www.youtube.com/watch?v=TBV8_0BqIw8&list=RDTBV8_0BqIw8
This my song.
>>
Anonymous
02/04/26(Wed)00:05:57 No.108054424 Where's the anon who said it couldn't do authentic chiptunes (for keygen music)
Well, I got some news for you anon-kun... It can do some killer keygen music!
https://files.catbox.moe/olocnw.mp3
>>
Anonymous
02/04/26(Wed)00:08:36 No.108054449 >>
Anonymous
02/04/26(Wed)00:09:00 No.108054454 >>108054276
>>108054424
One thing I forgot to mention is of course you should tell the LLM to give you appropriate duration in seconds, BPM and keyscale in addition to the prompt. Changing BPM based on pace of song does change it a lot and is crucial depending on what you're going for.
>>
Anonymous
02/04/26(Wed)00:09:37 No.108054465 >>108054449
Thank you for inspiring me.
I will post some new stuff tomorrow or something.
>>
Anonymous
02/04/26(Wed)00:11:19 No.108054477 >>108054449
put somebody whipping him as well
>>
Anonymous
02/04/26(Wed)00:12:03 No.108054485 >>108053288
>Is not better than Suno, goy
Kek shilleets are working hard. Go back fag, I can literally made a lora with real artist, meanwhile you are still cucked by Suno shit
>>
Anonymous
02/04/26(Wed)00:13:06 No.108054495 >>108054485
1.5 by itself it pretty passable. But with LoRAs the door is kind of open to anything, and it trains crazy fast.
>>
Anonymous
02/04/26(Wed)00:14:34 No.108054508 >>108054495
Is there guide on how to train somewhere?
>>
Anonymous
02/04/26(Wed)00:15:49 No.108054515 >>108053288
>It's definitely not as good as Suno
It absolutely is. Do you have access to v5? So far from my tests the musicality matches or surpasses v4.5, music sounds less slopped.
>>
Anonymous
02/04/26(Wed)00:16:12 No.108054522 >>108054508
Guides? Not that I'm aware of. I think it helps to caption the data using their captioner though.
Are you using their front end? There's a LoRA trainer attached to it. A few as 8 songs seems to produce pretty good results but idk, I haven't actually tried it yet. Gonna steal a bunch of stuff from troono when I get off work today and try it.
>>
Anonymous
02/04/26(Wed)00:16:43 No.108054527 >>
Anonymous
02/04/26(Wed)00:16:59 No.108054528 >>108054522
Thanks, I'll check out their front end
>>
Anonymous
02/04/26(Wed)00:19:07 No.108054543 LTX2 video extend is so much fun.
https://files.catbox.moe/a7a1ip.mp4
>>
Anonymous
02/04/26(Wed)00:19:18 No.108054544 >>108054485
He posted a bad gen. Probably prompt issue, or something else. Very easy to get good gens where you can clearly hear the lyrics with quality matching or surpassing Suno.
>>
Anonymous
02/04/26(Wed)00:19:41 No.108054547 >>108054527
>ldg
>1girl
Other than the lady on the right, nice gen.
>>
Anonymous
02/04/26(Wed)00:21:16 No.108054560 >>108054449
make him carry the platform with your throne on it. cant remember the exact name for it but its usually like 6 people doing it
>>
Anonymous
02/04/26(Wed)00:22:41 No.108054569 >>108054544
Model author himself says it doesn't surpass suno. But it is good enough and has enough tools that with a little tweaking, can suprass suno.
>>
Anonymous
02/04/26(Wed)00:22:55 No.108054573 >>108054477
It kind of struggles with that with the scenes I'm doing, I might do one off a black man whipping him or a NTR image series of Busty Aries (comfy mascot) is getting dicked down by a faceless man with corpo written on his face and Ani is seething/crying in the corner.
>>
Anonymous
02/04/26(Wed)00:24:29 No.108054588 >>108054573
Forgot image
>>
Anonymous
02/04/26(Wed)00:28:51 No.108054618 we're eating so good it's not even funny anymore
>>
Anonymous
02/04/26(Wed)00:29:12 No.108054622 >>108053827
THE MOON WAXES AN MUH MERCY WANES
nice. this is zit with a lora?
>>
Anonymous
02/04/26(Wed)00:31:17 No.108054640 ```Add a garish Anime Waifu theme to the entire ComfyUI user interface in image 1 while maintaining the original composition and layout.```
>>
Anonymous
02/04/26(Wed)00:32:57 No.108054657 >>108054640
bros... I kinda want this unironically...
>>
Anonymous
02/04/26(Wed)00:33:32 No.108054658 >>108054640
woah very pretty and comfy
>>
Anonymous
02/04/26(Wed)00:36:12 No.108054678 >>108054573
Yeah for a dynamic image like that I'll need controlnet.
I want to have the ultimate whipping position
>>
Anonymous
02/04/26(Wed)00:36:22 No.108054680 >>108053568
>I only hope we will get it to prompt in higher resolution because upscaling nukes it.
desu all it needs is a cnet. with a good prompt the only thing that really suffers are some background details
>>
Anonymous
02/04/26(Wed)00:36:28 No.108054681 >>108054640
You might have unironically stumbled on a thing people would want.
>>
Anonymous
02/04/26(Wed)00:37:25 No.108054687 >>108054640
WTF?!?!!? I love ComfyUI now
@ComfyAnon!!!!
Please this is a great idea!!!!
>>
Anonymous
02/04/26(Wed)00:37:29 No.108054688 Heart shitters on suicide watch right now after ace step.
>>
Anonymous
02/04/26(Wed)00:38:09 No.108054691 When the ai bubble pops will we finally be able to have good local models by renting gpu clusters for 10 cents/hour? please say yes I cant take this anymore
>>
Anonymous
02/04/26(Wed)00:38:52 No.108054693 acestep 1.5 can extend songs?
>>
Anonymous
02/04/26(Wed)00:40:54 No.108054714 you're a big guy.
https://files.catbox.moe/f6jgsu.mp4
>>
Anonymous
02/04/26(Wed)00:41:00 No.108054716 >>108054693
>acestep 1.5 can extend songs
No reason why it couldn't. No idea if it does it well or not.
>>
Anonymous
02/04/26(Wed)00:41:30 No.108054717 >>108054691
you can turn your house into a data center because gpus will be 10c each
>>
Anonymous
02/04/26(Wed)00:43:07 No.108054727 does AceStep have a way to add in sound effects with the prompt like gunshots?
>>
Anonymous
02/04/26(Wed)00:46:58 No.108054745 >>
Anonymous
02/04/26(Wed)00:47:09 No.108054747 I need Chroma Klein to be finished immediately.
>>
Anonymous
02/04/26(Wed)00:47:55 No.108054750 >>108054424
except that isn't keygen at all
>>
Anonymous
02/04/26(Wed)00:48:07 No.108054751 >>108054681
jej
>>
Anonymous
02/04/26(Wed)00:49:07 No.108054760 LTX2 likes to be stuck on the first frame for a full 3 seconds and then start moving at the very last second. What am I doing wrong now?
>>
Anonymous
02/04/26(Wed)00:50:08 No.108054767 >>108054760
you're using an image and not a video
>>
Anonymous
02/04/26(Wed)00:50:46 No.108054772 >>108054727
[Gunshots]
>>
Anonymous
02/04/26(Wed)00:50:54 No.108054773 >>108054640
kek
>>
Anonymous
02/04/26(Wed)00:52:11 No.108054779 >>108054760
Use the movement lora
https://huggingface.co/MachineDelusions/LTX-2_Image2Video_Adapter_LoRa/tree/main
>>
Anonymous
02/04/26(Wed)00:52:32 No.108054783 >>108054640
trans coded UI
>>
Anonymous
02/04/26(Wed)00:53:03 No.108054788 ltx model knows spongebob natively:
https://files.catbox.moe/ln833s.mp4
>>
Anonymous
02/04/26(Wed)00:53:27 No.108054790 >>108054779
Got a bad feeling throwing another 5gb LoRA into the mix isn't going to be easy on my 12gb card.
>>
Anonymous
02/04/26(Wed)00:53:40 No.108054793 >>108054788
we know
>>
Anonymous
02/04/26(Wed)00:54:46 No.108054798 >>108054790
I use it on 12 too, it doesn't really hurt that much since we are using a lot offload as it is
>>
Anonymous
02/04/26(Wed)00:54:52 No.108054799 >>108054788
bud can you at least test to see what other characters it knows?
>>
Anonymous
02/04/26(Wed)00:56:25 No.108054811 >>108054798
Certainly will try it. I'll let you know whether it worked for me someday when this gen finishes and the next one finishes
>>
Anonymous
02/04/26(Wed)01:00:16 No.108054829 Another thing I noticed is that Comfy seems to have messed up the speed for whatever reason. No idea why it's going through text-encode each time for whatever reason kek.
More keygen kino
https://files.catbox.moe/4kgqcy.mp3
Also this model absolutely can match Udio in musicality with prompts targeting the genre.
https://files.catbox.moe/vpcuf0.mp3
Gave this Udio 1.0 gen to Gemini asking it how to prompt it, then pass tags to my AceStep prompt template
This is one I got
https://files.catbox.moe/fkf9l9.mp3
Here's another which better matches lyrics (though it's more generic)
https://files.catbox.moe/u00pjf.mp3
>Model author himself says it doesn't surpass suno
Yes, he says it matches 4.5/v5 though when all the tools are leveraged, and since it's local we can surpass it because Suno is prone to censorship, it's gated/expensive per gen, and it also has no way to get specific voices or cover copyrighted songs like we do.
>>
Anonymous
02/04/26(Wed)01:02:47 No.108054844 >>108054527
```The woman in photograph image 1 is now completely surrounded by leering dark-skinned South Asian men. Maintain all other aspects of the composition and layout.```
>>
Anonymous
02/04/26(Wed)01:04:56 No.108054854 >>108054773
very punchable image.
I can imagine it doing slightly sped up Jim Carrey face expression impersonations to pitched up music.
>>
Anonymous
02/04/26(Wed)01:05:56 No.108054858 >>108054799
>bud can you at least test to see what other characters it knows?
Everyone knows test anon only tests one of like 6 individuals.
>>
Anonymous
02/04/26(Wed)01:10:41 No.108054887 >>
Anonymous
02/04/26(Wed)01:17:39 No.108054932 >>108054547
ty
>>108054844
>mfw trying to curry favor
>>
Anonymous
02/04/26(Wed)01:21:23 No.108054948 >>
Anonymous
02/04/26(Wed)01:21:37 No.108054949 Z is a great model
>>
Anonymous
02/04/26(Wed)01:22:33 No.108054952 >>108054949
getting ready to send the pyramids to earth?
>>
Anonymous
02/04/26(Wed)01:26:01 No.108054975 >>108054829
First one sounds somewhat similar to Plastic Love so I know at least that was in training data, which is great since that means it has high quality songs.
One thing it definitely needs a LoRA for is spaghetti western songs, I can't get it to do this Udio gen
https://www.youtube.com/watch?v=8moLFyfgUR4
The model likely has never heard it unfortunately due the dataset used.
>>
Anonymous
02/04/26(Wed)01:26:24 No.108054979 Just a heads up that ace step also has an optional 4B LM and a base model (most workflows use turbo)
https://huggingface.co/ACE-Step/acestep-v15-base/tree/main
If you want that see variety and shiet.
>>
Anonymous
02/04/26(Wed)01:27:11 No.108054982 >>108054844
>the woman is surrounded by 4 angry Italians because she ate the slice incorrectly
>>
Anonymous
02/04/26(Wed)01:29:15 No.108054997 >>108054979
I want a working remix/cover option
>>
Anonymous
02/04/26(Wed)01:32:09 No.108055010 >>108054997
Is this a comfy issue or is the gradiot interface not working too?
>>
Anonymous
02/04/26(Wed)01:33:10 No.108055015 >>
Anonymous
02/04/26(Wed)01:34:24 No.108055022 Anyone tried remaking the Ken-sama Go song in acestep yet? the original fucked up the pronunciation of nippon
>>
Anonymous
02/04/26(Wed)01:35:24 No.108055030 >>108055010
nta comfy just added text to music, They mention the other features on the blog page but say the community has to do them lol
>>
Anonymous
02/04/26(Wed)01:35:49 No.108055032 >>108054773
>>
Anonymous
02/04/26(Wed)01:35:56 No.108055034 >>108054979
>Just a heads up that ace step also has an optional 4B LM and a base model (most workflows use turbo)
How does loading the 4B LM work in comfy? Do we just load it by itself or do we include the other ones as well
>>
Anonymous
02/04/26(Wed)01:36:43 No.108055045 >>108055034
One or the other. They do the same thing. But the 4B has more Bs and is therefore better.
>>
Anonymous
02/04/26(Wed)01:40:18 No.108055067 >>108054982
kek
>>
Anonymous
02/04/26(Wed)01:42:50 No.108055074 >>
Anonymous
02/04/26(Wed)01:43:16 No.108055079 Chroma Kaleidoscope status check
not quite there yet but coming along
https://files.catbox.moe/3u8z4e.png
can almost do dog fucking a chick properly
>>
Anonymous
02/04/26(Wed)01:45:48 No.108055095 I feel like LoRAs for ACEStep would get taken down from sites like Civit and treated like celebrity LoRAs except even worse, so users would need tor resort to piracy to share LoRAs.
>>
Anonymous
02/04/26(Wed)01:48:13 No.108055107 the asian girl in image1 is wearing a white tshirt with the anime girl in image2 on it. her midriff is visible.
>>
Anonymous
02/04/26(Wed)01:49:48 No.108055120 >>108055095
come to this thread for now to discuss ace step 1.5 specifically:
>>108051632
gradio status: no amd support, rumor is Metal may work (???)
>>
Anonymous
02/04/26(Wed)01:50:40 No.108055128 hello, saars. can somebody make NAG work with anima???
>>
Anonymous
02/04/26(Wed)01:52:13 No.108055144 >>108055128
Why? Negative works?
>>
Anonymous
02/04/26(Wed)01:53:07 No.108055148 >>108055144
isn't nag better than negative most of the time?
>>
Anonymous
02/04/26(Wed)01:54:43 No.108055157 >>108055148
No NAG is a cope
>>
Anonymous
02/04/26(Wed)01:54:53 No.108055158 >>108055148
I don't think so (correct if I am wrong), It's just a hack to add negatives into Distill models, doesn't do much for base models.
>>
Anonymous
02/04/26(Wed)01:56:04 No.108055167 >>108055157
next you're gonna say negpip is cope also. negatives are not that good by themselves
>>
Anonymous
02/04/26(Wed)01:58:07 No.108055177 >>108055074
ani is coming out determined and stringer in these. like he could support a family
>>
Anonymous
02/04/26(Wed)02:00:56 No.108055195 >>108055177
he has to support his bull's offspring
>>
Anonymous
02/04/26(Wed)02:10:33 No.108055247 >>
Anonymous
02/04/26(Wed)02:12:47 No.108055256 I need a $9,999,999 graphics card, I am going to fucking die if I don't get a $9,999,999 graphics card
>>
Anonymous
02/04/26(Wed)02:14:25 No.108055264 >>108055256
GB200 NVL72 is what you're looking for, stay safe anon
>>
Anonymous
02/04/26(Wed)02:28:13 No.108055343 >>108055079
>not quite there yet but coming along
spoiler alert: This will forever and always be the status of Chroma models.
>>
Anonymous
02/04/26(Wed)02:31:50 No.108055365 >>108055195
sounds like a cope fantasy
>>
Anonymous
02/04/26(Wed)02:31:53 No.108055366 >>108055167
what do you expect from people who use euler as a sampler
>>
Anonymous
02/04/26(Wed)02:33:21 No.108055381 >>108054949
tried a recreation with Klein 4B Distilled
>>
Anonymous
02/04/26(Wed)02:34:03 No.108055384 >>108055381
neat
>>
Anonymous
02/04/26(Wed)02:34:12 No.108055386 >>108055343
nah, as long as the NSFW is there it'll be fine. Especially if distilled back into somthing like 4B Distilled.
>>
Anonymous
02/04/26(Wed)02:34:20 No.108055389 >>108055381
why are you spamming more of this shit? it's fucking tiresome
>>
Anonymous
02/04/26(Wed)02:34:57 No.108055393 >>108055381
I like his helmet.
>>
Anonymous
02/04/26(Wed)02:35:03 No.108055395 fresh
>>108055391
>>108055391
>>
Anonymous
02/04/26(Wed)02:51:04 No.108055489 replace the face of the man in image1 wearing armor with the face of the cartoon frog in image2, in the same pose as the man in image1.
>>
Anonymous
02/04/26(Wed)02:54:46 No.108055503 >>108055395
>>
Anonymous
02/04/26(Wed)03:02:55 No.108055547 >>108053366
delicious thighs
>>
Anonymous
02/04/26(Wed)03:05:48 No.108055562 >>108055547
thanks
>>
Anonymous
02/04/26(Wed)07:50:23 No.108056812 >>108055074
proompt
Reply to Thread #108053187