Thread #108624673
HomeIndexCatalogAll ThreadsNew ThreadReply
H
Previous /sdg/ thread : >>108606788

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Flux.2 Dev/Klein
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/black-forest-labs/FLUX.2-klein-4B
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Anima
https://huggingface.co/circlestone-labs/Anima

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/aco/sdg
>>>/b/degen
>>>/d/ddg
>>>/e/edg
>>>/gif/vdg
>>>/h/hdg
>>>/r/realistic+parody
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vp/napt
>>>/vt/vtai

OP https://rentry.co/twkuk8tz
+Showing all 57 replies.
>>
First for shithole general
>>
"adorable Quokka" according to ERNIE image turbo
Lmao
>>
>>108624720
it's an adorable stuffed quokka
>>
>mfw Resource news

04/17/2026

>ControlFoley: Unified and Controllable Video-to-Audio Generation with Cross-Modal Conflict Handling
https://yjx-research.github.io/ControlFoley

>TokenGS: Decoupling 3D Gaussian Prediction from Pixels with Learnable Tokens
https://research.nvidia.com/labs/toronto-ai/tokengs

>MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation
https://aka.ms/mm-webagent

>Qwen2D-VAE
https://huggingface.co/Anzhc/Qwen2D-VAE

>ComfyUI HY-World 2.0 — WorldMirror 3D
https://github.com/AHEKOT/ComfyUI_HYWorld2

>Anima Style Explorer: A free web tool for ComfyUI styles
https://anima.mooshieblob.com

>Stanford AI Index Report 2026
https://hai.stanford.edu/assets/files/ai_index_report_2026.pdf

04/16/2026

>Motif-Video 2B: A micro-budget text-to-video diffusion transformer from Motif Technologies
https://motiftech.io/videoshowcase

>HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds
https://huggingface.co/tencent/HY-World-2.0

>ErnieTurbo_extracted_lora
https://huggingface.co/GuangyuanSD/ErnieTurbo_extracted_lora/tree/main

04/15/2026

>DisCa: Accelerating Video Diffusion Transformers with Distillation-Compatible Learnable Feature Caching
https://huggingface.co/tencent/DisCa

>Lyra 2.0: Explorable Generative 3D Worlds
https://research.nvidia.com/labs/sil/projects/lyra2

>AniGen: Unified S3 Fields for Animatable 3D Asset Generation
https://github.com/VAST-AI-Research/AniGen

>T2I-BiasBench: A Multi-Metric Framework for Auditing Demographic and Cultural Bias in Text-to-Image Models
https://gyanendrachaubey.github.io/T2I-BiasBench

>Generative Refinement Networks for Visual Synthesis
https://github.com/MGenAI/GRN

>VideoFlexTok: Flexible-Length Coarse-to-Fine Video Tokenization
https://videoflextok.epfl.ch

>DiffusionPrint: Learning Generative Fingerprints for Diffusion-Based Inpainting Localization
https://github.com/mever-team/diffusionprint
>>
>mfw Research news

04/17/2026

>Seen-to-Scene: Keep the Seen, Generate the Unseen for Video Outpainting
https://arxiv.org/abs/2604.14648

>Prompt-to-Gesture: Measuring the Capabilities of I2V Deictic Gesture Generation
https://arxiv.org/abs/2604.14953

>Beyond Prompts: Unconditional 3D Inversion for Out-of-Distribution Shapes
https://daidedou.sorpi.fr/publication/beyondprompts

>Flow of Truth: Proactive Temporal Forensics for I2V Generation
https://arxiv.org/abs/2604.15003

>AnimationBench: Are Video Models Good at Character-Centric Animation?
https://animationbench.github.io

>DVFace: Spatio-Temporal Dual-Prior Diffusion for Video Face Restoration
https://arxiv.org/abs/2604.14560

>Geometrically Consistent Multi-View Scene Generation from Freehand Sketches
https://arxiv.org/abs/2604.14302

>Analysis of Regularization and Fokker-Planck Residuals in Diffusion Models for Img Gen
https://arxiv.org/abs/2604.15171

>Step-level Denoising-time Diffusion Alignment with Multiple Objectives
https://arxiv.org/abs/2604.14379

>Prompt-Guided Image Editing with Masked Logit Nudging in Visual Autoregressive Models
https://arxiv.org/abs/2604.14591

>Towards Design Compositing
https://arxiv.org/abs/2604.14605

>LeapAlign: Post-Training Flow Matching Models at Any Generation Step by Building Two-Step Trajectories
https://rockeycoss.github.io/leapalign

>The Courtroom Trial of Pixels: Robust Image Manipulation Localization via Adversarial Evidence and Reinforcement Learning Judgment
https://arxiv.org/abs/2604.14703

>Reward-Aware Trajectory Shaping for Few-step Visual Generation
https://arxiv.org/abs/2604.14910

>Deepfake Detection Generalization with Diffusion Noise
https://arxiv.org/abs/2604.14570

>Switch-KD: Visual-Switch Knowledge Distillation for VLMs
https://arxiv.org/abs/2604.14629

>Bird-SR: Bidirectional Reward-Guided Diffusion for Real-World Image Super-Resolution
https://arxiv.org/abs/2602.07069
>>
darn teeth
>>
>>
>>
>>108624914
nice
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
lel
>>
>>
>>108625483
hmm, which is the vile one
>>
>>
>>108625573
the one in the back
>>
>>
>>
>>
>>108624720
Nice. lol
Here's my ERNIE.
>>
>>
>>
>>108626018
>>108624720
nice. i plan on getting around to trying it, how hard is it to prompt compared to like klein/chroma/zit?
>>
>>108626170
So far feels easier, I couldn't change the aspect ratio without distortion/multiple subjects appearing. Follow styles, and knows characters..
>>
>>108626170
>>108626199
do we know what text encoder ernie uses? I don't see it mentioned on the model card
>>
>>108626351
ministral 3 3b afaict
>>
>>108624734
Now it turned into a koala
>>108626018
Kek, heebs will not divide us
>>
>>108626369
I dont think I've ever done anything with minstrel. is it the same as qwen basically?
>>
>>108626392
yeah but french instead of chinese. idk if they just use the text encoder built into the model or what. the comfy template also uses a prompt enhancer which i'd expect is kind of like that one thing from the news we were talking about the other day. i'm still downloading so i'm just talking out my ass atm
>>
>>108626420
>yeah but french instead of chinese
god damn it, I just finished learning mandarin and now I have to learn french too? wǒ zhè cāodàn de féi zhái rénshēng..
>>
>>108626369
oof and yikes
even t5 would be better
>>
>>
>>
>>
>>108626825
omg lewd
>>
>>108626935
it's a skin colored bodysuit
>>
ERNIE Quokkas are so pink
>>
>>108626966
it's either a disease or they've turned carnivorous
>>
This one came out better
>>108626991
New species!
>>
>>108626966
i remember some sdxl mix long ago kept drawing quokkas as green. if I had a quarter for every time quokkas had a spurious correlation with a random color.....
>>
zit version of chromagirl (zgirl?) has a real death-cult vibe to her, i dont get it
>>
>>108627088
should try out ernie and see what kind of stuff erniegirl gets up to
>>
>>108627101
>would have to update comfyui
i dont know about that
>>
>>
>>
>>
>>108627141
don't be a pussy it'll be fine, unless it's been months and then idk maybe ur fucked lol. there haven't been any issues i've seen on desktop for a while now and i update everytime it nags me like a good comfyslave
>>
>>108624673
i was going hard on SD back in 2023 or 2024 and it worked on my gtx1080 8gb but now that is simply not possible. What are the boys using nowadays and how much do i need to pay to get up to speed?
>>
>>108627450
z-image-turbo might work on that card, it's enough vram anyway. might be a tad slower than on a 30xx+
>>
>>108624673
Kik Epp23g
Tele Bgftg33

Make a Lora of my gf?

Reply to Thread #108624673


Supported: JPG, PNG, GIF, WebP, WebM, MP4, MP3 (max 4MB)