1
1
FLUX, I love you... (www.reddit.com)
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Akumetsu_971 on 2024-09-07 23:58:41+00:00.

2
1
The original was posted on /r/stablediffusion by /u/Psi-Clone on 2024-09-07 22:48:19+00:00.

3
1
The original was posted on /r/stablediffusion by /u/Total-Resort-3120 on 2024-09-07 19:20:04+00:00.

4
1
The original was posted on /r/stablediffusion by /u/smooshie on 2024-09-07 18:28:45+00:00.

5
1
The original was posted on /r/stablediffusion by /u/Overall-Newspaper-21 on 2024-09-07 17:01:09+00:00.


Is the problem Dim/Alpha?

6
1
The original was posted on /r/stablediffusion by /u/Glittering-Football9 on 2024-09-07 13:28:49+00:00.

7
1
The original was posted on /r/stablediffusion by /u/Puzzled-Background-5 on 2024-09-07 08:06:53+00:00.

8
1
The original was posted on /r/stablediffusion by /u/protector111 on 2024-09-07 11:29:36+00:00.

9
1
The original was posted on /r/stablediffusion by /u/RepresentativeJob937 on 2024-09-07 11:23:00+00:00.


It is possible to run Cog within 3.5 GB of VRAM using quantization and offloading.
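
For a rough sense of why quantization helps here, this back-of-the-envelope sketch estimates the memory needed for model weights alone at different precisions. The 5-billion parameter count is an assumption for illustration (the post does not state the model size), and the figures ignore activations, the text encoder and the VAE. Offloading then reduces peak VRAM further by keeping only part of the model on the GPU at a time.

```python
def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Approximate memory for model weights alone, in GB (10^9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


# Assumed parameter count for illustration only; not taken from the post.
NUM_PARAMS = 5e9

for label, bits in [("bf16", 16), ("int8", 8), ("int4", 4)]:
    print(f"{label}: {weight_memory_gb(NUM_PARAMS, bits):.1f} GB")
```

Halving the bits per weight halves the weight footprint, which is why lower-precision formats bring a model of this size much closer to a small VRAM budget.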

We have released a repository that provides optimized recipes to generate images and videos with very few lines of code.

Check it out here:

10
1
The original was posted on /r/stablediffusion by /u/Nyao on 2024-09-07 08:18:10+00:00.

11
1
The original was posted on /r/stablediffusion by /u/Huihejfofew on 2024-09-07 08:01:32+00:00.


I didn't believe the hype. Whenever I heard about other UIs like Comfy, Swarm and Forge, I figured, "Eh, I'm just a casual user. I use Stable Diffusion for fun, so why should I bother learning new UIs?" But I heard that Forge was faster than A1111 and figured, hell, it's almost the same UI, might as well give it a shot.

And holy shit, depending on your use, Forge is stupidly fast compared to A1111. I think the main difference is that Forge doesn't need to reload LoRAs and whatnot if you use them often in your outputs. I was waiting 20 seconds per generation on A1111 when I used a lot of LoRAs at once. I switched to Forge and couldn't believe my eyes. After the first generation, with no LoRA weight changes, my generation time dropped to 2 seconds. It's insane (probably because it's not reloading the LoRAs). Such a simple change but a ridiculously huge improvement. Shoutout to the person who implemented this idea; it's programmers like you who make the real difference.

After using it for a little bit, I've found some bugs here and there, like full page image not always working. I haven't delved deep, so I imagine there are more, but the speed gains alone justify the switch for me personally. Though I am not an advanced user. You can still use A1111 if something in Forge happens to be buggy.

Highly recommend.

12
1
Roasted spider (i.redd.it)
The original was posted on /r/stablediffusion by /u/KacperXX on 2024-09-06 17:54:14+00:00.

13
1
The original was posted on /r/stablediffusion by /u/joachim_s on 2024-09-06 14:27:20+00:00.

14
1
The original was posted on /r/stablediffusion by /u/upboat_allgoals on 2024-09-06 20:48:50+00:00.

15
1
The original was posted on /r/stablediffusion by /u/wonderflex on 2024-09-06 18:37:26+00:00.

16
1
The original was posted on /r/stablediffusion by /u/alexds9 on 2024-09-06 14:26:28+00:00.

17
1
The original was posted on /r/stablediffusion by /u/KudzuEye on 2024-09-06 16:52:05+00:00.

18
1
The original was posted on /r/stablediffusion by /u/cocktail_peanut on 2024-09-06 16:19:18+00:00.

19
1
The original was posted on /r/stablediffusion by /u/Hullefar on 2024-09-06 12:47:50+00:00.

20
1
The original was posted on /r/stablediffusion by /u/justin_wiggins on 2024-09-06 10:28:41+00:00.

21
1
The original was posted on /r/stablediffusion by /u/sonicboom292 on 2024-09-06 02:26:01+00:00.

22
1
The original was posted on /r/stablediffusion by /u/OkSpot3819 on 2024-09-06 09:03:37+00:00.


  • SKYBOX AI: create 360° worlds with one image
  • Text-Guided-Image-Colorization: influence the colorisation of objects in your images using text prompts (uses SDXL and CLIP) (GITHUB)
  • Meta's Sapiens segmentation model is now available on Hugging Face Spaces (HUGGING FACE DEMO)
  • Anifusion.ai: create comic books through a web app UI
  • MiniMax: new Chinese text2video model; they also offer free music generation (https://hailuoai.com/music)
  • Viewcrafter: generate high-fidelity novel views from single or sparse input images with accurate camera pose control (GITHUB CODE | HUGGING FACE DEMO)
  • LumaLabsAI released V 6.1 of Dream Machine which now features camera controls
  • RB-Modulation (IP-Adapter alternative by Google): training-free personalization of diffusion models using stochastic optimal control (HUGGING FACE DEMO)
  • New ChatGPT Voices: Fathom, Glimmer, Harp, Maple, Orbit, Rainbow (1, 2 and 3 - not working yet), Reef, Ridge and Vale (X Video Preview)
  • FluxMusic: SOTA open-source text-to-music model (GITHUB | JUPYTER NOTEBOOK | PAPER)
  • P2P-Bridge: remove noise from 3D scans (GITHUB | PAPER)
  • HivisionIDPhoto: uses a set of models and workflows for portrait recognition, image cutout & ID photo generation (HUGGING FACE DEMO | GITHUB)
  • ComfyUI-AdvancedLivePortrait Update (GITHUB)
  • ComfyUI v0.2.0: support for Flux controlnets from Xlab and InstantX; improvement to queue management; node library enhancement; quality of life updates (BLOG POST)
  • A song made by SUNO breaks 100k views on Youtube (LINK)

These will all be covered in the weekly newsletter; check out the most recent issue.

Here are the updates from the previous week:

  • Joy Caption Update: Improved tool for generating natural language captions for images, including NSFW content. Significant speed improvements and ComfyUI integration.
  • FLUX Training Insights: New article suggests FLUX can understand more complex concepts than previously thought. Minimal captions and abstract prompts can lead to better results.
  • Realism Techniques: Tips for generating more realistic images using FLUX, including deliberately lowering image quality in prompts and reducing guidance scale.
  • LoRA Training for Logos: Discussion on training LoRAs of company logos using FLUX, with insights on dataset size and training parameters.

⚓ Links, context, visuals for the section above ⚓

  • FluxForge v0.1: New tool for searching FLUX LoRA models across Civitai and Hugging Face repositories, updated every 2 hours.
  • Juggernaut XI: Enhanced SDXL model with improved prompt adherence and expanded dataset.
  • FLUX.1 ai-toolkit UI on Gradio: User interface for FLUX with drag-and-drop functionality and AI captioning.
  • Kolors Virtual Try-On App UI on Gradio: Demo for virtual clothing try-on application.
  • CogVideoX-5B: Open-weights text-to-video generation model capable of creating 6-second videos.
  • Melyn's 3D Render SDXL LoRA: LoRA model for Stable Diffusion XL trained on personal 3D renders.
  • sd-ppp Photoshop Extension: Brings regional prompt support for ComfyUI to Photoshop.
  • GenWarp: AI model that generates new viewpoints of a scene from a single input image.
  • Flux Latent Detailer Workflow: Experimental ComfyUI workflow for enhancing fine details in images using latent interpolation.

⚓ Links, context, visuals for the section above ⚓

23
1
The original was posted on /r/stablediffusion by /u/AnimeDiff on 2024-09-05 22:17:13+00:00.

24
1
The original was posted on /r/stablediffusion by /u/Fearless-Chart5441 on 2024-09-06 03:19:56+00:00.

25
1
The original was posted on /r/stablediffusion by /u/Nyao on 2024-09-05 18:01:00+00:00.
