TheCornCollector

joined 4 months ago

GLM-Image released: a hybrid image generation model (huggingface.co)

submitted 1 week ago by TheCornCollector@lemmy.zip to c/fosai@lemmy.world

0 comments fedilink

GLM-Image, an open-weight image generation model, adopts a hybrid autoregressive + diffusion decoder architecture. In general image generation quality, GLM‑Image aligns with mainstream latent diffusion approaches, but it shows significant advantages in text-rendering and knowledge‑intensive generation scenarios. It performs especially well in tasks requiring precise semantic understanding and complex information expression, while maintaining strong capabilities in high‑fidelity and fine‑grained detail generation. In addition to text‑to‑image generation, GLM‑Image also supports a rich set of image‑to‑image tasks including image editing, style transfer, identity‑preserving generation, and multi‑subject consistency.

The model weights are MIT licensed, did not see any training code or data, yet.