387
submitted 8 months ago* (last edited 8 months ago) by misk@sopuli.xyz to c/technology@lemmy.world
you are viewing a single comment's thread
view the rest of the comments
[-] CrayonRosary@lemmy.world 4 points 8 months ago* (last edited 8 months ago)

~~Absolutely not! ChatGPT is a large language model and cannot generate images.~~

ChatGPT can have a little image gen once in a while as a treat.

[-] june@lemmy.world 17 points 8 months ago

It’s awful at text in images though. Pretty sure it draws the text rather than writes it, if that makes sense lol. I had it try 4 times and it got it wrong every time

[-] admin@lemmy.my-box.dev 10 points 8 months ago

That's GPT talking to DALL-E though - GPT is just the messenger, and has no idea what's in the image, other than the prompt it generated for you.

[-] srecko@lemm.ee 4 points 8 months ago

ChatGPT talks to GPT something (3 or 4 with or without turbo) and Dall-e, and ChatGPT isnt generating anything at all but that is just being pedantic for the sake of it. We all know what the OP meant.

[-] CrayonRosary@lemmy.world 2 points 8 months ago

I certainly didn't. I had no idea it was hooked into an image generator now.

[-] CrayonRosary@lemmy.world 2 points 8 months ago

Nice! Shows what I know. I had no idea it was hooked into an image generator now.

[-] june@lemmy.world 2 points 8 months ago

To be fair, it is limited to GPT4.

[-] tsonfeir@lemm.ee 11 points 8 months ago
[-] fidodo@lemmy.world 4 points 8 months ago

The llm is executing a function on a diffusion image model. The llm does not generate the image itself

[-] kelvie@lemmy.ca 7 points 8 months ago

This doesn't contradict what the OP said. ChatGPT is now an interface to both an LLM and a diffusion-based image generator.

[-] CrayonRosary@lemmy.world 1 points 8 months ago

ChatGPT is just a front-end that maintains a session that gets fed to an LLM each time you add a reply, and now has access to image gen, too, so I was wrong.

[-] tsonfeir@lemm.ee 1 points 8 months ago

You’re being pedantic—and confidently ignorant. The product is called “ChatGPT” and through that you can access multiple models. Like ChatGPT 3.5, or DALL•E.

[-] h3rm17@sh.itjust.works 1 points 8 months ago

Yeah, but the model that does the images is actually Dall-e, you are just using gpt's interface to create them

[-] tsonfeir@lemm.ee 2 points 8 months ago

So, I’m using ChatGPT.

Thank you for agreeing with me.

[-] h3rm17@sh.itjust.works 2 points 8 months ago

Sure, sure, was not desagreeing, technically you are using ChatGPT. Just pointing out that the model itself handling the image creation is not chatgpt

[-] wjrii@kbin.social 1 points 8 months ago

Girl on the right probably killed a Spanish swordsmith back in the day.

[-] Nexz@feddit.nl 1 points 8 months ago

I mean, the GPT model is a LLM and ChatGPT uses DALL-E in the background to create images. So depending on definition you’re both correct :-)

[-] tsonfeir@lemm.ee 0 points 8 months ago

Depending on how I define anything means I’m always correct I guess. 🤷‍♂️

this post was submitted on 02 Mar 2024
387 points (96.4% liked)

Technology

59415 readers
2634 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS