232

I just listened to this AI generated audiobook and if it didn't say it was AI, I'd have thought it was human-made. It has different voices, dramatization, sound effects... The last I'd heard about this tech was a post saying Stephen Fry's voice was stolen and replicated by AI. But since then, nothing, even though it's clearly advanced incredibly fast. You'd expect more buzz for something that went from detectable as AI to indistinguishable from humans so quickly. How is it that no one is talking about AI generated audiobooks and their rapid improvement? This seems like a huge deal to me.

you are viewing a single comment's thread
view the rest of the comments
[-] theskyisfalling@lemmy.dbzer0.com 30 points 1 year ago

As someone who only consumes books in audiobook form this is great news for me, I tried to listen to some automatically generated audio books around 2 years ago and I found them horrible to listen to just because they sounded so off.

I'd love to be able to copy in the text of a book and get actually listenable (is that a proper word?) audiobook out of the other side for some books that will just simply never be recorded by actual people due to being too old / obscure.

I've been wanting to be able to listen to the Pelucidar books for years but they just don't exist in audio format, is there somewhere publically available that I can do this?

[-] not_a_bot_i_swear@lemmy.world 17 points 1 year ago

I would guess there is a LOT of work going into each voice. Playing with different parameters and prompts. I don't think it's as simple as just copying the text into a box. Not yet at least :)

[-] pretzelz@lemmy.world 1 points 1 year ago* (last edited 1 year ago)

I don't see why you couldn't give a few examples and then grab the dialog of a person in along with their description (or just the whole book) and get the llm to generate the prompt for you

load more comments (4 replies)
load more comments (37 replies)
this post was submitted on 11 Nov 2023
232 points (94.6% liked)

Asklemmy

43812 readers
890 users here now

A loosely moderated place to ask open-ended questions

Search asklemmy ๐Ÿ”

If your post meets the following criteria, it's welcome here!

  1. Open-ended question
  2. Not offensive: at this point, we do not have the bandwidth to moderate overtly political discussions. Assume best intent and be excellent to each other.
  3. Not regarding using or support for Lemmy: context, see the list of support communities and tools for finding communities below
  4. Not ad nauseam inducing: please make sure it is a question that would be new to most members
  5. An actual topic of discussion

Looking for support?

Looking for a community?

~Icon~ ~by~ ~@Double_A@discuss.tchncs.de~

founded 5 years ago
MODERATORS