this post was submitted on 19 Nov 2025
68 points (98.6% liked)
Games
21161 readers
473 users here now
Tabletop, DnD, board games, and minecraft. Also Animal Crossing.
Rules
- No racism, sexism, ableism, homophobia, or transphobia. Don't care if it's ironic don't post comments or content like that here.
- Mark spoilers
- No bad mouthing sonic games here :no-copyright:
- No gamers allowed :soviet-huff:
- No squabbling or petty arguments here. Remember to disengage and respect others choice to do so when an argument gets too much
- Anti-Edelgard von Hresvelg trolling will result in an immediate ban from c/games and submitted to the site administrators for review. :silly-liberator:
founded 5 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Aren't
s supposed to be super nitpicky about blurry graphics, fps stutters and so on.
But now that it's poor quality AI gen voices, slapped on top of otherwise what seems like a very well made game, they're gonna talk about how we're just not ready for it?
I'm still unclear what the AI voices in it even are, going off what people have been saying about it. It sounds like the actual NPC dialogue was from voice actors, who were also hired to provide samples for a text-to-speech thing for player characters? Or that it's doing a speech-to-text and then text-to-speech filter process on proximity VOIP? If it's literally just "it's using text-to-speech tech on dynamic text that players provide" that's such a completely unobjectionable thing that I can only imagine the backlash is coming from people who see the buzzword "AI" and immediately think ChatGPT instead of "text-to-speech with a slightly higher quality than it used to have".
you can do a ping in game that makes your character say: "i have a (insert game item here" or "let's go to (insert game location here)". from what i understand they trained ai off the hired voice actors, with consent, to voice these lines
See that doesn't sound any different from just having them record a phonemic inventory for a traditional text-to-speech system, it's just simplifying the process and making it so the text doesn't need to have a corresponding pronunciation key, and presumably meshing it in a bit better.