Anthropic/OpenAI may be spending more than $1000 for every $100 you pay them (ea.rna.nl)

submitted 1 month ago by Trilogy3452@lemmy.world to c/technology@lemmy.world

[–] ag10n@lemmy.world 3 points 1 month ago (2 children)

It’s not the 90s anymore. Unless a compression algorithm can squeeze billions of relationships into a manageable size, local AI under 8 GB of VRAM is limited to highly specific tasks (text-to-speech, for example, fits in under 1 GB), and that is before counting the context needed to keep a conversation going or write code.
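
For a rough sense of the 8 GB point, here is a back-of-the-envelope sketch in Python. It only counts the model weights (parameter count times bits per parameter) and ignores the context/KV cache and activations, so real usage would be higher. The parameter counts and bit widths are illustrative assumptions, not figures from this thread, and `weight_memory_gib` and `fits_in_vram` are hypothetical helper names.

```python
def weight_memory_gib(n_params: float, bits_per_param: int) -> float:
    """Approximate memory for the model weights alone, in GiB.

    Ignores the KV cache (conversation context), activations and
    runtime overhead, so this is a lower bound, not a measurement.
    """
    return n_params * bits_per_param / 8 / 2**30


def fits_in_vram(n_params: float, bits_per_param: int, vram_gib: float = 8.0) -> bool:
    """True if the weights alone would fit in the given VRAM budget."""
    return weight_memory_gib(n_params, bits_per_param) <= vram_gib


if __name__ == "__main__":
    # Illustrative model sizes (assumed): 1B, 7B and 70B parameters.
    for n_params in (1e9, 7e9, 70e9):
        for bits in (16, 8, 4):
            gib = weight_memory_gib(n_params, bits)
            verdict = "fits" if fits_in_vram(n_params, bits) else "does not fit"
            print(f"{n_params / 1e9:>4.0f}B params @ {bits:>2}-bit: "
                  f"{gib:6.1f} GiB of weights, {verdict} in 8 GiB")
```

Under these assumptions, only the smaller or heavily quantized models leave any headroom in 8 GiB, and whatever is left still has to hold the context.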

[–] ThirdConsul@lemmy.zip 1 points 1 month ago (1 children)

If text-to-speech is what YouTube uses to autogenerate subtitles, it is worthless for anything that uses a slightly richer vocabulary.

[–] pirat@lemmy.world 2 points 1 month ago

No. Autogenerated subtitles would be speech-to-text, rather than text-to-speech.

[–] blackbeans@lemmy.zip -2 points 1 month ago (2 children)

To be clear, I wasn't talking about a leap in LLM design. I was talking about a leap in hardware capabilities...

[–] ag10n@lemmy.world 2 points 1 month ago

Which are increasingly out of reach for a normal person. Phones, let alone PC hardware, have increased exponentially in price in recent years.

[–] KRAW@linux.community 2 points 1 month ago

Improved hardware capabilities used to come very quickly (see Moore's Law and Dennard scaling). However, that trend is basically over, so getting higher-performance hardware now takes a lot of effort spent specializing hardware for certain tasks. That's why you see these inference accelerators like Groq, SambaNova, Cerebras, etc. However, this is hardware that is still gonna go into data centers. Something innovative has to happen on the AI side for commercial-grade models to be runnable on consumer hardware.