People Twitter

10037 readers

958 users here now

People tweeting stuff. We allow tweets from anyone.

RULES:

Mark NSFW content.
No doxxing people.
Must be a pic of the tweet or similar. No direct links to the tweet.
No bullying or international politcs
Be excellent to each other.
Provide an archived link to the tweet (or similar) being shown if it's a major figure or a politician. Archive.is the best way.

founded 2 years ago

MODERATORS

SendMeYourTaTas@sh.itjust.works

pelespirit@sh.itjust.works

749

Managers (media.piefed.zip)

submitted 2 days ago* (last edited 2 days ago) by inari@piefed.zip to c/whitepeopletwitter@sh.itjust.works

171 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] zloubida@sh.itjust.works 15 points 2 days ago (3 children)

I'm not a developer and I don't know a thing about the capabilities of LLMs so this may explain that, but I'm quite surprised that open weight LLMs could actually match Claude.

[–] theunknownmuncher@lemmy.world 26 points 2 days ago (1 children)

Yes, the big proprietary cloud models have an edge, but it is narrow and the open-weight models are constantly closing the gap. There is no moat when it comes to AI models and no company has yet discovered some secret special sauce to improve their model significantly over others.

Running the latest and greatest open-weight GLM, Kimi, or Qwen model is basically equivalent to running the previous latest and greatest version of Claude. So if you were happy with Claude then, you'll basically be happy with an open-weight model now.

[–] Bluescluestoothpaste@sh.itjust.works 3 points 2 days ago (1 children)

Well it's the speed and processing power, i dont believe you can get anywhere close to cloud claude performance on any standard desktop

[–] theunknownmuncher@lemmy.world 7 points 2 days ago (1 children)

Surprisingly, yes you absolutely can with Qwen3.6 35b. Also, a business would be putting together a dedicated interference server to serve many users, not any standard desktop.

[–] Bluescluestoothpaste@sh.itjust.works 1 points 2 days ago* (last edited 2 days ago)

I see, but im guessing that OP dumbass literally wants to run llm on their laptops lol

[–] Xanvial@lemmy.world 5 points 2 days ago

Match current Claude is not, but Claude 6-12 months ago should be possible using Open model

[–] MalReynolds@slrpnk.net 3 points 2 days ago* (last edited 2 days ago)

Mostly down to frameworks (the bits around the LLM like RAG, memory, prompts, agents etc.) now. The ability to just throw more tokens at the problem is also super important. And you can because you're just paying for electricity (and CapEx for the hardware), not tokens from companies that are doing pre-IPO monetization (i.e. tokens gonna go up, way up). They've been losing money hand over fist to gain market share and pump the idea, that was never going to last.