LLMS Are Not Fun (orib.dev)

submitted 1 month ago by codeinabox@programming.dev to c/programming@programming.dev

[–] brucethemoose@lemmy.world 26 points 1 month ago* (last edited 1 month ago) (2 children)

If you think of LLMs as an extra teammate, there's no fun in managing them either. Nurturing the personal growth of an LLM is an obvious waste of time. Micromanaging them, watching to preempt slop and derailment, is frustrating and rage-inducing.

Finetuning LLMs for niche tasks is fun. It's explorative, creative, cumulative, and scratches a 'must optimize' part of my brain. It feels like you're actually building and personalizing something, and teaches you how they work and where they fail, like making any good program or tool. It feels like you're part of a niche 'old internet' hacking community, not in the maw of Big Tech.

Using proprietary LLMs over APIs is indeed soul crushing. IMO this is why devs who have to use LLMs should strive to run finetunable, open weights models where they work, even if they aren't as good as Claude Code.

But I think most don't know they exist, or they had a terrible experience with terrible ollama defaults and hence assume that must be what the open model ecosystem is like.

[–] ExLisper@lemmy.curiana.net 4 points 1 month ago

What he's talking about is teaching a person and watching them grow, become a better engineer and move on to do great things, not tweaking some settings in a tool so it works better. How do people not understand that?

[–] BlameThePeacock@lemmy.ca 2 points 1 month ago (1 children)

Improving your input and the system message can also be part of that. There are multiple optimizations available for these systems that people aren't really good at yet.

It's like watching Grandma google "Hi, I'd like a new shirt" back in the day and then having her complain that she's getting absolutely terrible search results.

[–] brucethemoose@lemmy.world 11 points 1 month ago* (last edited 1 month ago)

Mmmmm. Pure "prompt engineering" feels soulless to me. And you have zero control over the endpoint, so changes on their end can break your prompt at any time.

Messing with logprobs and raw completion syntax was fun, but the US proprietary models took that away. Even sampling is kind of restricted now, and primitive compared to what's been developed in open source.
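
For anyone who hasn't played with this, here is a rough sketch of the kind of thing you can do when you have raw logprobs in hand. It shows min-p filtering, one sampler popularized in open-source inference tools; the comment above doesn't name a specific sampler, so picking min-p is my assumption. The function names, the `{token: logprob}` input shape, and applying temperature before the filter are all choices made for this illustration, not any particular library's API.

```python
import math
import random


def min_p_filter(logprobs, min_p=0.1, temperature=1.0):
    """Return {token: prob} for tokens that survive min-p filtering.

    `logprobs` is a hypothetical {token: log-probability} mapping, such as
    the top-k logprobs some local inference servers expose per step.
    """
    # Temperature-scale the log-probabilities, then softmax them.
    scaled = {tok: lp / temperature for tok, lp in logprobs.items()}
    peak = max(scaled.values())  # subtract the max for numerical stability
    weights = {tok: math.exp(lp - peak) for tok, lp in scaled.items()}
    total = sum(weights.values())
    probs = {tok: w / total for tok, w in weights.items()}

    # Keep tokens whose probability is at least min_p times the top probability.
    threshold = min_p * max(probs.values())
    return {tok: p for tok, p in probs.items() if p >= threshold}


def min_p_sample(logprobs, min_p=0.1, temperature=1.0, rng=None):
    """Draw one token from the min-p filtered distribution."""
    rng = rng or random.Random()
    kept = min_p_filter(logprobs, min_p=min_p, temperature=temperature)
    tokens = list(kept)
    # random.choices renormalizes the weights, so the kept probabilities
    # do not need to sum to 1.
    return rng.choices(tokens, weights=[kept[t] for t in tokens], k=1)[0]


if __name__ == "__main__":
    step = {"cat": math.log(0.70), "dog": math.log(0.25), "xylophone": math.log(0.05)}
    print(min_p_sample(step, min_p=0.2, rng=random.Random(0)))
```

The cutoff scales with the model's confidence: when one token dominates, the unlikely tail is dropped, and when the distribution is flat, more candidates stay in play. None of this is possible when the endpoint only hands back finished text.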