this post was submitted on 30 Dec 2025
92 points (93.4% liked)

Programming

24135 readers
495 users here now

Welcome to the main community in programming.dev! Feel free to post anything relating to programming here!

Cross posting is strongly encouraged in the instance. If you feel your post or another person's post makes sense in another community cross post into it.

Hope you enjoy the instance!

Rules

Rules

  • Follow the programming.dev instance rules
  • Keep content related to programming in some way
  • If you're posting long videos try to add in some form of tldr for those who don't want to watch videos

Wormhole

Follow the wormhole through a path of communities !webdev@programming.dev



founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] brucethemoose@lemmy.world 23 points 19 hours ago* (last edited 19 hours ago) (1 children)

If you think of LLMs as an extra teammate, there's no fun in managing them either. Nurturing the personal growth of an LLM is an obvious waste of time. Micromanaging them, watching to preempt slop and derailment, is frustrating and rage-inducing.

Finetuning LLMs for niche tasks is fun. It's explorative, creative, cumulitive, and scratches a 'must optimize' part of my brain. It feels like you're actually building and personalizing something, and teaches you how they work and where they fail, like making any good program or tool. It feels you're part of a niche 'old internet' hacking community, not in the maw of Big Tech.

Using proprietary LLMs over APIs is indeed soul crushing. IMO this is why devs who have to use LLMs should strive to run finetunable, open weights models where they work, even if they aren't as good as Claude Code.

But I think most don't know they exist. Or had a terrible experience with terrible ollama defaults, hence assume that must be what the open model ecosystem is like.

[–] BlameThePeacock@lemmy.ca 2 points 19 hours ago (1 children)

Improving your input, and the system message can also be part of that. There are multiple optimizations available for these systems that people aren't really good at yet.

It's like watching Grandma google "Hi, I'd like a new shirt" back in the day and then having her complain that she's getting absolutely terrible search results.

[–] brucethemoose@lemmy.world 9 points 19 hours ago* (last edited 19 hours ago)

Mmmmm. Pure "prompt engineering" feels soulless to me. And you have zero control over the endpoint, so changes on their end can break your prompt at any time.

Messing with logprobs and raw completion syntax was fun, but the US proprietary models took that away. Even sampling is kind of restricted now, and primitive compared to what's been developed in open source.