I would not be surprised if Anthropic actually hired a real developer to make these PRs as a marketing stunt.
Programmer Humor
Well, if the model detected an issue, and a human tested it to make sure it was real and then fixed it, I think that's an acceptable use of AI tools.
Yeah, AI as an assistant/tool, not as a replacement.
In 2021, when Amazon launched its first “just walk out” grocery store in the UK in Ealing, west London, this newspaper reported on the cutting-edge technologies that Amazon said made it all possible: facial-recognition cameras, sensors on the shelves and, of course, “artificial intelligence”.
An employee who worked on the technology said that actual humans – albeit distant and invisible ones, based in India – reviewed about 70% of sales made in the “cashier-less” shops as of mid-2022
UK AI company builder.ai has been tricking customers and investors for eight years – selling an advanced code-writing AI that, it turns out, is actually an Indian software farm employing 700 human developers.
AI: Actually Indians
So this is basically a rebrand of Fiverr or whatever it's called?
builder AI was genuine AI, it's just that the company simultaneously also did contracted development with real humans. journalists got confused.
there's a really good youtube documentary i watched which actually got into the tools and software used, but I can't find it anymore. either way, you can't dress up humans coding as AI. it's not fast enough.
(In case someone has been living under a rock for the last 48 hours: Anthropic's new model "Mythos" has been finding a lot of new vulnerabilities. This is about patching one.)
AI tools can detect potential vulnerabilities and suggest fixes. You can still go in by hand, verify the problem, and carefully apply a fix.
AI is actually SUPER good at this and is one of the few places I think AI should be used (as one of many tools, ignoring the awful environmental impacts of AI and assuming an on-prem model). AI is also good at detecting code performance issues.
With that said, all of the recommended fixes should be applied by hand.
Yeah, I would also add ignoring how the training data is usually sourced. I agree AI can be useful, but it just feels so unethical that I find it hard to justify.
I'm a big LLM hater atm but once we're using models that are efficient, local and trained on ethically sourced data I think I could finally feel more comfortable with it all. Can't be writing code for me though - why would I want the bot to do the fun part?
LLMs are very good at programming when there are a huge number of guardrails around them. For example, exploit testing is a great use case because getting a shell is getting a shell: either it worked or it didn't.
They kind of act as a smarter version of the infinite monkeys, one that can try and iterate much more efficiently than a human does.
On the other hand, in tasks that require creativity or architecture, and in projects without guardrails, they tend to do a terrible job, often yielding a solution that is more convoluted than it needs to be or just plain old incorrect.
I find it is yet another replacement for "pure labor", where the most unintelligent part of programming, i.e. writing the code, is automated away. While I will still write code from scratch when I am trying to learn, I will likely be able to automate some code writing if I know exactly how to implement it in my head and I also have access to plenty of testing to guarantee correctness.
People have trouble with the middle ground. AI is useful in coding. It's not a full replacement. That should be fine, except you've got the AI techbros and CEOs on one end thinking it will replace all labor, and then you've got the backlash to that on the other end that wants to constantly talk about how useless it is.
I’d buy you a beer for that summary. That is exactly SPOT ON.
the times i trust LLMs: when i am using it to look up stuff i have already learned, but i can't remember and just need to refresh my memory. there's no point memorizing shit i can look up and am not going to use regularly, and i'm the effective guardrail against the LLMs being wrong when I'm using them.
the times i don't trust the LLMs: all the other times. if i can't effectively verify the information myself, why am i going to an unreliable source?
rather than having to explain that nuance over and over, it's just shorter and easier to say the llm is an unreliable source. which it is. when i'm not being lazy, my own output doesn't need testing (it still gets at least 2 reviews, but the last time those reviews caught anything was years ago). the llm's output always needs testing.
They are also great for programming one-off personal projects that, frankly, don't have the scale of use that needs rigorous security oversight. Especially since, if you did it yourself, you probably were not sanitizing the inputs (etc.) anyway. You were slapping down some Python code and moving on.
Like, I don't care if my script to convert WordPress exports to Markdown files crashes if you feed it a JPEG. I am the only one using it, for this data manipulation task.
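For a sense of scale, a throwaway script like that can be a couple dozen lines. A rough sketch, assuming a standard WordPress WXR export; the function name and file-naming scheme are made up for illustration, and the post body is written out as-is (still HTML) rather than properly converted:

```python
import re
import xml.etree.ElementTree as ET
from pathlib import Path

# Namespace that WXR exports use for the post body (<content:encoded>).
NS = {"content": "http://purl.org/rss/1.0/modules/content/"}

def export_to_markdown(xml_text, out_dir):
    """Hypothetical helper: write one .md file per <item>, return the paths."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    # No input validation on purpose: anything that isn't XML makes
    # ET.fromstring raise, and that is considered acceptable here.
    root = ET.fromstring(xml_text)
    for item in root.iter("item"):
        title = item.findtext("title") or "untitled"
        body = item.findtext("content:encoded", default="", namespaces=NS)
        # Derive a file name from the title: lowercase, dashes for the rest.
        slug = re.sub(r"[^a-z0-9]+", "-", title.lower()).strip("-") or "untitled"
        path = out / f"{slug}.md"
        path.write_text(f"# {title}\n\n{body}\n", encoding="utf-8")
        written.append(path)
    return written
```

Posts with the same title overwrite each other, which is the kind of corner you cut when you are the only user.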
The thing is, you know how it is in your head and you need to lay out that entire context.
And after that you MUST review the code, because you never know. I wouldn't call it automation if I have to double check EVERY TIME.
It's OK to hate AI slop and recognize the immediate threat to cyber security it brings. At least they are trying to mitigate it. There have been no similar actions from other frontier models. They are deliberately helping open source projects with little funding to keep pace.
Anthropic right now are the good people.
That probably won’t last. But out of a bad bunch they’re the least bad.
"the good people."
You are limiting your own intelligence by thinking companies can be described in those words.
They are not good. They are profit-seeking. Profit-seeking doesn't necessarily mean evil, but it can never mean good. A non-profit whose goal is to improve the community around them, a co-op whose goal is to treat their workers with respect, etc. can all be described as 'good' to varying degrees, but no for-profit entity, especially a publicly traded one, can ever be described as 'good'.
Hence their point about being the best of a bad bunch. Remember the people making decisions are people. A corporation has no soul and only seeks profit. People work for them and can make good decisions and be good people whoever they work for.
There were good people who worked for the Nazis. Unless you think the cleaner at the Nazi headquarters, for instance, cleaned as a way to speak evil.
However, I take your point. I just think that's not the point of the discussion here, and it's no different from "both sides are bad" in politics. It lacks nuance.
The ffmpeg team was mad at Google when they reported a bug that was found and reported automatically with an AI. Google reported the bug without providing a fix and also gave an ultimatum: Google would publicize the bug report after 60 days. That’s what pissed off the ffmpeg devs. Not to mention that it was a very obscure bug, like ffmpeg didn’t decode a video file from a 90’s videogame correctly.
Anthropic, on the other hand, found a bug and provided a fix. So why would they be mad if the fix is properly written and fixes the bug?
Because people want to only back their tribe and not the truth.
It's really only a minority, or else the world would not work. Think how the theory of evolution gained mainstream acceptance, despite resistance by fanatics who had the support of society.
So they read them, and the patches were good (according to this message)
Why hate then?
Hold on, wasn't one of the "features" of the "leaked" Assumed Intelligence source code the "human"-like version?
The leak was harness code, not agent weights. This is a new frontier model, not some CLI upgrade.
I'm not sure if we're talking about the same thing. One of the recent leaks had code that pretended to be a developer, so you could pick if it submitted a PR as Assumed Intelligence, or as a person.
I'll see if I can find a reference.
Edit: Undercover Mode in Claude Code:
Ohh yes, sorry. Would love to read about that one too if you happen to find it.
Just added the link.