this post was submitted on 11 Aug 2025
30 points (100.0% liked)

technology

24027 readers
198 users here now

On the road to fully automated luxury gay space communism.

Spreading Linux propaganda since 2020

Rules:

founded 5 years ago
MODERATORS
top 32 comments
sorted by: hot top controversial new old
[–] EnsignRedshirt@hexbear.net 41 points 1 month ago (1 children)

Hopefully the automated bug hunters can help keep up with the security vulnerabilities created by AI coding.

[–] invalidusernamelol@hexbear.net 24 points 1 month ago (1 children)

Make both of them part of the same reward function so the AI can generate vulnerabilities that the AI can immediately bug hunt.

[–] GrouchyGrouse@hexbear.net 11 points 1 month ago (1 children)

The capitalists finally became job creators

[–] invalidusernamelol@hexbear.net 5 points 1 month ago

Number of resolved tickets go up

[–] hello_hello@hexbear.net 36 points 1 month ago (2 children)

"Hey google can you publish the bug hunter AI and its details so we can verify?" doggirl-smart

porky-happy : "no"

doggirl-sleep

[–] Palacegalleryratio@hexbear.net 6 points 1 month ago

The Aurora Borealis? At this time of year? At this time of day? In this part of the country? Localized entirely within your kitchen?

Yes.

May I see it?

No.

[–] limer@lemmy.ml 18 points 1 month ago (1 children)

I’ll reserve judgement until after the bugs are published. Until then, I am expecting minor issues only

[–] yogthos@lemmygrad.ml 5 points 1 month ago (2 children)

I mean if these tools help catch any issues in automated fashion that's still a win.

[–] limer@lemmy.ml 21 points 1 month ago (1 children)

They found ten issues, but how many hours spent filtering out the false positives?

[–] yogthos@lemmygrad.ml 1 points 1 month ago (1 children)

We don't know, however of this is security related issues then it doesn't matter. The cost of a breach would be obviously higher.

[–] Le_Wokisme@hexbear.net 5 points 1 month ago (1 children)

compare to the cost of humans finding them the normal way, not whatever breach you're imagining.

[–] yogthos@lemmygrad.ml 2 points 1 month ago (2 children)

Clearly the humans didn't find them the normal way, because they wouldn't be there to be found otherwise would they?

[–] limer@lemmy.ml 3 points 1 month ago (1 children)

We don’t know the details yet. Maybe they have a great new tool; perhaps they picked projects that are not maintained so well.

It will be awesome if they found bugs in curl, not so good to show if they picked my project.

What they did will be revealed in time

[–] yogthos@lemmygrad.ml 2 points 1 month ago (1 children)

I'm sure we'll get more info in due time.

[–] limer@lemmy.ml 2 points 1 month ago

Yes, hopefully in a couple of weeks

[–] WrongOnTheInternet@hexbear.net 2 points 1 month ago (1 children)

The last time Google did a media run about Deepmind finding bugs, it related to a vulnerability on an dev branch that hadn't been deployed yet (and was not likely to have been with the vulnerability).

[–] yogthos@lemmygrad.ml 2 points 1 month ago (1 children)

So it found a vulnerability in the code it was given. 🤷

[–] WrongOnTheInternet@hexbear.net 2 points 1 month ago* (last edited 1 month ago) (1 children)

I don't think anyone is suggesting that it is impossible for an LLM to find any vulnerabilities?

But right now we are specifically discussing the costs of a breach, and your post that I responded to specifically relied on a bug not being identified a person.

The discussion isn't whether an LLM can identify bugs, it's whether it can do so in a useful way. In the single previous example, it was not useful.

But similar to the last time, it is likely that the limited utility will only be known until well after the breathless reporting on how amazing AI is

[–] yogthos@lemmygrad.ml 2 points 1 month ago

In the example you provided, it found a vulnerability, which is useful, but they didn't point it at production code. The vulnerability might have been found by other tests and code reviews or it might have not been. The question of whether it's valuable or not really depends on what sort of code we're talking about and what the cost of missing a vulnerability would be.

All I'm saying here is that AI is just another tool that helps find bugs. People here freaking out over the idea that there might be legitimate uses for AI is kind of hilarious to be honest.

[–] TrashGoblin@hexbear.net 13 points 1 month ago (1 children)
[–] yogthos@lemmygrad.ml 2 points 1 month ago (1 children)

That article isn't referring to the specific system google is using, so we don't know what the false positive rate is.

[–] WrongOnTheInternet@hexbear.net 9 points 1 month ago (1 children)

Uh pretty high if it's an LLM

[–] yogthos@lemmygrad.ml 0 points 1 month ago (2 children)
[–] Orcocracy@hexbear.net 8 points 1 month ago (1 children)
[–] yogthos@lemmygrad.ml 0 points 1 month ago (1 children)

It really depends on how their particular system is set up. You're just making sweeping vibe based statements without any evidence to support them.

[–] Orcocracy@hexbear.net 7 points 1 month ago (1 children)

Yeah, like maybe this is one of those AIs that is actually just a guy in the Philippines being paid shit wages. Or maybe it’s a dumb LLM that makes lots of mistakes. Or maybe it’s all just bullshit from TechCrunch where an underpaid journalist is just recycling a fucking press release from Google and none of this actually happened anything like how it’s written.

[–] yogthos@lemmygrad.ml 0 points 1 month ago (1 children)

Or maybe new technology actually has valid applications despite the hype associated with it.

[–] Orcocracy@hexbear.net 5 points 1 month ago (1 children)

It’s not entirely impossible. But given the story is light on detail and the main source is Google PR it looks very much like a case of hypemongering.

[–] yogthos@lemmygrad.ml 1 points 1 month ago

I mean we'll see, in general stuff like finding vulnerabilities in large code bases seems like a good fit for this tech. All it's doing is making statistical inferences based on training, and this can help spot problems that would be hard to track down by hand.

[–] GaveUp@hexbear.net 2 points 1 month ago (1 children)

It's literally the 2nd paragraph lmao

Heather Adkins, Google’s vice president of security, announced Monday that its LLM-based vulnerability researcher Big Sleep found and reported 20 flaws in various popular open source software.

[–] yogthos@lemmygrad.ml 1 points 1 month ago

what specifically do you think this paragraph says lmao