submitted 1 week ago* (last edited 1 week ago) by AtmosphericRiversCuomo@hexbear.net to c/technology@hexbear.net

They fine-tuned a Llama 13B LLM with military-specific data, and claim it works as well as GPT-4 for those tasks.

Not sure why they wouldn't use a more capable model like 405B though.

Something about this smells off to me. Maybe a way to stimulate defense spending around AI?

top 3 comments
ProletarianDictator@hexbear.net 5 points 6 days ago

Incoming meltdown and export restrictions on transformer models?

Seems like something the US would hype up into a new red scare tool. So many incentives line up here that I could see it happening, no matter how stupid.

supafuzz@hexbear.net 12 points 1 week ago

"works as well as GPT-4"

that's a pretty vicious self-own

JoeByeThen@hexbear.net 11 points 1 week ago

Looking past all the red scare/AI bullshit, it's probably a nothingburger. Researchers funded by the Chinese equivalent of DARPA doing something they thought would be cool.

this post was submitted on 01 Nov 2024
20 points (100.0% liked)
