World News
A community for discussing events around the World
Rules:
-
Rule 1: posts have the following requirements:
- Post news articles only
- Video links are NOT articles and will be removed.
- Title must match the article headline
- Not United States Internal News
- Recent (Past 30 Days)
- Screenshots/links to other social media sites (Twitter/X/Facebook/Youtube/reddit, etc.) are explicitly forbidden, as are link shorteners.
-
Rule 2: Do not copy the entire article into your post. The key points in 1-2 paragraphs is allowed (even encouraged!), but large segments of articles posted in the body will result in the post being removed. If you have to stop and think "Is this fair use?", it probably isn't. Archive links, especially the ones created on link submission, are absolutely allowed but those that avoid paywalls are not.
-
Rule 3: Opinions articles, or Articles based on misinformation/propaganda may be removed.
-
Rule 4: Posts or comments that are homophobic, transphobic, racist, sexist, anti-religious, or ableist will be removed. “Ironic” prejudice is just prejudiced.
-
Posts and comments must abide by the lemmy.world terms of service UPDATED AS OF OCTOBER 19 2025
-
Rule 5: Keep it civil. It's OK to say the subject of an article is behaving like a (pejorative, pejorative). It's NOT OK to say another USER is (pejorative). Strong language is fine, just not directed at other members. Engage in good-faith and with respect! This includes accusing another user of being a bot or paid actor. Trolling is uncivil and is grounds for removal and/or a community ban.
Similarly, if you see posts along these lines, do not engage. Report them, block them, and live a happier life than they do. We see too many slapfights that boil down to "Mom! He's bugging me!" and "I'm not touching you!" Going forward, slapfights will result in removed comments and temp bans to cool off.
-
Rule 6: Memes, spam, other low effort posting, reposts, misinformation, advocating violence, off-topic, trolling, offensive, regarding the moderators or meta in content may be removed at any time.
-
Rule 7: We didn't USED to need a rule about how many posts one could make in a day, then someone posted NINETEEN articles in a single day. Not comments, FULL ARTICLES. If you're posting more than say, 10 or so, consider going outside and touching grass. We reserve the right to limit over-posting so a single user does not dominate the front page.
We ask that the users report any comment or post that violate the rules, to use critical thinking when reading, posting or commenting. Users that post off-topic spam, advocate violence, have multiple comments or posts removed, weaponize reports or violate the code of conduct will be banned.
All posts and comments will be reviewed on a case-by-case basis. This means that some content that violates the rules may be allowed, while other content that does not violate the rules may be removed. The moderators retain the right to remove any content and ban users.
Lemmy World Partners
News !news@lemmy.world
Politics !politics@lemmy.world
World Politics !globalpolitics@lemmy.world
Recommendations
For Firefox users, there is media bias / propaganda / fact check plugin.
https://addons.mozilla.org/en-US/firefox/addon/media-bias-fact-check/
- Consider including the article’s mediabiasfactcheck.com/ link
view the rest of the comments
Haven't they clearly documented how they did it and what they used so that anyone can replicate it? Anyone with the compute power, which of course few have. But universities could do it.
So how is it not open source in this specific domain of problems? What would a LLM model need to do to be open source then? Duplicate the whole training dataset in a big zipfile for you to download?
From what I understand you could even replicate deepseek by replacing the "cold start" with latest deepseek instead.
They don't put up the actual code for their training pipeline though. It's more of a "if you have enough engineers, you can do this too" whitepaper, because they wouldn't want any rando training their own model.
Right now, even if you had the exact training set (which is a CRUCIAL part of an LLM and you can NOT replicate it without it), you couldn't rebuild the thing exactly, you'd need to do a whole lot of extra work.
You could call all proprietary software open source then. The UI and user manual describe what it does, you can do your own engineering to duplicate the functionality.
So its significantly closer to being 'open' in that qualified organizations can poke around with some of its innards and probably achieve something useful, in comparison to fully propietary models where that is either legally impossible or extremely, absurdly expensive.
Its sorta like ...
... a compiled game that requires you to fully reverse engineer a lot of it to be able to mod it at all, while also its use liscense states that doing that is illegal,
... a compiled game that is highly moddable via tools/apis and/or a significant portion of it that is well publically documented or just source code available,
... and then a truly totally open source, libre game.
Yeah, not totally open source.
But functionally and practically closer to it.
I can mod the shit out of Half Life 2 or New Vegas or CyberPunk or Kenshi, but not so much with ... I dunno, basically any live service game.
I still need those original core compiled exes for those games, but basically, many things I can fuck with relatively easily... and maybe if I really go nuts I can figure out how to hack or shim or hijack the exe to make NVSE or RedScript or ReKenshi or whatnot.
As compared to trying to mod HellDivers or Fortnite... near instant ban, most likely.
But also, if you don't know much about how to make a mod, well you're basically SoL in that department just the same for any kind of game, really.
Why even have this discussion? Self-learning algorithms appeared more than ten years ago. AI is being used very effectively in countless areas.
The idea that there is some sort of prize waiting for whomever gets the most computing power, is highly dubious.
Yet that idea is the entire basis of the US economy, at the moment.