"They can't just help themselves to all the data they can get their hands on because they feel entitled to it for training! Wait..."
Technology
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related news or articles.
- Be excellent to each other!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
- Check for duplicates before posting, duplicates may be removed
- Accounts 7 days and younger will have their posts automatically removed.
Approved Bots
Pot. Kettle. Black.
Depending on where you live it's a good thing or a very bad thing.
It's just two parasites slinging mud at each other.
"They've stolen our rightfully stolen data!", said the spokesman.
Aww, did the serial copyright violator get copyright violated?
No, machine generated output is not eligible for copyright protection. https://en.wikipedia.org/wiki/Threshold_of_originality
Yeah, it wouldn't be copyright. It might be trade secrets, though. And trade secrets can be made out of public data, but arranged in a way that gives competitive advantage (for example, customer lists themselves might be trade secrets, even if each entry is a publicly available set of name/contact information/job title/company).
If a company voluntarily discloses a trade secret to a member of the public, it ceases to be a trade secret, so I doubt that would apply here either.
Depends on the agreement. Contracts (like EULAs) can cover a fair bit
EULAs aren't legally binding in sane countries.
Plagiarism machine plagiarizes plagiarism machine. Film at 11.
Anthropic, I mean this with the upmost sincerity:
No one gives a fuck.
If your competitor can put out a model that functions really similarly to yours for $2 less per month, and your entire userbase can just leave and move to them... explain to me why investors would want to pump hundreds of billions into your business to be 'first to market'? That's a really dumb thing to admit for Anthropic.
Who is 'first to 100 million users' is utterly irrelevant under a business model where your sole value is Intellectual Property (IP) and that IP can be "illicitly extracted" by a clever competitor without ever hacking into your nextwork or doing anything explicitly illegal.
I've had to explain this to a lot of people who seem to think Anthropic/OpenAI are incredibly valuable companies because "they'll make money long-term so long as they keep being pumped full of it investment cash to be the first to earn a big userbase", but that just doesn't make sense. OpenAI owns no datacenters...zero. Theyre 100% IP. Anthropic "is building" some datacenters, but they exist on paper only so far, so they're also presently 100% IP.
Can this obvious scam just collapse already so I can upgrade my PC without a personal loan?
I think your take is completely reasonable but I think the 'first to 100 million users' is actually noteworthy because if they can become entrenched and people become unwilling to learn anything else, they've won and can charge nearly whatever the fuck they want (at least in the medium term). See Microsoft and Adobe. They charge whatever they want for their subscription programs because what else are you going to do, use GIMP? Even in situations where the FLOSS alternative is legitimately good, a lot of people will still refuse to switch. I don't think Anthropic can survive long enough for them to become the only thing Susan from HR knows or is willing to use, but I think there's a path to profit somewhere here.
There’s nothing to “learn”. Using one of these is in no way different than using the other.
Unless you start using fancy little features that let you do things the others don’t do quite as well.
I'd argue that agentic AI by nature makes transition to a different model easy.
Yeah this is a key realization that I suspect most investors aren’t privy to. With proven viable local, accessible, scalable, and energy-efficient 2TB infiniband clusters and routed multi-agentic stacks of open source models constantly nipping at their heals, achieving longterm market dominance for any of these AI developers is simply a tenuous prospect.
The only legitimate option is to maintain a meaningful lead at the cutting edge of performance and/or offer a superior efficiency/value proposition via SLA guarantees. Beyond that, the brute force options are limited to things like short-term market manipulation (such as outbidding everyone else for existing talent pool, chip manufacturing capacity, etc) or suppression of competition via regulatory capture.
In every case, above or below board, there is no permanent longterm global breakaway strategy, only treading water as long as investors are willing to inject enough funds to temporarily outrun market efficiency.
Once that reality sinks in… pop.
See Microsoft and Adobe.
Except Microsoft and Adobe never bankrupted a company by getting adopted. It was a tax that companies could afford since they were still rounding errors compared to labor.
If the adoption of a tech can be measured as being roughly equal to higher than the labor expense of a company, that decision isn't going to be dictated by what Susan in HR knows.
Lol. Stupid thieving fucks whine that their stolen data gets copied?
I'm still on the side of treating AI development with more caution than less. So depending where you live this could be a very good thing or a very bad thing in the long haul.
Laws for thee
Alibaba picking up Anthropic's fair use strategy?
Edit: is there an argument for letting the US ruin its economy and environment to train all these models and then just swooping in before it turns into a mild madmaxian hellscape to distill and/or extract the knowledge? Beats having to do this on your own, doesn't it?
Yeah. Any Co2/other climate change regressions that the US makes affect everyone globally, and while water use is local, its also as-needed, so post-collapse you have to use up all your water anyway.
AI could use solely graywater/non-water cooling and renewable energies, and that's the answer, just takes slowing down, building specific and rigorous facilities, . Letting the US speed along just hurts everyone due to climate change.
That and every major company economically depends on each other, and disconnecting from the US in a way that doesn't cause backlash also takes time.
Fuck america but don't let them drill holes in the boat we're all riding.
You know what they say, no honor among thieves.
No honor among thieves.
It was not clear how exactly they extracted capabilities..using the service and making prompts?! If it was just that, that's bullshit. AI companies have no moat..besides trillion dollar investments.
Oh, thanks for letting me know. I am now going to subscribe to Alibaba Cloud and cancel my Anthropic subscription
It's literally in the name. Open fucking sesame 🤣.
Didn't they stole also from Alibaba? I read somewhere that if you asked in Chinese to Claude opus 4.8 which model was using the api (the web service injects an hidden prompt with bias), it replied it was based on Alibaba qwen 3.6
Well then Alibaba needs to get better at it cause the Qwen models have kinda sucked in my experience.
Do they? I only use local models on my GPU and my experience is that Qwen3.6 is so much better than Google's Gemma 4. I have no comparison to big models, because I refuse to use those. But friends told me that Claude and Co are doing pretty dumb things too while frying the planet
I've used local models and they just tend to screw up more often in my experience. But I'm also more focused on having agents do long running tasks which small models just aren't good at.