The death of Stack Overflow is one of those cases where the site has been completely killed by AI, and yet its content is exactly what AI needs in order to learn how to solve programming problems. Its death will mark the end of AI's ability to learn how to solve programming issues. AI is cannibalizing itself in the process: as it destroys its sources, it destroys its own ability to learn.
It's not just that, it's shitting where it eats. People are using it to fill the internet with disinformation, then it trains itself on its own disinformation, and breeds even worse disinformation. This is why AI can never be smarter than it was in 2021.
On top of that, due to the indiscriminate DDoSing of the entire internet by AI bots, websites have been blocking any web crawlers that are not Google, which just contributes to Google's monopoly.
Friends don't let friends use Google.
I can't remember the name, but when the internet was just starting and there were a lot of search engines with no dominant ones, there was an aggregator program that you could load many search engines into and then use as your search tool. It would query all the engines and combine, sort, and rank the results, removing duplicate finds.
Edit: more specific - It was much like an FTP or torrent program but you'd load up what search engines to use and your search words, and it would actively pull the info then provide a single page with all results.
The reason I mention it is because we're sort of back at that point. Google is failing, Bing never was great, and all the alternatives have their issues, usually that they don't have as large an index to work with. So if you gathered all the best ones, the ones without ties to corporations or AI, then put their results together, maybe you'd have something like what Google was at its peak before "don't be evil" got painted over.
Incidentally, Google became what it was/is because it gobbled up a lot of those early search engines' databases. I miss you, Hotbot. You were a good one.
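For anyone curious, the aggregator idea above (take results from several engines, combine, rank, and drop duplicates) can be sketched in a few lines of Python. This is only a rough sketch, not how that old program actually worked: the `normalize` and `merge_results` names are made up for illustration, the engine names are placeholders, fetching results from real engines is left out entirely, and reciprocal rank fusion is just one assumed way to score links that several engines agree on.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def normalize(url: str) -> str:
    """Reduce a URL to a rough key so near-identical links count as one hit.

    Assumption: scheme, a leading "www." and a trailing slash do not matter.
    """
    parts = urlsplit(url.strip())
    host = parts.netloc.lower()
    if host.startswith("www."):
        host = host[4:]
    path = parts.path.rstrip("/")
    query = "?" + parts.query if parts.query else ""
    return host + path + query


def merge_results(results_by_engine: dict[str, list[str]], k: int = 60) -> list[str]:
    """Merge ranked URL lists from several engines into one de-duplicated list.

    Scoring is reciprocal rank fusion: each engine adds 1 / (k + rank) to a
    link's score, so links found near the top of several engines rise.
    """
    scores: dict[str, float] = defaultdict(float)
    first_seen: dict[str, str] = {}
    for urls in results_by_engine.values():
        seen_here = set()
        for rank, url in enumerate(urls, start=1):
            key = normalize(url)
            if key in seen_here:
                continue  # same engine listed the same page twice
            seen_here.add(key)
            scores[key] += 1.0 / (k + rank)
            first_seen.setdefault(key, url)  # keep the first spelling we saw
    # Highest score first; ties broken alphabetically by key for stable output.
    ranked = sorted(scores, key=lambda key: (-scores[key], key))
    return [first_seen[key] for key in ranked]


if __name__ == "__main__":
    # Hypothetical result lists; a real tool would fetch these from each engine.
    sample = {
        "engine_a": ["https://www.example.com/civic-clutch/", "https://forum.example.org/t/1"],
        "engine_b": ["http://example.com/civic-clutch", "https://other.example.net/x"],
    }
    for link in merge_results(sample):
        print(link)
```

The link both engines agree on ends up first, and each page shows up only once no matter how many engines returned it.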
Search used to be so good. I had an old Honda Civic that suddenly wouldn't start. It wasn't the starter, alternator, or battery. I managed to find a forum post with my exact issue, which was that a small rubber piece on the clutch pressed a button to "tell" the starter it was okay to start. Twenty minutes later I had zip-tied a piece of plastic into place and had a working car again.
If I tried to diagnose that same issue today, it'd be dozens of SEO garbage slop sites without any actual useful information.
They are literally walling off all this information that used to be public and easy to access. It’s our data that we the people decided to share with the world, and these rent-seeking corporations are hiding it away so they can start charging us "tokens" to access our own public information.
I was thinking the same thing recently. It's not the place it once was. But in general the internet has changed a lot. And it's not just AI.
- All sorts of paywalls especially in news sites.
- Everything is getting centralized into a few sites and they're usually either poorly indexable or not indexable at all (Discord, Facebook, X, Instagram and so on)
- Fediverse (Lemmy, Mastodon) also struggles with search engines.
- People trying to sell you shit and build a brand, even more than before. Because of this, all sorts of SEO crap gets done, like writing BS articles nobody cares about.
- AI slop.
- Search engines have gotten better at getting rid of "illegal stuff".
- A lot of sites are just presentational bloat with no substance. Very cool looking landing pages with all sorts of cool animations but when you need to actually find the information that you need... the same UI usually gets in the way.
Oh, and now we're getting into age verification crap also, yay.
He means Reddit mostly. An AI slop generator, training on slop like Reddit, with a little plagiarizing from authors and artists.
I called to schedule a play date at my local dog daycare/boarding, and it went to an AI answering service. I asked if it was AI since I could hear noise in the background (literal fake background chatter and noise). When she said yes, I hung up. SO tired of AI everywhere. Fuck it all.
lmao, I bet they trained the stupid thing on recordings from massive call centers. It probably thinks that all the 'background' noise is just part of how humans communicate.
it probably thinks
It doesn't think. People need to stop anthropomorphizing the statistical probability machine.
Ah yes, Reddit, the most well-mannered and measured of social media platforms, which never indulges in spreading hate and misinformation.
Which are spread by the same actors as on the other social media: Russia, Israel, Palantir, and the US. India/Sri Lanka is also getting its hands in, likely being paid as an intermediary for the others mentioned above, so as to obfuscate who the actual actors behind the misinformation are.
what a wonderful world, eh?
It is normal to glue pepperoni to your pizza to keep it from sliding off when baking.
facts, bro!
Google has been cannibalising the net for a decade or so now
I... Is that baby eating a human?
Perhaps
More Perfect Union did a video on Google's descent into evil. I think it's this one
TLDW: Once Google pivoted from being a search service to an advertising agency, it was motivated to keep users from hyperlinking away from Google, and so offered summaries and alternatives controlled by Alphabet that allowed it to keep offering you ads.
So this AI service is just a natural iteration.
I don't understand how these companies think they're being so smart by picking some new niche of scraped data to train AI on in a bid to make it "smart"....
Has any other living being become "smart" by only ingesting information directly from the Internet? You can train other animals to perform many tasks and can probably say they are smart when they perform them as expected. I doubt any of the training methods is to tape headphones, a screen and sometimes a microphone to their faces forever (I kinda don't wanna know if this is false 😶).
The best example we have is ourselves, and even though we use the Internet, babies are not taught how to walk and talk by only interacting with the Internet.
I feel like I might be saying too much, but I think the best AI we're gonna get is to unplug it from the Internet, and then fucking raise it for 20 years like a normal, super fast-thinking child prodigy. Then just make copies of that and train further by having it go to school for the things needed.

It's the same arc every monopolistic corporation has taken before it. AI is just accelerating the pace of consuming your customer/product because profits must always increase.
There will be no large-scale shift away from these experiences because most people are either OK with, apathetic about, or blissfully ignorant of the situation. The best you can do is remove yourself from the exploitation of the userbase: Linux instead of Windows or Android, almost any search engine other than Google, the fediverse instead of Reddit, etc.
It's a self-defeating strategy: as more people turn to AI, less content gets produced, so AI becomes static.
I truly believe the token model will kill AI. It will become too expensive.
It already is too expensive, and adding more compute doesn't make it cheaper lol. It just causes a race to the bottom among data center providers and an eventual crash there too.
"There is no cannibalism in AI! And when I say 'none,' I mean there is a certain amount."