Cool, now how much power was consumed before even a single prompt was ran in training that model, and how much power is consumed on an ongoing basis adding new data to those AI models even without user prompts. Also how much power was consumed with each query before AI was shoved down our throats, and how many prompts does an average user make per day?
Technology
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related news or articles.
- Be excellent to each other!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
- Check for duplicates before posting, duplicates may be removed
- Accounts 7 days and younger will have their posts automatically removed.
Approved Bots
I did some quick math with metas llama model and the training cost was about a flight to Europe worth of energy, not a lot when you take in the amount of people that use it compared to the flight.
Whatever you're imagining as the impact, it's probably a lot less. AI is much closer to video games then things that are actually a problem for the environment like cars, planes, deep sea fishing, mining, etc. The impact is virtually zero if we had a proper grid based on renewable.
Please show your math.
One Nvidia H100 DGX AI server consumes 10.2kW at 100% utilization, meaning that ~~one hour’s~~ 42 day's use of one server is equivalent to the electricity consumption of the average USA home in one year. This is just a single 8-GPU server; it excludes the electricity required by the networking and storage hardware elsewhere in the data center, let alone the electricity required to run the facility’s climate control.
xAI alone has deployed hundreds of thousands of H100 or newer GPUs. Let’s SWAG 160K GPUs = ~20K DGX servers = >200MW for compute alone.
H100 is old. State of the art GB200 NVL72 is 120kW per rack.
Musk is targeting not 160K, but literally one million GPUs deployed by the end of this year. He has built multiple new natural gas power plants which he is now operating without any environmental permits or controls, to the detriment of the locals in Memphis.
This is just one company training one typical frontier model. There are many competitors operating at similar scale and sadly the vast majority of their new capacity is running on hydrocarbons because that’s what they can deploy at the scale they need today.
I should have specified it was an earlier llama model. They have scaled up to more then a flight or two. You are mostly right except for how much a house uses. It's about 10,500 kW per year, you're off by a thousand. It uses in an hour about 8 hours of house time, which is still a lot though, specially when you consider musks 1 million gpus.
Their first model took 2 600 000 kwh, a plane takes about 500 000. The actual napkin math was 5 flights. I had done the math like 2 years ago but yeah, I was mistaken and should have at least specified it was for their first model. Their more recent ones have been a lot more energy intensive I think.
Thanks for catching, you are right that the average USA home is 10.5MWh/year instead of kWh. I was mistaken. :)
Regarding the remainder, my point is that the scale of modern frontier model training, and the total net-new electricity demand that AI is creating is not trivial. Worrying about other traditional sources of CO2 emissions like air travel and so forth is reasonable, but I disagree with the conclusion that AI infrastructure is not a major environmental and climate change concern. The latest projects are on the scale of 2-5GW per site, and the vast majority of that new electricity capacity will come from natural gas or other hydrocarbons.
If their energy consumption actually was so small, why are they seeking to use nuclear reactors to power data centres now?
Because demand for data centers is rising, with AI as just one of many reasons.
But that's not as flashy as telling people it takes the energy of a small country to make a picture of a cat.
Also interesting that we're ignoring something here -- big tech is chasing cheap sources of clean energy. Don't we want cheap, clean energy?
Sir we do not make reasonable points in here, you’re supposed to hate AI irrationally and shut up.
AI is the driver of the parabolic spike in global data center buildouts. No other use case comes close in terms of driving new YoY growth in tech infra capex spend.
Didn’t xitter just install a gas powered data center that’s breaking EPA rules for emissions?
Yes, yes it did. And as far as I can tell, it's still belching it out, just so magats can keep getting owned by it. What a world
Sure we do. Do we want the big tech corporations to hold the reins of that though?
To be fair, nuclear power is cool as fuck and would reduce the carbon footprint of all sorts of bullshit.
Because the training has diminishing returns, meaning the small improvements between (for example purposes) GPT 3 and 4 will need exponentially more power to have the same effect on GPT 5. In 2022 and 2023 OpenAI and DeepMind both predicted that reaching human accuracy could never be done, the latter concluding even with infinite power.
So in order to get as close as possible then in the future they will need to get as much power as possible. Academic papers outline it as the one true bottleneck.
Volume of requests and power consumption requirements unrelated to requests made, at least I have to assume. Certainly doesn't help that google has forced me to make a request to their ai every time I run a standard search.
Seriously. I'd be somewhat less concerned about the impact if it was only voluntarily used. Instead, AI is compulsively shoved in every nook and cranny of digital product simply to justify its own existence.
The power requirement for training is ongoing, since mere days after Sam Altman released a very underehelming GPT-5, he begins hyping up the next one.
I also never saw a calculation that took into amount my VPS costs. The fckers scrape half the internet, warming up every server in the world connected to the internet. How much energy is that?
I'd like to understand what this math was before accepting this as fact.
I usually liken it to video games, ya. Is it worse that nothing? Sure, but that flight or road trip, etc, is a bigger concern. Not to mention even before AI we've had industrial usage of energy and water usage that isn't sustainable... almonds in CA alone are a bigger problem than AI, for instance.
Not that I'm pro-AI cause it's a huge headache from so many other perspectives, but the environmental argument isn't enough. Corpo greed is probably the biggest argument against it, imo.
In total, the median prompt—one that falls in the middle of the range of energy demand—consumes 0.24 watt-hours of electricity, the equivalent of running a standard microwave for about one second. The company also provided average estimates for the water consumption and carbon emissions associated with a text prompt to Gemini.
There are zero downsides when mentally associating an energy hog with "1 second of use time of the device that is routinely used for minutes at a time."
With regard to sugar: when I started counting calories I discovered that the actual amounts of calories in certain foods were not what I intuitively assumed. Some foods turned out to be much less unhealthy than I thought. For example, I can eat almost three pints of ice cream a day and not gain weight (as long as I don't eat anything else). So sometimes instead of eating a normal dinner, I want to eat a whole pint of ice cream and I can do so guilt-free.
Likewise, I use both AI and a microwave, my energy use from AI in a day is apparently less than the energy I use to reheat a cup of tea, so the conclusion that I can use AI however much I want to without significantly affecting my environmental impact is the correct one.
You should probably not eat things because of how much calories they have or don't have, but because of how much of their nutrients you need, and how much they lack other, dangerous shit. Also eat slowly until you're full and no more. Also move a lot.
We shouldn't need calculators for this healthy lifestyle.
The reason for needing to know which foods are healthy is because... well, we forgot.
I'm not saying that ice cream is healthier than a normal dinner, just that if I really crave something sweet then the cost to my health of eating it periodically is actually quite low, whereas the cost of some other desserts (baked sweets are often the worst offenders) is relatively high. That means that a lot can be gained simply by replacing one dessert with a different, equally tasty dessert. Hence my ice cream advocacy.
This doesn't really track with companies commissioning power plants to support power usage of AI training demand
They want to handle lots of prompts.
The article also mentions each enquiry also evaporates 0.26 of a milliliter of water... or "about five drops".
In addition:
This report was also strictly limited to text prompts, so it doesn’t represent what’s needed to generate an image or a video.
This feels like PR bullshit to make people feel like AI isn't all that bad. Assuming what they're releasing is even true. Not like cigarette, oil, or sugar companies ever lied or anything and put out false studies and misleading data.
However, there are still details that the company isn’t sharing in this report. One major question mark is the total number of queries that Gemini gets each day, which would allow estimates of the AI tool’s total energy demand.
Why wouldn't they release this. Even if each query uses minimal energy, but there are countless of them a day, it would mean a huge use of energy.
Which is probably what's happening and why they're not releasing that number.
That's because it is. This is to help fence riders feel better about using a product that factually consumes insane amounts of resources.
The company has signed agreements to buy over 22 gigawatts of power from sources including solar, wind, geothermal, and advanced nuclear projects since 2010.
None of those advanced nuclear projects are yet actually delivering power, AFAIK. They're mostly in planning stages.
The above isn't all to run AI, of course. Nobody was thinking about datacenters just for AI training in 2010. But to be clear, there are 94 nuclear power plants in the US, and a rule of thumb is that they produce 1GW each. So Google is taking up the equivalent of roughly one quarter of the entire US nuclear power industry, but doing it with solar/wind/geothermal that could be used to drop our fossil fuel dependence elsewhere.
How much of that is used to run AI isn't clear here, but we know it has to be a lot.
The real question is why anyone would want to use more power than a regular search engine to get answers that might confidently lie to you.
Google processes over 5 trillion search queries per year. Attaching an AI inference call to most if not all of those will increase electricity consumption by at least an order of magnitude.
Edit: using their own 0.24Wh number, that equates to 1.2 billion kWh per year, or about the equivalent of 114,285 USA homes.
if it's Google that they would use us the search engine, search results are turning to shit. it just often doesn't show you the relevant stuff. The AI overview is wrong. Ads sometimes take up the entire first page of results. so I see why someone would just want to show a question into the void and get a quick response instead of having to sort through five crappy results, after filtering that down from 15 possibly relevant ones
There were people estimating 40w in earlier threads on lemmy which was ridiculous.
This seems more realistic.
median prompt size
Someone didn't pass statistics, but did pass their marketing data presention classes.
Wake me up when they release useful data.
It is indeed very suspicious that they talk about "median" and not "average".
For those who don't understand what the difference is, think of the following numbers:
1, 2, 3, 34, 40
The median is 3, because it's in the middle.
The average is 16 (1+2+3+34+40=80, 80/5=16).
Now do training centers, since it's obvious they are never going to settle on a final model as they pursue the Grail of AGI. I could do the exact same comparison with my local computer and claim that running a prompt only uses X amount of watts because the GPU heats up for a few seconds and is done. But if I were to do some fine tuning or other training, that fan will stay on for hours. A lot different.
Nice share! Mistral also shared data about one of its largest model (not the one that answer in LeChat, since that one is Medium, a smaller model, that I guess has smaller energetic requirements)
https://mistral.ai/news/our-contribution-to-a-global-environmental-standard-for-ai
Let’s see OpenAI’s numbers