Is there any way to make it use less as it gets more advanced, or will there soon be huge power plants all over the world dedicated just to AI?

  • Em Adespoton@lemmy.ca · 23 hours ago

    Supercomputers once required large power plants to operate, and now we carry around computing devices in our pockets that are more powerful than those supercomputers.

    There’s plenty of room to further shrink the computers, simplify the training sets, formalize and optimize the training algorithms, and add optimized layers to the AI compute systems and the I/O systems.

    But at the end of the day, when training a system you can either simplify it or throw lots of energy at it.

    Just look at how much time and energy goes into training a child… and it’s using a training system that’s been optimized over hundreds of thousands of years (and is still being tweaked).

    AI as we see it today (generative AI, at least) is much simpler: it just sets up and executes probability sieves, with a fancy instruction parser feeding in the inputs. But it’s running on hardware that’s barely optimized for the task, and the way it processes data to produce an output is far from optimal.
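    To make the “probability sieve” idea concrete, here’s a toy sketch of what happens at each step of generation: the model produces scores over a vocabulary, and the decoder picks the most likely next token. The vocabulary and scores below are invented for illustration, not from any real model.

    ```python
    import math

    # Tiny made-up vocabulary; a real model has tens of thousands of tokens.
    vocab = ["cat", "dog", "sat", "mat"]

    def softmax(logits):
        # Convert raw scores into a probability distribution that sums to 1.
        m = max(logits)
        exps = [math.exp(x - m) for x in logits]
        total = sum(exps)
        return [e / total for e in exps]

    def next_token(logits):
        # Greedy decoding: take the highest-probability token.
        probs = softmax(logits)
        best = max(range(len(vocab)), key=lambda i: probs[i])
        return vocab[best], probs[best]

    token, p = next_token([0.1, 0.2, 2.5, 0.3])
    print(token)  # "sat" wins, since it has the highest score
    ```

    A real model repeats this sieve once per output token, which is where the per-response compute cost comes from.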

    • null_dot@lemmy.dbzer0.com · 16 hours ago

      > Supercomputers once required large power plants to operate, and now we carry around computing devices in our pockets that are more powerful than those supercomputers.

      This is false. Supercomputers never required large [dedicated] power plants to operate.

      Yes, they used a lot of power; yes, that has been reduced significantly; but it’s not at the same magnitude as AI.

    • BussyCat@lemmy.world · 22 hours ago

      It’s also a very large data set it has to go through. The average English speaker knows 40k-ish words, and the model has to pull from a large data set and attempt to predict the most likely word to come next, doing that a hundred or so times per response. Then most people want the result in a very short period of time and with very high accuracy (smaller tolerances on the convergence and divergence criteria), so sure, there is some hardware optimization that can be done, but it will always be at least somewhat taxing.
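      A quick back-of-envelope sketch of that point, using the comment’s own rough numbers (40k-word vocabulary, ~100 predictions per response): every output word means scoring the whole vocabulary once, so per-response work scales as vocabulary size times response length.

      ```python
      # Rough assumptions from the comment above, not measurements:
      vocab_size = 40_000        # ~ words an average English speaker knows
      words_per_response = 100   # ~ predictions needed for one reply

      # One "unit" of work = scoring one vocabulary entry for one position.
      scoring_ops = vocab_size * words_per_response
      print(f"{scoring_ops:,} vocabulary scorings per response")  # 4,000,000
      ```

      Real models score subword tokens rather than whole words, and each scoring step is itself a large matrix computation, so this understates the true cost; the scaling argument is the point.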