Assessing the environmental impact of large language models

Large language models like ChatGPT consume massive amounts of energy and water during training and after deployment. Learn how to understand and reduce their environmental impact.

By John Burke, Nemertes Research

Published: 11 Sep 2023

ChatGPT has made a splash across industries due to its ability to create humanlike, conversational dialogue.

But to produce the desired output, the LLMs behind generative AI applications require a tremendous amount of energy to train, develop and expand, which can have serious adverse effects on the environment. Explore where LLMs consume the most energy and methods to begin reducing their energy consumption and environmental impact.

The problem with LLMs

The environmental problems with LLMs stem from their size. The amount of power consumed by the current generation of LLMs is tied to the size of the models and of the data sets they are trained on. An LLM's size can be characterized in part by its number of parameters, the learned weights it uses in its inference operations. More parameters mean more data to move around and more computations to make use of that data.

Today's LLMs have orders of magnitude more parameters than earlier models. For example, Google's Bidirectional Encoder Representations from Transformers, or BERT, which achieved state-of-the-art performance when it was released in 2018, had 340 million parameters in its largest version. In contrast, GPT-3, the model that ChatGPT's GPT-3.5 was built on, has 175 billion.
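To put those two parameter counts side by side, here is a rough back-of-the-envelope sketch. The parameter figures are the ones quoted above; the 2 bytes per parameter (16-bit weights) is an assumption made for illustration, and the result covers only the memory needed to hold the weights, not training or serving overhead.

```python
import math

BERT_PARAMS = 340e6       # figure quoted in the article
LLM_175B_PARAMS = 175e9   # figure quoted in the article
BYTES_PER_PARAM = 2       # assumption: weights stored as 16-bit values


def weight_memory_gb(num_params, bytes_per_param=BYTES_PER_PARAM):
    """Approximate memory needed just to hold the weights, in gigabytes."""
    return num_params * bytes_per_param / 1e9


ratio = LLM_175B_PARAMS / BERT_PARAMS
orders = math.log10(ratio)

print(f"Parameter ratio: about {ratio:.0f}x ({orders:.1f} orders of magnitude)")
print(f"340M-parameter weights: about {weight_memory_gb(BERT_PARAMS):.2f} GB")
print(f"175B-parameter weights: about {weight_memory_gb(LLM_175B_PARAMS):.0f} GB")
```

Under these assumptions, the larger model has roughly 515 times as many parameters, and its weights alone take about 350 GB compared with well under 1 GB for the smaller model. Every one of those bytes has to be stored and moved each time the model runs.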

Paralleling parameter counts, the power necessary to train some LLMs has jumped by four to six orders of magnitude. Power consumption has become a significant consideration when deciding how much training to perform -- along with cost, as some LLMs cost millions of dollars to train.

Training cycles keep energy-hungry GPUs and CPUs fully occupied. Extensive computational loads, plus storing and moving massive amounts of data, contribute to large electrical draw and heavy heat output.

Heat load, in turn, means that more power goes toward cooling. Some data centers use water-based liquid cooling. But this method raises water temperatures, which can have adverse impacts on local ecosystems. Moreover, some water-based methods pollute the water used.

Compared with training, the power consumed by an individual inference for a deployed model can seem minuscule. But that comparatively tiny amount must be multiplied by the number of inferences run when using that model in production.

In addition, many deployed models can only be used for a short time -- weeks or months -- before the model needs to be retrained. Addressing the problem of model drift requires repeating steps from the original training process and consuming a similar amount of power.
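The interplay of training, inference volume and retraining can be sketched as simple arithmetic. The class below is a hypothetical illustration, not a measurement tool: every name and number in it is a placeholder assumption, and it treats each retraining cycle as a configurable fraction of the original training run.

```python
from dataclasses import dataclass


@dataclass
class ModelLifecycle:
    """Hypothetical energy model; all inputs are illustrative placeholders."""
    training_kwh: float            # energy for one full training run
    kwh_per_inference: float       # energy for a single inference
    inferences_per_day: float      # production request volume
    days_between_retrains: int     # how long the model stays usable before drift
    retrain_fraction: float = 1.0  # retraining energy relative to the first run

    def energy_kwh(self, days_in_service: int) -> dict:
        """Split lifetime energy into training, retraining and inference."""
        # One retraining cycle per completed drift interval.
        retrains = days_in_service // self.days_between_retrains
        retraining = retrains * self.retrain_fraction * self.training_kwh
        inference = (self.kwh_per_inference
                     * self.inferences_per_day
                     * days_in_service)
        return {
            "training": self.training_kwh,
            "retraining": retraining,
            "inference": inference,
            "total": self.training_kwh + retraining + inference,
        }

    def break_even_inferences(self) -> float:
        """Inferences needed before inference energy equals one training run."""
        return self.training_kwh / self.kwh_per_inference


# Placeholder values chosen only to show the shape of the calculation.
example = ModelLifecycle(
    training_kwh=1_000_000,
    kwh_per_inference=0.001,
    inferences_per_day=1_000_000,
    days_between_retrains=90,
)
for part, kwh in example.energy_kwh(days_in_service=365).items():
    print(f"{part:>10}: {kwh:,.0f} kWh")
```

With these made-up inputs, a year of service involves four retraining cycles, so retraining uses several times the energy of the original run. Plugging in measured values for a real deployment would show which of the three terms is worth attacking first.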

Reducing the environmental impact of language models

To address these problems, developers can reduce the size of their AI models and training operations.

LLMs are not the only kind of generative AI or natural language processing model. Smaller models, which have lower training costs and less significant environmental impacts, can perform nearly as well in many situations. For example, the smaller versions of Meta's Large Language Model Meta AI, or Llama, are small enough to run on a desktop, and researchers at Stanford University fine-tuned one of them into the Alpaca model for hundreds of dollars rather than millions.

Another way to reduce training costs over the full model lifecycle is one-shot or few-shot learning. With this technique, an already trained LLM learns to handle a new kind of input from one or a few examples and can then deal with similar inputs, without a full retraining cycle.
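One common form of this approach places the examples directly in the prompt rather than updating the model's weights. The sketch below only assembles such a prompt as text; the function name, the "Input:/Output:" layout and the sample task are illustrative assumptions, and it does not call any particular model or API.

```python
def build_few_shot_prompt(instruction, examples, new_input):
    """Assemble a prompt that shows the model a few worked examples.

    `examples` is a list of (input_text, expected_output) pairs. The layout
    is an arbitrary convention chosen for this sketch.
    """
    lines = [instruction, ""]
    for text, label in examples:
        lines.append(f"Input: {text}")
        lines.append(f"Output: {label}")
        lines.append("")
    # The final item has no output, so the model is asked to complete it.
    lines.append(f"Input: {new_input}")
    lines.append("Output:")
    return "\n".join(lines)


prompt = build_few_shot_prompt(
    instruction="Classify each support ticket as 'billing' or 'technical'.",
    examples=[
        ("I was charged twice this month.", "billing"),
        ("The app crashes when I open settings.", "technical"),
    ],
    new_input="My invoice shows the wrong amount.",
)
print(prompt)
```

Because the examples travel with each request, no training run is needed to teach the model the new task. The tradeoff is a longer prompt, which adds a small cost to every inference.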

In addition, enterprises can make their hardware more efficient by adopting different chip architectures and tools built for that hardware. The SpiNNaker2 chip architecture, for example, emulates biological neural networks by supporting locally dense computation across a sparsely active network. That is, wherever nothing is currently happening in the neural net, the chips consume almost no power. Despite being built on larger transistors, the chip architecture consumes much less power than most current CPUs and GPUs while accomplishing a similar amount of computational work.
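The idea of doing work only where the network is active can be illustrated in a few lines of plain Python. This is a conceptual sketch, not a model of SpiNNaker2 or its programming interface: the two functions are hypothetical, and the count of multiply-add operations serves only as a rough stand-in for energy use.

```python
def dense_layer(weights, activity):
    """Visit every connection, whether or not the input neuron is active."""
    outputs = [0.0] * len(weights[0])
    ops = 0
    for i, a in enumerate(activity):
        for j, w in enumerate(weights[i]):
            outputs[j] += a * w
            ops += 1
    return outputs, ops


def event_driven_layer(weights, activity):
    """Do work only for input neurons that are currently active."""
    outputs = [0.0] * len(weights[0])
    ops = 0
    for i, a in enumerate(activity):
        if a == 0:
            continue  # silent neuron: no computation at all
        for j, w in enumerate(weights[i]):
            outputs[j] += a * w
            ops += 1
    return outputs, ops


# 8 input neurons feeding 4 output neurons; only 2 inputs are active.
weights = [[float((i + j) % 3 - 1) for j in range(4)] for i in range(8)]
activity = [0, 0, 1, 0, 0, 0, 1, 0]

dense_out, dense_ops = dense_layer(weights, activity)
sparse_out, sparse_ops = event_driven_layer(weights, activity)

print("Same result:", dense_out == sparse_out)
print("Dense operations:", dense_ops)
print("Event-driven operations:", sparse_ops)
```

Both functions produce the same outputs, but the event-driven version performs 8 multiply-adds instead of 32 because it skips the six silent inputs. The sparser the activity, the larger the saving.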

To deliver sustainable tools and continue to make profits, AI companies need to shift quickly to more efficient technologies and practices. Customers and prospects should hold vendors' feet to the fire on their environmental impacts and demand mitigation plans in the near future.
