
August 7, 2024

Why using generative AI is not eco-friendly… and what you can do about it.

What makes you ecologically responsible nowadays? Well, there are some no-brainers most of us have assimilated, like turning off the tap while brushing your teeth, buying locally grown vegetables and fruits, or using a bike or public transportation instead of a car (even if it’s electric). But that’s just the tip of the iceberg. Deep below the water line floats an immense body of carbon emissions that we unleash every time we interact with the digital world. The surge of generative AI, and the frenzy with which it has been hyped as a universal problem solver, is dramatically worsening the carbon footprint of the whole ICT (Information and Communications Technology) industry. Let’s try to quantify this and see what each of us can do to curb the energy consumption of AI.

Beware of the carbon footprint of your digital life

“How bad are bananas?” In this bestseller, first published in 2009, Mike Berners-Lee set out to educate us about the carbon footprint of… well, everything in our daily lives. In 2020, given the pace at which digital disruptions were happening, he published an update covering major innovations like Bitcoin and the Cloud. Since then, calculating the carbon footprint of human activities has become a favorite discipline of researchers, climate activists and journalists alike. A plethora of not always consistent figures circulates around the Web – which, ironically, does not serve the purpose of reducing carbon emissions.

Here is a comparison of the CO2 emissions of our digital activities, gleaned from several sources:

| Activity | CO2 emissions | Unit |
| --- | --- | --- |
| Browsing the internet | 0.8 g | per page |
| Sending a request to ChatGPT | 4.32 g | per request |
| Sending an email with attachment | 50 g | per email |
| TikTok consumption | 158 g | per hour |
| Video conference between 2 persons | 270 g | per hour |
| Video streaming on Netflix | 432 g – 1,682 g | per hour |

The carbon emissions of calling ChatGPT, compared to sending an email or watching a video, do not frighten you? Well, they should. It’s all about the sheer quantity of interactions happening with large language models every single minute. Continue reading!

Given global warming and what we know about how energy-hungry AI is, estimating the carbon footprint of AI should sit at the top of the agenda of Google, Meta and the other tech giants developing large language models or integrating them into their products. But it doesn’t. The tech giants do not share figures about the carbon footprint of training and using LLMs, and there are no standardized methods for measuring the emissions of AI either. Interestingly, we owe the first findings about the carbon footprint of LLMs to an AI startup called Hugging Face. Its team has researched both the emissions produced by training an LLM and those produced by running it for different tasks like text classification, summarization and image generation.

The first interesting finding is that your AI carbon footprint can vary a lot depending on your location: it will be much lower in a country like France, which uses a relatively clean power grid, than in regions like parts of the US or China that are still largely dependent on fossil energy. Including the emissions from manufacturing the computing infrastructure, Hugging Face’s researchers estimated the total emissions of their own LLM, Bloom, which was trained on a supercomputer in France, at 50 tons of CO2 – significantly less than similar LLMs such as Meta’s OPT (75 tons) and OpenAI’s GPT-3 (500 tons).

| Training an LLM | CO2 emissions |
| --- | --- |
| Bloom | 50 tons |
| OPT | 75 tons |
| GPT-3 | 500 tons |

We all knew that training an LLM consumes a lot of energy (see previous blog posts). The interesting discovery here is that where the LLM is trained has a huge impact on its carbon footprint. The biggest surprise from this research, however, comes as a knock-out: training the LLM does not account for the biggest part of its carbon footprint. Running it is the real energy guzzler.

The emissions for single tasks do not look frightening, though. Except for image generation, whose emissions skyrocket to 1.8 kg per 1,000 requests, all other tasks, including text summarization, remain well below 100 g per 1,000 requests:

Copyright: Luccioni, Jernite and Strubell – “Power Hungry Processing: Watts Driving the Cost of AI Deployment?”

Compared to 500 tons of CO2 for training a single model, 1.8 g for generating an image does not sound like a threat to our climate. The problem is not the seemingly low emissions of each single interaction with a generative AI. It is the fact that all major tech companies integrate generative AI capabilities into their systems, and that billions of interactions are performed every day by the users of Microsoft, Meta, Adobe, Salesforce and the like. Whether you write an email, draft a presentation, summarize a call, or create a social media post, your favorite applications now give you direct access to large language models for these and many other tasks. The more an LLM is used, the more likely its carbon footprint from inference will exceed that from training. For popular models like ChatGPT, with 10 million users every day, the model’s usage emissions are estimated to have exceeded its training emissions within a couple of weeks!
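A back-of-envelope calculation makes this concrete. The sketch below uses the figures quoted above (roughly 4.32 g of CO2 per request and 500 tons for training GPT-3) and assumes, purely for illustration, that 10 million daily users send one request each:

```python
# Back-of-envelope: how fast do inference emissions overtake training emissions?
# Per-request and training figures are taken from the tables above; the
# "one request per user per day" rate is an illustrative assumption.

TRAINING_EMISSIONS_G = 500 * 1_000_000   # GPT-3 training: ~500 tons of CO2, in grams
EMISSIONS_PER_REQUEST_G = 4.32           # ~4.32 g of CO2 per ChatGPT request
REQUESTS_PER_DAY = 10_000_000            # assume 10 million users, one request each

daily_inference_g = EMISSIONS_PER_REQUEST_G * REQUESTS_PER_DAY
break_even_days = TRAINING_EMISSIONS_G / daily_inference_g

print(f"Inference emits about {daily_inference_g / 1_000_000:.1f} tons of CO2 per day")
print(f"Inference overtakes training after about {break_even_days:.0f} days")
# -> roughly 43 tons per day, break-even after ~12 days
```

Even with these deliberately conservative assumptions, usage overtakes training in under two weeks.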

Is sending fewer requests to ChatGPT and refraining from generating yet another funny image for your social media post the solution to the exploding AI carbon footprint? Certainly, just as it would be wise to replace attachments with links in your emails or to limit the number of videos you upload to or watch on TikTok.

But the lion’s share of AI carbon emissions belongs to the companies training LLMs or leveraging them in their daily operations. If your organization is experimenting with generative AI, here is what you can do to keep your carbon emissions as low as possible.

How to surf the AI wave and preserve the environment

  1. Use a large language model only if it brings a significant benefit

Very often, LLMs are overkill. For example, developing an application to search your intranet or classify your emails does not require an LLM. Such applications can be built in-house with pre-trained models, or simply bought from specialized vendors. Also think twice before switching to an LLM just to gain one or two percentage points of accuracy. These extra points probably won’t justify using two or three times more power, will they?
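To illustrate how lightweight such an in-house solution can be, here is a minimal sketch of a classical text classifier built with scikit-learn; the emails and labels are made-up examples, and a real project would of course use its own data:

```python
# Minimal sketch of a lightweight text classifier - no LLM required.
# The tiny toy dataset below is purely illustrative.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

emails = [
    "Please find attached the invoice for March",
    "Your server is down, we need urgent support",
    "Invoice payment overdue, second reminder",
    "Cannot log in to the support portal",
]
labels = ["billing", "support", "billing", "support"]

# TF-IDF features + logistic regression: runs on a laptop, no GPU needed.
classifier = make_pipeline(TfidfVectorizer(), LogisticRegression())
classifier.fit(emails, labels)

print(classifier.predict(["Reminder: the invoice is still unpaid"]))
# -> ['billing']
```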

  2. Fine-tune existing models, rather than train new models from scratch

If you really think you need your own generative AI model, take an existing one and customize it to your specific domain. Look at open-source models, which your AI experts and data scientists can fine-tune. Researchers found that classifying movie reviews with an LLM consumes 30 times more energy than with a smaller model fine-tuned for this task. The reason is that LLMs are built to perform several tasks, like generating, classifying and summarizing text, instead of just one.
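As a rough sketch of what fine-tuning an existing model can look like, the example below adapts a small open-source model (distilbert-base-uncased) to movie-review classification with the Hugging Face transformers and datasets libraries. The choice of model, dataset, sample size and hyperparameters is purely illustrative:

```python
# Sketch: fine-tune a small pre-trained model instead of training one from scratch.
# Model, dataset and hyperparameters are illustrative choices, not a recommendation.
from datasets import load_dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

model_name = "distilbert-base-uncased"   # ~66M parameters, modest hardware is enough
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name, num_labels=2)

# IMDB movie reviews, subsampled to keep the example (and its energy bill) small
train = load_dataset("imdb")["train"].shuffle(seed=42).select(range(2000))

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, padding="max_length", max_length=256)

train = train.map(tokenize, batched=True)

args = TrainingArguments(output_dir="reviews-classifier",
                         num_train_epochs=1,
                         per_device_train_batch_size=16)
Trainer(model=model, args=args, train_dataset=train).train()
```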

  3. Assess the eco-friendliness of your cloud provider or data center

Find out where your computing infrastructure gets its power from: what is the proportion of renewable energy versus fossil fuels? In practice, deploying models in a region with a higher share of clean power can reduce operational emissions by 75%.
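The reasoning behind that figure is simple: operational emissions are roughly the energy your workload consumes multiplied by the carbon intensity of the local grid. The sketch below uses hypothetical carbon intensities and a hypothetical workload; your provider or grid operator can supply the actual numbers:

```python
# Sketch: operational emissions = energy consumed x grid carbon intensity.
# Intensities are illustrative orders of magnitude (g CO2 per kWh), not official figures.
GRID_INTENSITY_G_PER_KWH = {
    "low-carbon grid (e.g. mostly nuclear/hydro)": 60,
    "fossil-heavy grid": 500,
}

inference_energy_kwh_per_month = 10_000   # hypothetical monthly workload

for grid, intensity in GRID_INTENSITY_G_PER_KWH.items():
    tons = inference_energy_kwh_per_month * intensity / 1_000_000
    print(f"{grid}: {tons:.1f} tons of CO2 per month")
# -> 0.6 vs 5.0 tons: where you deploy dominates operational emissions
```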

  4. Track the CO2 emissions of your AI activity – and integrate them in your reporting

There are more and more tools to help you monitor the emissions of your AI activities. They can be included in your code at runtime to estimate your emissions and facilitate reporting. For more information, visit CodeCarbon, Green Algorithms, or ML CO2 Impact.
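For example, CodeCarbon provides an EmissionsTracker that can wrap any piece of Python code. The sketch below is minimal and the tracked workload is a placeholder; real usage would wrap your actual training or inference routine:

```python
# Minimal sketch using CodeCarbon to estimate the emissions of a workload.
# Install with: pip install codecarbon
from codecarbon import EmissionsTracker

def run_training():
    """Placeholder for your actual training or inference workload."""
    sum(i * i for i in range(10_000_000))

tracker = EmissionsTracker(project_name="llm-fine-tuning")  # writes emissions.csv by default
tracker.start()
try:
    run_training()
finally:
    emissions_kg = tracker.stop()   # estimated emissions in kg of CO2-equivalent
    print(f"Estimated emissions: {emissions_kg:.6f} kg CO2eq")
```

CodeCarbon also offers a track_emissions decorator if you prefer to annotate individual functions, and the resulting figures can feed directly into your sustainability reporting.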

I have a dream: Sustainable AI for everyone

As long as reporting on the AI carbon footprint remains optional, and as long as no international standards are established to structure and compare the different estimates of CO2 emissions, reducing the environmental impact of AI will remain a dream. But you and I can contribute to making this dream a reality. Today, I have refrained from sending a dozen requests to an AI image generator to get a striking illustration for this post – I chose a stock photo instead. Tomorrow, I will talk with a large US insurer that has decided to use Cortical.io SemanticPro for extracting and classifying information from documents – a system that needs much less computing power than LLMs to process documents. What about you?
