Today, all large AI companies are placing their bets on a brute-force approach. Yet throwing huge amounts of data at machine learning algorithms and deploying massive processing power is neither efficient nor future-proof. AI needs to become much smarter and orders of magnitude more efficient if we want to avoid another winter.
I am sure you have all seen charts comparing the energy consumption of Bitcoin to that of smaller nations. Currently, transactions on this most popular of blockchain projects consume more energy than Greece, a country of 10 million people. The outcry on social media is huge, and rightly so, at a time when Russia’s attack on Ukraine is creating great uncertainty on the energy markets and the climate crisis demands that we use energy sustainably.
To a certain extent, the AI industry can be thankful for the distraction Bitcoin is providing, because its own energy balance might be even worse. Research carried out at the University of Massachusetts, Amherst, indicates that “training a single AI model can emit as much carbon as five cars in their lifetimes”. That is one model. The MIT Technology Review article that reported on the study adds that “final, paper-worthy models require training almost 5,000 models in total”. Now do the math on climate impact.
The race for larger AI models goes hand in hand with a race for more computing power – leading to the creation of ever more powerful supercomputers. These machines not only need a lot of space; they also require millions of gallons of water for cooling and consume tremendous amounts of power. According to a 2018 article in Nature, data centers use an estimated 200 terawatt-hours (TWh) of electricity each year, and that figure has almost certainly grown since then.
Yes, advances made by OpenAI with GPT-3 or by Google with BERT are impressive – but theirs is not an approach we can sustain. So what now? The modern dream of creating “intelligent” machines has been around since the 1950s. We have seen phases of progress followed by AI winters whenever we hit dead ends. Is the current brute-force push leading us into another AI winter?
Short answer: Yes. We need a different approach to building and training AI models in the age of sustainability. One that is not just clever but genuinely intelligent, the way evolution’s solutions tend to be. This means that, this time, we should design learning systems that actually work like the human brain, not like a naive over-simplification of it.
An inspiration for what this paradigm shift could look like can be found in the history of Big Pharma. Fifty years ago, developing new medication was not unlike today’s approach to AI: pharmaceutical companies were testing the influence of vast numbers of plant samples, gathered in the rainforests of the world, on a host of different pathologies – blood pressure, cholesterol levels, infections, inflammations and even cancer cells – to find out which natural molecule might produce a statistically relevant improvement of the condition. Researchers were used to “brute-forcing” medical progress, often without necessarily understanding the theoretical functioning of the underlying biological mechanisms. Today, molecular biology has clarified most of the important metabolic pathways and relevant cellular receptors to an unimaginable level of detail, allowing modern pharmaceutical research to replace millions of cross-matching experiments with molecular CAD systems able to model, simulate and synthesize virtually any substance, efficiently targeting a specific clinical goal by design.
Over the last decades, neuroscience has given us a much better understanding of the principles behind the human brain. No, we still cannot map it out in every detail, and we have not understood all of its fine mechanics, but by now we know enough about the computational principles of the neocortex to try to create a better AI. Why should we? Because the brain needs a mere 20 watts to outperform the best AI models.
So far, efforts to produce useful AI models have mainly focused on improving their precision relative to human performance. But anyone who has tried to put such an AI system into practice will quickly have discovered that the actual limiting factor is efficiency rather than accuracy. So the question we need to ask is: can we actually afford the required precision, given its costs in energy, training data sourcing and computation?
Cortical.io is working on just that – better, more efficient (in all ways) natural language understanding (NLU) models inspired by actual neuroscience. We turned our first breakthroughs into business products – but there is still a whole universe of neuro-semantics to explore, to transfer to the AI community, and a long path ahead to educate the market and to generalize the use of efficient semantic models for sustainable language-AI software.
Where we are today:
We have formulated the “Semantic Folding” theory, a machine learning methodology for creating semantic models with unsupervised training on very limited amounts of reference data. The methodology aims to functionally replicate the computational principles discovered in the human neocortex. Semantic Folding introduces a new way of representing information based on sparse distributed representations (SDRs), called a Semantic Fingerprint. Semantic fingerprints capture the semantics (actual meanings) of words, sentences and paragraphs in context and make it possible to reach high levels of accuracy and efficiency when applying computational operators to text.
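To make the idea of overlap-based semantic fingerprints more concrete, here is a minimal, purely illustrative Python sketch. The fingerprint size, sparsity, word positions and helper functions are invented for demonstration; this is not Cortical.io’s actual implementation of Semantic Folding.

```python
# Purely illustrative sketch of sparse distributed representations (SDRs):
# a "semantic fingerprint" is modeled as the set of active positions on a
# flattened semantic map. All sizes, positions and words below are invented.

FINGERPRINT_SIZE = 16_384   # e.g. a 128 x 128 semantic map, flattened
TARGET_SPARSITY = 0.02      # only ~2% of positions are active at once

# Toy word fingerprints: sets of active map positions.
apple  = {12, 87, 450, 451, 902, 3004, 7781}
orange = {12, 87, 450, 515, 902, 4188, 9900}
car    = {33, 1290, 2004, 6100, 8800, 15002}

def similarity(a: set, b: set) -> float:
    """Semantic similarity as the normalized overlap of active positions."""
    return len(a & b) / min(len(a), len(b))

def text_fingerprint(word_fps: list) -> set:
    """Aggregate word fingerprints into a sentence/paragraph fingerprint by
    taking their union (a real system would re-sparsify the result)."""
    result = set()
    for fp in word_fps:
        result |= fp
    return result

print(similarity(apple, orange))  # high overlap -> semantically close
print(similarity(apple, car))     # no overlap   -> semantically distant
```

Part of the efficiency appeal of such representations is that comparing meanings reduces to cheap set operations on sparse binary structures instead of dense floating-point arithmetic.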
We have created a product based on Semantic Folding called “SemanticPro”, which showcases the quality of our approach to intelligent document processing. SemanticPro can analyze high volumes of messages and complex documents in much the same way humans do – but incomparably faster. Thanks to the underlying technology, SemanticPro requires an order of magnitude less training data than other deep learning solutions: 100 reference documents – e.g. contracts, data sheets or emails – are sufficient to train SemanticPro for a new use case. This product is already live in large enterprises around the world.
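To give an intuition for why fingerprint-based representations can get by with so few labeled examples, here is a toy nearest-centroid classifier built on the sketch above. It is not the SemanticPro API; the function names, the 300-position cutoff and the data format are assumptions made purely for demonstration.

```python
# Toy document classifier over semantic fingerprints (sets of active
# positions). NOT the SemanticPro API: names, the 300-position cutoff and
# the (label, fingerprint) data format are invented for illustration.

from collections import Counter, defaultdict

def train(labeled_docs):
    """labeled_docs: iterable of (label, fingerprint) pairs, e.g. ~100
    annotated contracts or emails. Builds one 'class fingerprint' per label
    from the positions active in most examples of that class."""
    counts = defaultdict(Counter)
    for label, fp in labeled_docs:
        counts[label].update(fp)
    return {label: {pos for pos, _ in counter.most_common(300)}
            for label, counter in counts.items()}

def classify(fp, class_fps):
    """Assign the label whose class fingerprint overlaps the document most."""
    return max(class_fps, key=lambda label: len(fp & class_fps[label]))
```

Because every training document contributes only a small set of active positions, a class profile of this kind stabilizes after comparatively few examples – which is the intuition behind training on roughly a hundred reference documents rather than millions of samples.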
We are working on combining our Semantic Folding-based algorithms with dedicated high-performance hardware in order to speed up the processing of large volumes of text by orders of magnitude, thus reducing the computing resources needed to perform intelligent document processing at scale.
Semantic Folding can be applied to any language. Given the small amount of training data needed, models of equal quality can be developed for languages spoken by smaller communities.
These are first steps that need to find an echo in the AI ecosystem to truly make a difference.
There are so many areas where consumers and businesses alike will benefit from efficient NLU models – machine translation and speech recognition, to name just two areas where current solutions consume far too many computing resources while delivering mixed results (or are you happy with the way Siri and Alexa understand your requests?). Given the vast amounts of data needed – again, brute-force approaches are based on millions or billions of samples and still require thousands of samples to fine-tune for any given application – and the even vaster energy demand, mass adoption cannot and must not be attempted with current brute-force approaches. The AI community needs to embrace more efficient approaches like Semantic Folding. If we continue down the current path, an AI winter might not even be one of our biggest problems.