
A new brain model of language

Hierarchical Temporal Memory including Cortical Learning Algorithms

Semantic Folding Theory
and its application in Semantic Fingerprinting

A Cortical.io White Paper Version 1.0
Author: Francisco E. De Sousa Webber

Natural Language Understanding inspired by neuroscience

Language Intelligence

With Semantic Folding:

• Words, sentences and whole texts can be compared to each other
• NLP tasks like classification and semantic search are highly efficient
• The system is trained in a fully unsupervised manner
• No need for large language models or expensive computing resources

Taking the Hierarchical Temporal Memory (HTM) theory, a computational theory of the human cortex developed by Numenta, as a starting point, Cortical.io has developed Semantic Folding, a corresponding theory of language representation.

Semantic Folding describes a method of converting text into a semantically grounded representation called a semantic fingerprint. Semantic fingerprints are Sparse Distributed Representations (SDR) of words: large binary vectors that are very sparsely filled, with every bit representing distinct semantic information.
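
To make the shape of such a vector concrete, the following minimal Python sketch models a fingerprint as the set of its active bit positions; the 16,384-bit size is the one described further below, while the specific active positions and the roughly 2% sparsity are illustrative assumptions, not values taken from this page.

    # Minimal sketch of a Sparse Distributed Representation (SDR) for a word.
    # The 16,384-bit size matches the fingerprint size described below;
    # the ~2% sparsity is an assumed, typical value, not taken from this page.

    FINGERPRINT_BITS = 128 * 128          # 16,384 positions in total

    # A fingerprint is conveniently stored as the set of *active* bit positions,
    # because only a small fraction of the bits are ever set.
    word_fingerprint = {12, 407, 1033, 2048, 9001, 15360}   # toy example

    sparsity = len(word_fingerprint) / FINGERPRINT_BITS
    print(f"{len(word_fingerprint)} active bits, sparsity = {sparsity:.4%}")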

Many practical problems of statistical Natural Language Processing (NLP) systems and, more recently, of Transformer models, like the necessity of creating large training data sets, the high cost of computation, the fundamental incongruity of precision and recall, the complex tuning procedures, etc., can be elegantly overcome by applying Semantic Folding to text processing.

 

Semantic Folding Simply Explained

Semantic Folding converts text into semantic fingerprints, encapsulating meaning in a topographical representation.

Semantic fingerprints allow direct comparison of the meanings of any two pieces of text, showing thousands of semantic relations.

If two semantic fingerprints look similar, it means that the texts are semantically similar too.

With Semantic Folding, semantic spaces are stable across languages, enabling direct comparison of text across languages without machine translation.
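
As a minimal illustration of such a comparison, the sketch below measures similarity as the number of active bits two fingerprints share, plus a normalized Jaccard variant; the toy fingerprints and the exact similarity measures are assumptions for illustration, not necessarily the metric Cortical.io uses.

    # Sketch: comparing two semantic fingerprints by bit overlap.
    # Fingerprints are assumed to be stored as sets of active bit positions
    # in a 16,384-bit space; the similarity measures are common choices
    # (absolute overlap, Jaccard), not necessarily Cortical.io's exact metric.

    def overlap(fp_a: set[int], fp_b: set[int]) -> int:
        """Number of semantic features (active bits) the two fingerprints share."""
        return len(fp_a & fp_b)

    def jaccard(fp_a: set[int], fp_b: set[int]) -> float:
        """Overlap normalized by the size of the combined feature set."""
        union = fp_a | fp_b
        return len(fp_a & fp_b) / len(union) if union else 0.0

    # Toy fingerprints (real ones would have a few hundred active bits each).
    fp_dog = {10, 87, 412, 900, 1501, 7777}
    fp_wolf = {10, 87, 412, 1501, 9000, 12000}
    fp_car = {3, 555, 2300, 8100, 14000, 16000}

    print(overlap(fp_dog, fp_wolf), overlap(fp_dog, fp_car))   # 4 vs. 0 shared bits
    print(round(jaccard(fp_dog, fp_wolf), 3))                  # higher = more similar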

How does Semantic Folding work?

To begin with, we select reference material that represents the domain the system will work in – Wikipedia for applications using general English, or domain-related collections of documents for industry-specific applications.

Then, the reference documents are cut into context-based snippets which are distributed over a 2D matrix, in such a way that snippets with similar topics (sharing many common words) are placed close to each other on the map. This process creates a 2D semantic map.

In the next step, a vector is created for each word contained in the reference documents, by activating the positions of all snippets containing this word. This produces a large, binary, very sparsely filled vector called a Semantic Fingerprint.

A Semantic Fingerprint is a vector of 16,384 bits (128×128), where every bit stands for a concrete context (topic) that can be represented as the bag of words of the training snippets at that position.

The whole Semantic Folding process is fully unsupervised.
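
The following toy Python sketch walks through these steps on a handful of snippets. The placement of snippets on the 2D map is deliberately stubbed out (snippets are simply laid out in order), since the topology-preserving mapping is the part of the process this page does not spell out; the fingerprint-building step, activating the positions of all snippets that contain a word, follows the description above.

    # Sketch of the Semantic Folding pipeline on toy data.
    # The real system places snippets on a 128x128 grid so that topically similar
    # snippets end up near each other; the placement below is a trivial stand-in
    # (snippets are laid out in order), so only the fingerprint-building step
    # is faithful to the description above.

    from collections import defaultdict

    GRID_SIDE = 4                      # real map: 128 x 128 = 16,384 positions
    snippets = [
        "the dog barked at the cat",
        "a wolf is a wild relative of the dog",
        "the car engine would not start",
        "she parked the car in the garage",
    ]

    # Step 1 (stubbed): assign each snippet a position on the 2D map.
    # A real implementation would use a topology-preserving mapping here.
    snippet_position = {i: (i // GRID_SIDE, i % GRID_SIDE) for i in range(len(snippets))}

    # Step 2: a word's fingerprint is the set of map positions whose snippet
    # contains that word -- a large, sparse, binary vector in the real system.
    word_fingerprint: dict[str, set[tuple[int, int]]] = defaultdict(set)
    for idx, text in enumerate(snippets):
        for word in set(text.split()):
            word_fingerprint[word].add(snippet_position[idx])

    print(word_fingerprint["dog"])   # positions of both dog-related snippets
    print(word_fingerprint["car"])   # positions of both car-related snippets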

    Applications of Semantic Folding

    Semantic Folding forms the basis for high-level natural language processing functionalities that can be integrated into many different applications.

    • Semantic fingerprints can be generated for language elements like words, sentences and entire documents (a simple aggregation scheme is sketched after this list).
    • Any two pieces of text can be compared, regardless of length or language.
    • Computational operations can be performed on the meaning of text data by measuring the overlap of semantic fingerprints.
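
    As referenced above, here is a minimal sketch of one way to aggregate word fingerprints into a sentence or document fingerprint: count how often each bit is activated across the words and keep only the most frequently activated bits so the result stays sparse. This aggregation rule is an assumption for illustration, not necessarily the exact scheme Cortical.io uses.

        # Sketch: building a fingerprint for a sentence or document from word
        # fingerprints. The aggregation rule used here -- count how often each bit
        # is activated across the words and keep only the most frequent bits so the
        # result stays sparse -- is an assumed, simplified scheme, not necessarily
        # the exact one used by Cortical.io.

        from collections import Counter

        def text_fingerprint(word_fps: list[set[int]], max_bits: int = 4) -> set[int]:
            """Aggregate word fingerprints and keep the most frequently active bits."""
            counts = Counter(bit for fp in word_fps for bit in fp)
            return {bit for bit, _ in counts.most_common(max_bits)}

        # Toy word fingerprints (sets of active bit positions).
        fp_dog = {10, 87, 412, 900}
        fp_barks = {10, 87, 3000, 5000}
        fp_loudly = {87, 5000, 8100, 12000}

        print(sorted(text_fingerprint([fp_dog, fp_barks, fp_loudly])))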

    Semantic fingerprints work particularly well for NLP tasks like:

    • Classification: instead of training the classifier with many labeled examples, a single reference fingerprint can be used to describe a class (see the sketch after this list).
    • Semantic search: comparing the semantic overlap between the semantic fingerprint of a natural-language query and the fingerprints of the indexed documents proves to be both highly accurate and efficient.
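
    A minimal sketch of both tasks, assuming fingerprints are already available as sets of active bit positions: classification compares a document against a single reference fingerprint describing the class, and search ranks indexed documents by their overlap with the query fingerprint. The helper names and the threshold value are illustrative.

        # Sketch: classification against a single reference fingerprint and
        # semantic search by fingerprint overlap. Fingerprints are assumed to be
        # sets of active bit positions; the threshold value is illustrative.

        def overlap(fp_a: set[int], fp_b: set[int]) -> int:
            return len(fp_a & fp_b)

        def classify(doc_fp: set[int], class_fp: set[int], threshold: int = 3) -> bool:
            """A document belongs to the class if it shares enough semantic
            features with the single reference fingerprint describing that class."""
            return overlap(doc_fp, class_fp) >= threshold

        def search(query_fp: set[int], index: dict[str, set[int]]) -> list[tuple[str, int]]:
            """Rank indexed documents by semantic overlap with the query fingerprint."""
            return sorted(((doc_id, overlap(query_fp, fp)) for doc_id, fp in index.items()),
                          key=lambda pair: pair[1], reverse=True)

        # Toy data.
        finance_class_fp = {5, 42, 777, 1024, 4096}
        doc_fp = {5, 42, 1024, 9000}
        index = {"doc_a": {5, 42, 777}, "doc_b": {9000, 12000}, "doc_c": {42, 1024, 4096}}

        print(classify(doc_fp, finance_class_fp))      # True: 3 shared features
        print(search(doc_fp, index))                   # doc_a and doc_c rank highest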

    Advantages of Semantic Folding

    • High Accuracy

    Semantic fingerprints leverage a rich set of 16,384 semantic features, enabling fine-grained disambiguation of words and concepts.

    • High Efficiency

    Semantic Folding requires an order of magnitude less training material (hundreds instead of thousands of examples) and fewer compute resources because it uses sparse distributed vectors.

    • High Transparency & Explainability

    Each semantic feature can be inspected at the document level, so that biases in the models can be eliminated and results explained.

    • High Flexibility & Scalability

    Semantic Folding can be applied to any language and use case, and business users can easily customize models.
