Skip to content
PatentBrief
Get alertsTop ↑

How Computers Find Hidden Connections Between Different Fields of Knowledge

A method for finding related ideas in completely different subjects by using math to map how words appear together, even when the subjects use different vocabulary.

Granted 2003ExpiredExpired 2020Owned by Huntsman International LLCInvented by Herbert R. Gillis

Original patent title: “Method for retrieving semantically distant analogies

Plain-English explanation by SahiLast reviewed · June 15, 2026

A method for finding related ideas in completely different subjects by using math to map how words appear together, even when the subjects use different vocabulary. Granted to Huntsman International LLC in 2003 with 46 claims and 456 forward citations.

Key facts

Patent numberUS 6523026
StatusExpired
FieldAI & Machine Learning
AssigneeHuntsman International LLC
InventorHerbert R. Gillis
Filed2000
Granted2003
Claims46
Times cited456
LitigationNone on record
Value · $96K$307KModest

Coverage

What does this patent actually cover?

This patent describes a way to find 'analogies' between two unrelated fields, such as finding a biological solution to a mechanical engineering problem. It works by first analyzing a source domain (like a library of biology papers) to create a high-dimensional map of how words appear together. It then uses this map—represented as vectors in a multi-dimensional space—to search a second, completely different domain (like a library of engineering patents) where the same words might not even appear. By comparing the 'meaning' of the words based on their context in the first domain, the system can pull up relevant documents from the second domain that share a functional relationship despite having no overlapping keywords.

The gap

What does this patent NOT cover?

  • Does not cover simple keyword-based search engines that rely on matching exact words or synonyms.
  • Does not cover systems that require the target domain to contain the same vocabulary as the source domain.
  • Does not cover manual categorization or human-led tagging of documents.
  • Does not cover basic vector search that does not specifically map relationships across semantically distant domains.

These exclusions are unique to PatentBrief — derived from the actual claim language, not patent-office boilerplate.

What made this novel

The system ignores the actual words in the target domain and instead maps them into the 'semantic space' of the source domain, allowing it to find functional equivalents without needing a direct translation or shared dictionary.

Method for retrieving semantic…(Primary claim)ai mlsoftwaretelecommunications

Schematic visualization of the patent's claim structure. Hand-drawn diagrams in progress for each landmark patent.

Where you've seen this

Real-world examples

01

Cross-disciplinary research tools

02

Automated patent landscape analysis

03

AI-driven scientific discovery platforms

Why it matters

The bigger picture

This technology is a precursor to modern cross-domain knowledge discovery and semantic search. It addresses the 'vocabulary mismatch' problem, which is a major hurdle in AI and data science where different industries use different jargon to describe the same underlying physical or logical processes.

Filed

October 2, 2000

Granted

February 18, 2003

Market context

Who's building on this

Companies in this space

Large-scale search companies like Google and specialized AI research firms are building on these principles of high-dimensional semantic mapping. The core concept of embedding data into vector spaces is now a foundational technique in modern Large Language Models and retrieval-augmented generation systems.

Market impact

This patent helped formalize the transition from simple string-matching search to semantic search. It enabled organizations to break down information silos by allowing automated systems to identify functional parallels across disparate industries, which is now a standard capability in enterprise data analytics.

Claim 1 — Plain English

What this patent covers

This patent describes a way to find 'analogies' between two unrelated fields, such as finding a biological solution to a mechanical engineering problem. It works by first analyzing a source domain (like a library of biology papers) to create a high-dimensional map of how words appear together. It then uses this map—represented as vectors in a multi-dimensional space—to search a second, completely different domain (like a library of engineering patents) where the same words might not even appear. By comparing the 'meaning' of the words based on their context in the first domain, the system can pull up relevant documents from the second domain that share a functional relationship despite having no overlapping keywords.

The clever bit

The system ignores the actual words in the target domain and instead maps them into the 'semantic space' of the source domain, allowing it to find functional equivalents without needing a direct translation or shared dictionary.

What it does not cover

  • Does not cover simple keyword-based search engines that rely on matching exact words or synonyms.
  • Does not cover systems that require the target domain to contain the same vocabulary as the source domain.
  • Does not cover manual categorization or human-led tagging of documents.
  • Does not cover basic vector search that does not specifically map relationships across semantically distant domains.

Patent timeline

Filing

Application submitted to the patent office

Publication

Application published, typically 18 months after filing

Grant

Patent officially issued

PatentBrief Score

Impact Score

Strong

Citation count

40/40

Highly cited

Claim breadth

20/20

Very broad protection

Recency

0/20

Older than 20 years

Assignee scale

0/20

Independent or smaller assigneeassigneeThe entity that owns the patent — usually the inventor's employer or a company.Read more →

PatentBrief Impact Score — based on citation count, claim breadth, recency, and assignee scale. Not a legal assessment.

Heuristic Value Estimate

What this patent might be worth

Modest

$96K$307K

Midpoint $192K · expired or expiring · industry ×1.6

Adjust inputs →

Heuristic only — blends forward/backward citation counts, claim scope, time remaining, litigation history, and CPC-derived industry baseline. Real valuations need a professional appraisal.

The original legal language

Original claims

46 claims as filed with the patent office.

Concepts involved

ClaimPrior artNon-obviousnessNoveltySpecificationAssigneePatent term

Citations

Patent lineage

Cites earlier patents

2

earlier patents this invention cites as foundations

View prior art →

Cited by later patents

456

later patents that build on this invention

View patents →

Cite this patent

Gillis, H. R. (2003). How Computers Find Hidden Connections Between Different Fields of Knowledge (U.S. Patent No. 6,523,026). U.S. Patent and Trademark Office. https://patentbrief.org/patent/us/6523026/google-search-query-processing

Auto-generated from the patent record. Double-check author order and the issue date against the official USPTO document before submitting.

Embed

Add this patent to your site

Drop this plain-English patent card into any blog post or article — free, no signup. It always links back to the full breakdown here.

<div data-patentlens-widget data-patent-number="US6523026"></div>
<script src="https://patentbrief.org/embed.js" async></script>

Stay in the loop

Get a weekly digest of new patents.

One email per week. No spam. Unsubscribe anytime.

Keep exploring

Related patents you should know

US 4683195 · 1987

How to Make Billions of Copies of a DNA Segment

This patent describes the Polymerase Chain Reaction (PCR), a method to rapidly create many copies of a specific piece of DNA or RNA, enabling its detection and analysis.

Cetus Corp

US 8697359 · 2014

How to Edit Genes in Human Cells Using an Engineered CRISPR System

This patent describes an engineered CRISPR-Cas9 system for precisely cutting DNA in eukaryotic cells to change how genes work, opening the door for gene editing in complex organisms.

Massachusetts Institute of Technology

US 7657849 · 2010

How the iPhone's Slide-to-Unlock Gesture Works

Apple's 2010 patent describes unlocking a device by dragging a specific graphical image across the touchscreen along a predefined path, a gesture that became iconic with the original iPhone.

Apple Inc

US 4733665 · 1988

How Doctors Implant a Permanent Stent Using a Balloon

This patent describes the method for placing a permanent, expandable wire mesh tube inside a blood vessel or other body tube using a balloon-tipped catheter to widen it and keep it open.

Expandable Grafts Partnership

US 4965188 · 1990

How to Make Many Copies of a DNA Piece with Heat

This patent describes the Polymerase Chain Reaction (PCR) method, a technique to make millions of copies of a specific DNA segment using a heat-resistant enzyme and repeated temperature changes.

Cetus Corp

US 4235871 · 1980

How to Encapsulate Active Materials in Lipid Bubbles Efficiently

This patent describes a method for trapping biologically active substances inside tiny, multi-layered fat bubbles called liposomes, using a specific water-in-oil emulsion and gel-forming process to improve how much material gets captured.

Individual

More to explore

More in AI & Machine Learning

Browse all AI & Machine Learning

New to patents?

What is a patent?How to read a patentAnatomy of a claimHow strong is this patent?What the citations meanWhat it doesn't coverPatent glossary

Common Questions

Frequently Asked Questions

What does How Computers Find Hidden Connections Between Different Fields of Knowledge cover?

A method for finding related ideas in completely different subjects by using math to map how words appear together, even when the subjects use different vocabulary.

Who owns patent US 6523026?

Huntsman International LLC owns this patent, granted in 2003.

When does this patent expire?

This patent has expired and is now in the public domain — anyone can use the invention freely.

What is patent US 6523026 cited by?

This patent has been cited by 456 later patents that build on its ideas.

What problem does this patent solve?

This technology is a precursor to modern cross-domain knowledge discovery and semantic search. It addresses the 'vocabulary mismatch' problem, which is a major hurdle in AI and data science where different industries use different jargon to describe the same underlying physical or logical processes.

What does this patent NOT cover?

Does not cover simple keyword-based search engines that rely on matching exact words or synonyms.

Patent monitoring

Get notified when Huntsman International LLC files a new patent

Get notified when this company files a new patent. Weekly digest · Confirm via email · Unsubscribe anytime.

Last reviewed: June 15, 2026 · PatentBrief is not a law firm and this is not legal advice.