How Computers Find Hidden Connections Between Different Fields of Knowledge
A method for finding related ideas in completely different subjects by using math to map how words appear together, even when the subjects use different vocabulary.
Patent Number
US 6523026
Status
Active
Filing Date
October 2, 2000
Grant Date
February 18, 2003
Expiration
~October 2020 (estimated)
Claims
46
Assignee
Huntsman International LLC
Inventors
Herbert R. Gillis
Citations
456 forward · 2 backward
What it covers
This patent describes a way to find 'analogies' between two unrelated fields, such as finding a biological solution to a mechanical engineering problem. It works by first analyzing a source domain (like a library of biology papers) to create a high-dimensional map of how words appear together. It then uses this map—represented as vectors in a multi-dimensional space—to search a second, completely different domain (like a library of engineering patents) where the same words might not even appear. By comparing the 'meaning' of the words based on their context in the first domain, the system can pull up relevant documents from the second domain that share a functional relationship despite having no overlapping keywords.
What it doesn't cover
- —Does not cover simple keyword-based search engines that rely on matching exact words or synonyms.
- —Does not cover systems that require the target domain to contain the same vocabulary as the source domain.
- —Does not cover manual categorization or human-led tagging of documents.
- —Does not cover basic vector search that does not specifically map relationships across semantically distant domains.
The clever bit
The system ignores the actual words in the target domain and instead maps them into the 'semantic space' of the source domain, allowing it to find functional equivalents without needing a direct translation or shared dictionary.
Why it matters
This technology is a precursor to modern cross-domain knowledge discovery and semantic search. It addresses the 'vocabulary mismatch' problem, which is a major hurdle in AI and data science where different industries use different jargon to describe the same underlying physical or logical processes.
Real-world examples
- 1.Cross-disciplinary research tools
- 2.Automated patent landscape analysis
- 3.AI-driven scientific discovery platforms
Generated by PatentBrief · Not legal advice · patentbrief.org
US 6523026 · 2026