How Computers Shrink Data by Finding Repeated Patterns
This patent describes a method for compressing data by finding the longest repeating sequences of characters, assigning them short codes, and building a dictionary of these sequences for both compression and decompression.
Original patent title: “High speed data compression and decompression apparatus and method”
What this patent covers
The actual claim
The patent details a system for compressing "data character signals" (Claim 1). First, it builds a "string table" (Claim 1) that stores sequences of characters it has seen before. When processing new data, it searches for the "longest match" (Claim 1) between the incoming data and the strings already in its table. Once the longest match is determined, the system sends a short "code signal" (Claim 1) representing that matched string instead of the actual characters. It then adds a new, slightly longer string to its table, formed by taking the "longest match" and adding the very next character from the input data (Claim 1). For example, if the input is "banana" and "ban" is the longest match, it sends the code for "ban" and then adds "bana" to its table. Decompression works by receiving these codes and rebuilding a similar string table to translate the codes back into the original character sequences.
What this patent does NOT cover
The boundaries
- Compression methods that do not use a "string table" to store encountered data character signals (Claim 1).
- Compression methods that do not search for the "longest match" between input data and stored strings (Claim 1).
- Compression methods that do not assign a unique "code signal" to each stored string for transmission (Claim 1).
- Compression methods that do not build an "extended string" by adding the next character to the longest match (Claim 1).
- Compression methods that do not involve "data character signals" (Claim 1), implying it's not for raw binary data without character interpretation.
These exclusions are unique to PatentBrief — derived from the actual claim language, not patent-office boilerplate.
What made this novel
The core innovation is the dynamic creation of a dictionary (the "string table") on the fly, where frequently occurring sequences of characters are identified and replaced with shorter codes. This adaptive approach, combined with efficiently searching and updating the dictionary using a "limited search hashing procedure" (Abstract, Claim 5), allows for effective compression without needing a pre-defined dictionary.
Schematic visualization of the patent's claim structure. Hand-drawn diagrams in progress for each landmark patent.
Where you've seen this
Real-world examples
GIF image format
TIFF image format
Some forms of ZIP compression
Early modem data compression
Why it matters
The bigger picture
This patent is a foundational work in dictionary-based data compression, specifically relating to the LZW (Lempel-Ziv-Welch) algorithm. LZW became widely adopted for its efficiency and simplicity, influencing many common file formats and protocols. Its commercial impact was significant, leading to widespread use in image compression, archive formats, and network protocols.
Filed
June 20, 1983
Granted
December 10, 1985
Claim 1 — Plain English
What this patent covers
The patent details a system for compressing "data character signals" (Claim 1). First, it builds a "string table" (Claim 1) that stores sequences of characters it has seen before. When processing new data, it searches for the "longest match" (Claim 1) between the incoming data and the strings already in its table. Once the longest match is determined, the system sends a short "code signal" (Claim 1) representing that matched string instead of the actual characters. It then adds a new, slightly longer string to its table, formed by taking the "longest match" and adding the very next character from the input data (Claim 1). For example, if the input is "banana" and "ban" is the longest match, it sends the code for "ban" and then adds "bana" to its table. Decompression works by receiving these codes and rebuilding a similar string table to translate the codes back into the original character sequences.
The clever bit
The core innovation is the dynamic creation of a dictionary (the "string table") on the fly, where frequently occurring sequences of characters are identified and replaced with shorter codes. This adaptive approach, combined with efficiently searching and updating the dictionary using a "limited search hashing procedure" (Abstract, Claim 5), allows for effective compression without needing a pre-defined dictionary.
What it does not cover
- Compression methods that do not use a "string table" to store encountered data character signals (Claim 1).
- Compression methods that do not search for the "longest match" between input data and stored strings (Claim 1).
- Compression methods that do not assign a unique "code signal" to each stored string for transmission (Claim 1).
- Compression methods that do not build an "extended string" by adding the next character to the longest match (Claim 1).
- Compression methods that do not involve "data character signals" (Claim 1), implying it's not for raw binary data without character interpretation.
Patent Journey
From filing to expiry
Patent Filed
1983
Patent Granted
1985 · 2yr after filing
Highly Cited
347 patents cite this
Patent Expired
2003
PatentBrief Score
Impact Score
Strong
Citation count
40/40
Highly cited
Claim breadth
20/20
Very broad protection
Recency
0/20
Older than 20 years
Assignee scale
0/20
Independent or smaller assignee
PatentBrief Impact Score — based on citation count, claim breadth, recency, and assignee scale. Not a legal assessment.
The original legal language
Original claims
183 claims as filed with the patent office.
Citations
Patent lineage
Stay in the loop
Get a weekly digest of new patents.
One email per week. No spam. Unsubscribe anytime.
Keep exploring
Related patents you should know
US 12564871 · 2026
A Fixture for Cleaning Showerheads with Multiple Separate Chambers
This patent describes a cleaning device for showerheads that uses a fixture with three or more separate internal compartments and channels to direct cleaning fluid to the showerhead's upper surfaces.
ASM IP HOLDING BV
US 12324579 · 2025
Surgical Stapler Battery Health Check During Operation
This patent describes a powered surgical stapler that can detect if some of its rechargeable battery cells are damaged while it's actually firing staples, helping ensure the procedure finishes safely.
CILAG GMBH INT
US 12471982 · 2025
Surgical Tool That Combines Energy Treatment and Stapling
CILAG's patent details a surgical instrument that applies therapeutic energy to tissue, monitors its properties, then deploys staples, adapting the stapling based on the initial energy treatment and monitoring.
CILAG GMBH INT
US 11918209 · 2024
Real-Time Surgical Instrument Status on Live Video During Operations
This patent describes a surgical system that shows live video from inside the body and overlays important information about the surgical tool directly onto the screen, helping surgeons operate more precisely.
CILAG GMBH INT
US 8697359 · 2014
How to Use CRISPR-Cas9 to Edit Genes in Human Cells
This patent describes a method and system for precisely altering gene expression in eukaryotic cells, including human cells, using an engineered CRISPR-Cas9 system that targets and cleaves specific DNA sequences.
Massachusetts Institute of Technology
US 4683195 · 1987
How to Make Many Copies of a Specific DNA Segment
This patent describes the Polymerase Chain Reaction (PCR), a fundamental process for making millions of copies of a specific DNA or RNA segment from a tiny sample, enabling its detection.
Cetus Corp
Semantically similar
You might also find these interesting
US 4063220 · 1977 · Xerox Corp
How Computers Share a Network Cable Without Crashing
US 4405829 · 1983 · Massachusetts Institute of Technology
How RSA Public-Key Encryption Secures Digital Messages
US 7469381 · 2008 · Apple Inc
How Touchscreens Make Documents Bounce When You Scroll Too Far
US 10452978 · 2019 · Google LLC
How AI Models Understand Language Using Self-Attention
Patent monitoring