PatentBrief

How Computers Shrink Data by Finding Repeated Patterns

This patent describes a method for compressing data by finding the longest repeating sequences of characters, assigning them short codes, and building a dictionary of these sequences for both compression and decompression.

Granted 1985activeExpired 2003Owned by Sperry CorpInvented by Terry A. Welch

Original patent title: “High speed data compression and decompression apparatus and method

What this patent covers

The actual claim

The patent details a system for compressing "data character signals" (Claim 1). First, it builds a "string table" (Claim 1) that stores sequences of characters it has seen before. When processing new data, it searches for the "longest match" (Claim 1) between the incoming data and the strings already in its table. Once the longest match is determined, the system sends a short "code signal" (Claim 1) representing that matched string instead of the actual characters. It then adds a new, slightly longer string to its table, formed by taking the "longest match" and adding the very next character from the input data (Claim 1). For example, if the input is "banana" and "ban" is the longest match, it sends the code for "ban" and then adds "bana" to its table. Decompression works by receiving these codes and rebuilding a similar string table to translate the codes back into the original character sequences.

What this patent does NOT cover

The boundaries

  • Compression methods that do not use a "string table" to store encountered data character signals (Claim 1).
  • Compression methods that do not search for the "longest match" between input data and stored strings (Claim 1).
  • Compression methods that do not assign a unique "code signal" to each stored string for transmission (Claim 1).
  • Compression methods that do not build an "extended string" by adding the next character to the longest match (Claim 1).
  • Compression methods that do not involve "data character signals" (Claim 1), implying it's not for raw binary data without character interpretation.

These exclusions are unique to PatentBrief — derived from the actual claim language, not patent-office boilerplate.

What made this novel

The core innovation is the dynamic creation of a dictionary (the "string table") on the fly, where frequently occurring sequences of characters are identified and replaced with shorter codes. This adaptive approach, combined with efficiently searching and updating the dictionary using a "limited search hashing procedure" (Abstract, Claim 5), allows for effective compression without needing a pre-defined dictionary.

High speed data compression an…(Primary claim)softwaretelecommunicationsconsumer electronics

Schematic visualization of the patent's claim structure. Hand-drawn diagrams in progress for each landmark patent.

Where you've seen this

Real-world examples

01

GIF image format

02

TIFF image format

03

Some forms of ZIP compression

04

Early modem data compression

Why it matters

The bigger picture

This patent is a foundational work in dictionary-based data compression, specifically relating to the LZW (Lempel-Ziv-Welch) algorithm. LZW became widely adopted for its efficiency and simplicity, influencing many common file formats and protocols. Its commercial impact was significant, leading to widespread use in image compression, archive formats, and network protocols.

Filed

June 20, 1983

Granted

December 10, 1985

Claim 1 — Plain English

What this patent covers

The patent details a system for compressing "data character signals" (Claim 1). First, it builds a "string table" (Claim 1) that stores sequences of characters it has seen before. When processing new data, it searches for the "longest match" (Claim 1) between the incoming data and the strings already in its table. Once the longest match is determined, the system sends a short "code signal" (Claim 1) representing that matched string instead of the actual characters. It then adds a new, slightly longer string to its table, formed by taking the "longest match" and adding the very next character from the input data (Claim 1). For example, if the input is "banana" and "ban" is the longest match, it sends the code for "ban" and then adds "bana" to its table. Decompression works by receiving these codes and rebuilding a similar string table to translate the codes back into the original character sequences.

The clever bit

The core innovation is the dynamic creation of a dictionary (the "string table") on the fly, where frequently occurring sequences of characters are identified and replaced with shorter codes. This adaptive approach, combined with efficiently searching and updating the dictionary using a "limited search hashing procedure" (Abstract, Claim 5), allows for effective compression without needing a pre-defined dictionary.

What it does not cover

  • Compression methods that do not use a "string table" to store encountered data character signals (Claim 1).
  • Compression methods that do not search for the "longest match" between input data and stored strings (Claim 1).
  • Compression methods that do not assign a unique "code signal" to each stored string for transmission (Claim 1).
  • Compression methods that do not build an "extended string" by adding the next character to the longest match (Claim 1).
  • Compression methods that do not involve "data character signals" (Claim 1), implying it's not for raw binary data without character interpretation.

Patent Journey

From filing to expiry

Patent Filed

1983

Patent Granted

1985 · 2yr after filing

Highly Cited

347 patents cite this

Patent Expired

2003

PatentBrief Score

Impact Score

60/ 100

Strong

Citation count

40/40

Highly cited

Claim breadth

20/20

Very broad protection

Recency

0/20

Older than 20 years

Assignee scale

0/20

Independent or smaller assignee

PatentBrief Impact Score — based on citation count, claim breadth, recency, and assignee scale. Not a legal assessment.

The original legal language

Original claims

183 claims as filed with the patent office.

Citations

Patent lineage

Cites earlier patents

1

earlier patents this invention cites as foundations

View prior art →

Cited by later patents

347

later patents that build on this invention

View patents →

Stay in the loop

Get a weekly digest of new patents.

One email per week. No spam. Unsubscribe anytime.

Keep exploring

Related patents you should know

US 12564871 · 2026

A Fixture for Cleaning Showerheads with Multiple Separate Chambers

This patent describes a cleaning device for showerheads that uses a fixture with three or more separate internal compartments and channels to direct cleaning fluid to the showerhead's upper surfaces.

ASM IP HOLDING BV

US 12324579 · 2025

Surgical Stapler Battery Health Check During Operation

This patent describes a powered surgical stapler that can detect if some of its rechargeable battery cells are damaged while it's actually firing staples, helping ensure the procedure finishes safely.

CILAG GMBH INT

US 12471982 · 2025

Surgical Tool That Combines Energy Treatment and Stapling

CILAG's patent details a surgical instrument that applies therapeutic energy to tissue, monitors its properties, then deploys staples, adapting the stapling based on the initial energy treatment and monitoring.

CILAG GMBH INT

US 11918209 · 2024

Real-Time Surgical Instrument Status on Live Video During Operations

This patent describes a surgical system that shows live video from inside the body and overlays important information about the surgical tool directly onto the screen, helping surgeons operate more precisely.

CILAG GMBH INT

US 8697359 · 2014

How to Use CRISPR-Cas9 to Edit Genes in Human Cells

This patent describes a method and system for precisely altering gene expression in eukaryotic cells, including human cells, using an engineered CRISPR-Cas9 system that targets and cleaves specific DNA sequences.

Massachusetts Institute of Technology

US 4683195 · 1987

How to Make Many Copies of a Specific DNA Segment

This patent describes the Polymerase Chain Reaction (PCR), a fundamental process for making millions of copies of a specific DNA or RNA segment from a tiny sample, enabling its detection.

Cetus Corp

Semantically similar

You might also find these interesting

SEARCH ALL

US 4063220 · 1977 · Xerox Corp

How Computers Share a Network Cable Without Crashing

US 4405829 · 1983 · Massachusetts Institute of Technology

How RSA Public-Key Encryption Secures Digital Messages

US 7469381 · 2008 · Apple Inc

How Touchscreens Make Documents Bounce When You Scroll Too Far

US 10452978 · 2019 · Google LLC

How AI Models Understand Language Using Self-Attention

Patent monitoring

Get notified when Sperry Corp files a new patent

Get notified when this company files a new patent. Weekly digest · Confirm via email · Unsubscribe anytime.

Last reviewed: May 25, 2026 · PatentBrief is not a law firm and this is not legal advice.