PatentBrief — patentbrief.org

11615208 USView original on Google PatentsCompare ↗Brief ↗

How Cloud Systems Automatically Create and Train AI Data Models

Name: PatentBrief
Address: Phoenix, AZ, US
Price range: Free

A cloud-based system that generates fake, privacy-safe data to train AI models, ensuring they remain accurate while protecting sensitive personal information.

Granted 2023ActiveExpires 2038Owned by Capital One Services LLCInvented by Austin Walters, Kate Key, Mark Watson + 8 more

Original patent title: “Systems and methods for synthetic data generation”

Plain-English explanation by SahiLast reviewed · June 15, 2026

A cloud-based system that generates fake, privacy-safe data to train AI models, ensuring they remain accurate while protecting sensitive personal information. Granted to Capital One Services LLC in 2023 with 23 claims and 4 forward citations.

Coverage

What does this patent actually cover?

This patent describes a cloud system that automates the creation of AI models by using synthetic data—fake data that mimics the statistical properties of real, sensitive information. The system takes a reference dataset, turns categories into numbers, and uses a 'dataset generator' to create synthetic versions. During training, the system constantly compares the model's output to the original data, using a 'similarity metric' and a 'prediction metric' to ensure the model is both accurate and statistically similar to real-world patterns. If the model drifts too far from the desired accuracy or similarity, the system applies a penalty to a loss function, forcing the model to adjust itself until it meets specific quality criteria.

The gap

What does this patent NOT cover?

Does not cover the use of real, unmasked personal data for training purposes.
Does not cover manual, non-automated methods of data labeling or model training.
Does not cover hardware-specific AI acceleration (e.g., specific GPU architectures).
Does not cover methods that do not involve a penalty-based loss function for synthetic data generation.

These exclusions are unique to PatentBrief — derived from the actual claim language, not patent-office boilerplate.

Key facts

Patent number	US 11615208
Status	Active
Field	AI & Machine Learning
Assignee	Capital One Services LLC
Inventors	Austin Walters, Kate Key, Mark Watson and 8 others
Filed	2018
Granted	2023
Claims	23
Times cited	4
Litigation	None on record

Value · $94K–$300KModest

What made this novel

The system uses a feedback loop that treats the 'similarity' of the synthetic data as a constraint in the loss function, effectively forcing the AI to learn the structure of the data without ever seeing the actual sensitive values.

Schematic visualization of the patent's claim structure. Hand-drawn diagrams in progress for each landmark patent.

Where you've seen this

Real-world examples

Fraud detection systems in banking

Credit risk assessment models

Automated customer service chatbots

Privacy-compliant data analysis platforms

Why it matters

The bigger picture

In industries like banking, companies cannot use real customer data (like account numbers or social security numbers) to train AI models due to strict privacy laws. This patent provides a technical framework for 'privacy-preserving' AI development, allowing companies to build powerful machine learning tools without risking data breaches or violating regulations like GDPR or CCPA.

Filed

October 4, 2018

Granted

March 28, 2023

Market context

Who's building on this

Companies in this space

Capital One is the primary assignee and continues to integrate these methods into their internal machine learning pipelines. Other major financial institutions and cloud providers like AWS and Google Cloud are actively developing similar synthetic data generation tools to solve the 'data privacy vs. model utility' trade-off.

Market impact

This patent reinforces the shift toward 'synthetic data' as a standard industry practice for regulated sectors. It helps move the industry away from risky data-sharing practices and toward automated, secure model training pipelines that satisfy both data scientists and legal compliance teams.

Claim 1 — Plain English

What this patent covers

The clever bit

What it does not cover

Does not cover the use of real, unmasked personal data for training purposes.
Does not cover manual, non-automated methods of data labeling or model training.
Does not cover hardware-specific AI acceleration (e.g., specific GPU architectures).
Does not cover methods that do not involve a penalty-based loss function for synthetic data generation.

Patent timeline

FilingOct 4, 2018

Application submitted to the patent office

PublicationMar 28, 2023

Application published, typically 18 months after filing

GrantMar 28, 2023

Patent officially issued

PatentBrief Score

Impact Score

Moderate

Citation count

14/40

Early citations

Claim breadth

15/20

Broad claims

Recency

20/20

Granted within 5 years

Assignee scale

0/20

Independent or smaller assignee

PatentBrief Impact Score — based on citation count, claim breadth, recency, and assignee scale. Not a legal assessment.

Heuristic Value Estimate

What this patent might be worth

Modest

$94K – $300K

Midpoint $187K · 12.2 yr remaining · industry ×1.6

Adjust inputs →

Heuristic only — blends forward/backward citation counts, claim scope, time remaining, litigation history, and CPC-derived industry baseline. Real valuations need a professional appraisal.

Claim text not yet imported for this patent

The original legal language

Original claims

23 claims as filed with the patent office.

Concepts involved

Claim Prior art Non-obviousness Novelty Specification Assignee Patent term

Citations

Patent lineage

Cites earlier patents

121

earlier patents this invention cites as foundations

View prior art →

Cited by later patents

later patents that build on this invention

View patents →

Cite this patent

Walters, A., Key, K., Watson, M., Goodsitt, J., Walters, M., Pham, V., TATSUMI, N., Truong, A., Taylor, K., Farivar, R., & Abad, F. A. T. (2023). How Cloud Systems Automatically Create and Train AI Data Models (U.S. Patent No. 11,615,208). U.S. Patent and Trademark Office. https://patentbrief.org/patent/us/11615208/dall-e-text-to-image-generation

Auto-generated from the patent record. Double-check author order and the issue date against the official USPTO document before submitting.

Embed

Add this patent to your site

Drop this plain-English patent card into any blog post or article — free, no signup. It always links back to the full breakdown here.

<div data-patentlens-widget data-patent-number="US11615208"></div>
<script src="https://patentbrief.org/embed.js" async></script>

Stay in the loop

Get a weekly digest of new patents.

One email per week. No spam. Unsubscribe anytime.

Keep exploring

Related patents you should know

US 4683195 · 1987

How to Make Billions of Copies of a DNA Segment

This patent describes the Polymerase Chain Reaction (PCR), a method to rapidly create many copies of a specific piece of DNA or RNA, enabling its detection and analysis.

Cetus Corp

US 8697359 · 2014

How to Edit Genes in Human Cells Using an Engineered CRISPR System

This patent describes an engineered CRISPR-Cas9 system for precisely cutting DNA in eukaryotic cells to change how genes work, opening the door for gene editing in complex organisms.

Massachusetts Institute of Technology

US 7657849 · 2010

How the iPhone's Slide-to-Unlock Gesture Works

Apple's 2010 patent describes unlocking a device by dragging a specific graphical image across the touchscreen along a predefined path, a gesture that became iconic with the original iPhone.

Apple Inc

US 4733665 · 1988

How Doctors Implant a Permanent Stent Using a Balloon

This patent describes the method for placing a permanent, expandable wire mesh tube inside a blood vessel or other body tube using a balloon-tipped catheter to widen it and keep it open.

Expandable Grafts Partnership

US 4965188 · 1990

How to Make Many Copies of a DNA Piece with Heat

This patent describes the Polymerase Chain Reaction (PCR) method, a technique to make millions of copies of a specific DNA segment using a heat-resistant enzyme and repeated temperature changes.

Cetus Corp

US 4235871 · 1980

How to Encapsulate Active Materials in Lipid Bubbles Efficiently

This patent describes a method for trapping biologically active substances inside tiny, multi-layered fat bubbles called liposomes, using a specific water-in-oil emulsion and gel-forming process to improve how much material gets captured.

Individual

Semantically similar

You might also find these interesting

SEARCH ALL

US 12518214 · 2026 · Nant Holdings IP

Training AI on Private Data Without Seeing It

US 10599957 · 2020 · Capital One Services

How to Automatically Detect and Fix Changes in AI Model Data

US 11836577 · 2023 · Amazon Technologies

Training Robot AI Models Faster Using Smart Simulations

US 12443890 · 2025 · Google

How Devices Train Shared AI Models While Keeping Your Data Private

More to explore

Frequently Asked Questions

What does How Cloud Systems Automatically Create and Train AI Data Models cover?

A cloud-based system that generates fake, privacy-safe data to train AI models, ensuring they remain accurate while protecting sensitive personal information.

Who owns patent US 11615208?

Capital One Services LLC owns this patent, granted in 2023.

When does this patent expire?

This patent is expected to expire on March 28, 2043, when the invention enters the public domain.

What is patent US 11615208 cited by?

This patent has been cited by 4 later patents that build on its ideas.

What problem does this patent solve?

What does this patent NOT cover?

Does not cover the use of real, unmasked personal data for training purposes.

Patent monitoring

Get notified when Capital One Services LLC files a new patent

Last reviewed: June 15, 2026 · PatentBrief is not a law firm and this is not legal advice.

How Cloud Systems Automatically Create and Train AI Data Models

What does this patent actually cover?

What does this patent NOT cover?

Key facts

Real-world examples

The bigger picture

Who's building on this

What this patent covers

Patent timeline

Impact Score

What this patent might be worth

Original claims

Patent lineage

Cite this patent

Add this patent to your site

Get a weekly digest of new patents.

Related patents you should know

How to Make Billions of Copies of a DNA Segment

How to Edit Genes in Human Cells Using an Engineered CRISPR System

How the iPhone's Slide-to-Unlock Gesture Works

How Doctors Implant a Permanent Stent Using a Balloon

How to Make Many Copies of a DNA Piece with Heat

How to Encapsulate Active Materials in Lipid Bubbles Efficiently

You might also find these interesting

Training AI on Private Data Without Seeing It

How to Automatically Detect and Fix Changes in AI Model Data

Training Robot AI Models Faster Using Smart Simulations

How Devices Train Shared AI Models While Keeping Your Data Private

More in AI & Machine Learning

How AI Models Understand Language Using 'Attention'

How Computers Find Hidden Connections Between Different Fields of Knowledge

How Facebook Uses Deep Learning to Predict What You Might Like

How AI Learns New Tasks Using Old Data Labels

Frequently Asked Questions

Get notified when Capital One Services LLC files a new patent