IP Strategy

Data Privacy & IP

Privacy-enhancing technology patents; differential privacy; federated learning; homomorphic encryption; zero-knowledge proofs; GDPR and trade secret protection for proprietary datasets.

What privacy-enhancing technology (PET) innovations are patentable, and who holds major differential privacy and federated learning patents?

Privacy-enhancing technologies represent one of the most active areas of new patent filing at the intersection of cryptography; machine learning; and regulatory compliance.

Differential Privacy

Definition

Mathematical privacy guarantee
Adding calibrated noise to query results or model training to prevent identification of individuals while preserving statistical utility

Apple

Has deployed differential privacy in iOS since iOS 10 (2016) for collecting usage statistics
QuickType keyboard
Emoji usage
Apple uses differential privacy in iOS health data aggregation
Apple has filed patents on specific differential privacy mechanisms (calibrated Laplace mechanism implementations; local differential privacy protocols)

Google

Pioneered RAPPOR (Randomized Aggregatable Privacy-Preserving Ordinal Response) for Chrome telemetry
Differential privacy team (former Frank McSherry; Ulfar Erlingsson)
Tensorflow Privacy library
Google has filed patents on specific DP mechanism implementations
APPLE vs.

Generic Dp

'adding noise to protect privacy' is potentially an abstract idea
Specific technical implementations of the Laplace mechanism calibrated to specific epsilon/delta privacy budgets + specific data schema + specific utility preservation algorithms are more likely patentable

Federated Learning

Definition

Training ML models across multiple devices/parties without centralizing raw data
The model gradient (not the data) is shared
Apple: uses FL for next-word prediction (QuickType keyboard)
Siri improvement
Emoji suggestions
Without Apple ever seeing the raw keystrokes
Google: Gboard keyboard prediction
First major deployment of FL at scale

Patents

Google US10,572,822 (federated learning for mobile keyboard prediction)
Specific gradient aggregation methods
Specific client selection strategies (FedAvg algorithm)
Apple patents on on-device training without server round-trips

Microsoft Azure ML

FL for enterprise (multiple hospital FL training without patient data leaving hospital)
Specific FL aggregation algorithms
Privacy budget tracking

Open Source

TensorFlow Federated
PySyft (OpenMined)
Flower (federated learning framework)
Significant open source creates prior art landscape

Company Patent Strategy

File provisionals on specific FL implementations before publishing research
Specific gradient compression + privacy budget = more defensible than generic FL claims.

How does GDPR and CCPA affect patent prosecution, trade secret protection, and AI model privacy?

Privacy regulations like GDPR and CCPA create complex interactions with IP law — particularly around trade secrets and AI models — that every company operating with personal data must understand.

Gdpr-patent Prosecution Interaction

Publication of Patent Applications

Patent applications publish 18 months after filing
For AI-related patents, the application must describe the technical invention sufficiently (enabling disclosure)

Tension with Gdpr. If the AI model was trained on personal data, does disclosing the training methodology in the patent application violate any GDPR obligations?

Generally no

Patent applications describe technical methods
They do not need to include the actual personal data used for training
Describing 'we trained on a dataset of X medical records' is technical description, not disclosure of the personal data itself
GDPR RIGHT TO ERASURE (ARTICLE 17) vs.

Trade Secrets

Machine Unlearning

GDPR Article 17 gives individuals the right to have their data deleted
For AI models, this potentially requires 'unlearning' the contribution of specific individuals' data
This creates tension with trade secrets: if model weights are trade secrets and a user requests erasure → can the company modify the trade secret model in response without defeating trade secret protection?

Machine Unlearning Patents

Google
Amazon
Microsoft have filed patents on methods for efficiently removing the influence of specific training data points from trained ML models without full retraining
This is both a privacy compliance tool and an ML efficiency tool

Ccpa and Data Trade Secrets

CCPA (California Consumer Privacy Act) gives California residents right to know what personal data is collected
Right to deletion
California courts recognize that trade secrets in data (the compiled customer database itself) can be protected even while giving consumers access to their own individual data

However

A company that protects its customer database as a trade secret must still respond to individual opt-out and deletion requests
Trade secret protection does not override individual data rights

Gdpr Data Transfers and IP. STANDARD CONTRACTUAL CLAUSES (SCCs) for EU-US data transfers can affect ML model training workflows

Schrems Ii (Cjeu 2020)

Privacy Shield invalidated
SCCs now primary mechanism
Implications: training EU personal data on US servers requires SCCs or adequacy decision
IP owned by US entities from EU-funded training may require careful structuring
GDPR and the NEW EU AI ACT (2024): the EU AI Act applies to AI systems used in the EU
High-risk AI systems (medical; employment; biometrics; critical infrastructure) require conformity assessment
Technical documentation
Incident reporting
The AI Act creates new IP strategy considerations: documentation required for compliance may need to be protected as trade secret
Risk management systems may themselves be patentable innovations.

What are homomorphic encryption and zero-knowledge proof patents, and who are the major players?

Homomorphic encryption (HE) and zero-knowledge proofs (ZKPs) are advanced cryptographic techniques that allow computation on private data — creating significant commercial value and a rapidly growing patent landscape.

Homomorphic Encryption — Fundamentals

Definition

Encryption scheme allowing computations on ciphertext such that the result, when decrypted, equals the result of the same computation on the plaintext
'compute without seeing'

Types

Partially Homomorphic Encryption (PHE): addition OR multiplication only
BGV
BFV
CKKS schemes
Fully Homomorphic Encryption (FHE): both addition AND multiplication — any computation possible
Computationally expensive

Major He Patent Holders

Ibm

Most significant contributor to practical HE research
IBM Research developed HElib library (open source)
IBM patents on: specific FHE scheme implementations
Hardware acceleration for HE bootstrapping
Specific error-correction in lattice-based cryptography

Microsoft

SEAL library (Simple Encrypted Arithmetic Library; open source)
Microsoft Research develops HE for Azure cloud services
Patents on specific optimizations for CKKS (Cheon-Kim-Kim-Song) scheme for approximate arithmetic

Google

Chrome certificate transparency uses cryptography patents
Tink cryptographic library

Inpher

Privacy-first data analytics using HE
ZAMA (France): open source FHE library (TFHE; Concrete)

Commercial He Use Cases

Privacy-preserving ML (compute on encrypted health data without seeing the data)
Private database queries (financial; healthcare)
Multi-party computation for collaborative analytics
ZERO-KNOWLEDGE PROOFS (ZKPs):

Definition

Cryptographic protocol allowing one party to prove knowledge of a fact to another party without revealing the fact itself
'prove you know without revealing what you know'

Zkp Patent Landscape

STARKWARE (STARK proofs — Scalable Transparent ARguments of Knowledge): Eli Ben-Sasson
Computational integrity proofs
Blockchain transaction validation without revealing transaction content
AZTEC PROTOCOL (PLONK proofs)

Aleo

Privacy-preserving blockchain using zk-SNARKs
COINBASE (zero-knowledge KYC verification)
STRIPE (privacy-preserving fraud detection)

Zkp Applications

Blockchain: private transactions
Identity verification without revealing private data
Supply chain: prove compliance without revealing commercial terms

AI. Zero-knowledge ML (prove model produced output without revealing model weights)

Patent Eligibility for Cryptography

Mathematical algorithms are abstract ideas under Alice
Anchor claims in: specific hardware implementation (ASIC for FHE acceleration; specific Montgomery multiplication optimizations)
Specific cryptographic parameter sets
Specific security reduction proofs
Claim the system implementation (server + client + specific cryptographic protocol) not just the abstract mathematical operation.

How should companies protect proprietary datasets as trade secrets, and what constitutes a legally protected data trade secret?

Data is often more valuable than the algorithms that process it — and in a world where ML model performance scales with data, protecting proprietary datasets as trade secrets is increasingly critical.

Trade Secret Protection for Datasets

The Legal Standard

DTSA and most UTSA states: trade secret = (1) information that has economic value from not being known
AND (2) reasonable measures to maintain secrecy

Datasets That Qualify

Compiled customer databases (specific selection + arrangement + annotations = value from not being known; the raw data from public sources alone may not qualify)
Annotated training datasets (the annotation work; curation decisions; and quality filtering that produced the ML-ready dataset = trade secret even if underlying raw content was public)
Proprietary benchmarks and evaluation datasets (specific test cases used to evaluate model performance; withheld from publication)
De-identified clinical datasets with specific cohort selection criteria
Financial transaction fraud labels (which transactions are confirmed fraud is extremely valuable and not publicly known)

What is not Readily Protected

Publicly available data downloaded from common sources (though the curation + cleaning pipeline applied to it may be protected)
Data that can be independently generated by competitors with reasonable effort

Reasonable Measures for Data Trade Secrets

Access Controls

Role-based access
Minimum necessary principle
Data access logs

Data Classification Policy

Label proprietary datasets with specific confidentiality classification
PHYSICAL + TECHNICAL SECURITY: encrypted at rest + in transit
Air-gapped systems for most sensitive

Vendor NDA. NDAs with all vendors who access the data

Employee Restrictions

NDA + IP assignment agreement
Training on data confidentiality
Exit interview to confirm no data was taken

API Access Control. If providing API access to model trained on proprietary data, structure API to prevent model extraction attacks

Dataset Theft Cases

Epic Systems v. Tata Consultancy Services: $940M trade secret verdict (2016)
TCS improperly accessed Epic's software
Demonstrated that data access by vendors creates misappropriation risk
LinkedIn v. hiQ Labs: web scraping of public LinkedIn profiles
Courts found computer fraud issues but not trade secret (public data)

Privacy-trade Secret Tension

Proprietary medical datasets may contain HIPAA-protected PHI
HIPAA de-identification + trade secret protection can coexist
De-identification must be genuine but the de-identified dataset can then be a trade secret

The Gdpr Erasure Tension

If a training dataset contains a user who later requests erasure under GDPR Article 17, the company must remove that individual's data AND may need to retrain the model ('machine unlearning')
This process — if done incorrectly — can inadvertently reveal other trade secrets (model architecture details; other training data characteristics)
Handle carefully.

Data Privacy &amp; IP

What privacy-enhancing technology (PET) innovations are patentable, and who holds major differential privacy and federated learning patents?

How does GDPR and CCPA affect patent prosecution, trade secret protection, and AI model privacy?

What are homomorphic encryption and zero-knowledge proof patents, and who are the major players?

How should companies protect proprietary datasets as trade secrets, and what constitutes a legally protected data trade secret?

Data Privacy & IP