PatentBrief · Patent BriefUS 11501192

How to Automatically Tune Machine Learning Settings Using Smart Math

A system that uses advanced statistical modeling to automatically find the best settings for complex machine learning models, saving engineers from manual trial and error.

Patent Number

US 11501192

Status

Active

Filing Date

September 4, 2018

Grant Date

November 15, 2022

Expiration

~September 2038 (estimated)

Claims

Assignee

University of Toronto

Inventors

Richard Zemel, Kevin Swersky, Roland Jasper Snoek, Ryan P. Adams

Citations

0 forward · 15 backward

What it covers

This patent describes a way to automate the tuning of machine learning systems, specifically by finding the best hyper-parameters. Hyper-parameters are the configuration settings, like learning rates or layer sizes, that define how a model learns. The system uses a probabilistic model—a mathematical way of representing uncertainty—to predict which settings will yield the best performance. The core innovation is applying a non-linear mapping to these settings before the model processes them, which helps the system better understand complex, non-obvious relationships between settings and outcomes. By iteratively evaluating the system and updating the model, it converges on the optimal configuration.

What it doesn't cover

—Does not cover manual tuning of machine learning models by human engineers.
—Does not cover optimization methods that rely solely on linear transformations.
—Does not cover the actual architecture of the neural network being tuned, only the process of selecting its hyper-parameters.
—Does not cover non-probabilistic search methods like simple grid search or random search.

The clever bit

By using a non-linear one-to-one mapping to transform the input domain, the system can model complex, warped landscapes of hyper-parameters that traditional Gaussian processes might struggle to represent accurately.

Why it matters

Tuning machine learning models is notoriously time-consuming and expensive, often requiring massive computing resources. By automating this process, this technology enables faster development cycles and better-performing AI models. It represents a shift toward 'AutoML,' where the infrastructure itself learns how to optimize its own configuration.

Real-world examples

1.AutoML platforms for neural network architecture search
2.Hyper-parameter optimization frameworks for deep learning
3.Automated model training pipelines in cloud AI services

Generated by PatentBrief · Not legal advice · patentbrief.org

US 11501192 · 2026