How to Automatically Tune Machine Learning Settings Using Smart Math
A system that uses advanced statistical modeling to automatically find the best settings for complex machine learning models, saving engineers from manual trial and error.
Patent Number
US 11501192
Status
Active
Filing Date
September 4, 2018
Grant Date
November 15, 2022
Expiration
~September 2038 (estimated)
Claims
24
Assignee
University of Toronto
Inventors
Richard Zemel, Kevin Swersky, Roland Jasper Snoek, Ryan P. Adams
Citations
0 forward · 15 backward
What it covers
This patent describes a way to automate the tuning of machine learning systems, specifically by finding the best hyper-parameters. Hyper-parameters are the configuration settings, like learning rates or layer sizes, that define how a model learns. The system uses a probabilistic model—a mathematical way of representing uncertainty—to predict which settings will yield the best performance. The core innovation is applying a non-linear mapping to these settings before the model processes them, which helps the system better understand complex, non-obvious relationships between settings and outcomes. By iteratively evaluating the system and updating the model, it converges on the optimal configuration.
What it doesn't cover
- —Does not cover manual tuning of machine learning models by human engineers.
- —Does not cover optimization methods that rely solely on linear transformations.
- —Does not cover the actual architecture of the neural network being tuned, only the process of selecting its hyper-parameters.
- —Does not cover non-probabilistic search methods like simple grid search or random search.
The clever bit
By using a non-linear one-to-one mapping to transform the input domain, the system can model complex, warped landscapes of hyper-parameters that traditional Gaussian processes might struggle to represent accurately.
Why it matters
Tuning machine learning models is notoriously time-consuming and expensive, often requiring massive computing resources. By automating this process, this technology enables faster development cycles and better-performing AI models. It represents a shift toward 'AutoML,' where the infrastructure itself learns how to optimize its own configuration.
Real-world examples
- 1.AutoML platforms for neural network architecture search
- 2.Hyper-parameter optimization frameworks for deep learning
- 3.Automated model training pipelines in cloud AI services
Generated by PatentBrief · Not legal advice · patentbrief.org
US 11501192 · 2026