# How Voice Assistants Change Their Speech Based on How You Talk

> This patent describes a system where a voice-controlled device adjusts its text-to-speech output characteristics, like speed or tone, based on the user's speaking habits or current situation, making responses feel more natural and personalized.

- **Patent:** US 10276149
- **Original title:** Dynamic text-to-speech output
- **Owner:** Amazon Technologies
- **Granted:** 2019
- **Status:** Active
- **Times cited:** 26
- **Field:** consumer_electronics, software, telecommunications, ai_ml

## What it does

This patent details a method for a computer system to dynamically adjust how it speaks to a user. First, it receives a user's spoken command, called a "first utterance," and figures out how many words were in it (claim 1). It then compares this to the user's typical speaking patterns, stored in their "user profile data," specifically an "average number of words of a user command." If the current command's word count "deviates from the average," the system uses this difference to select a special speech style, or "template text data." It then takes the actual response text and applies this special style, generating audio with a "non-default characteristic of speech" (claim 1). For example, if a user usually speaks long commands but suddenly gives a very short one, the system might interpret that as urgency and respond faster. The patent also covers adjusting speech rate if a user's calendar shows they are busy soon (claim 2) or based on their geographic location (claim 6).

## What it does NOT cover

- Does not cover a system that always responds with a default speech characteristic, regardless of user input or context.
- Does not cover adjusting speech characteristics solely based on the content of the response, without considering user input characteristics or contextual data.
- Does not cover a system that changes its speech without first comparing the current user input to an 'average' characteristic from the user's profile.
- Does not cover adjusting speech based on biometric data like heart rate or facial expression, unless those directly influence a speech characteristic like 'number of words'.
- Does not cover a system that adjusts its output purely based on the server's processing load or network conditions.

## The clever bit

The clever part is not just responding to a command, but dynamically altering the *way* the response is delivered based on the user's current speaking pattern (specifically, how it deviates from their average) or relevant contextual information like their calendar or location. This allows for a more adaptive and human-like interaction.

## Real-world examples

1. Amazon Alexa
2. Google Assistant
3. Apple Siri
4. Microsoft Cortana
5. Most modern smartphone voice assistants

## Why it matters

This patent is significant because it allows voice assistants to provide more personalized and context-aware responses, moving beyond generic, one-size-fits-all interactions. By adapting speech characteristics like speed or tone, it can make interactions feel more natural and efficient for the user. This kind of personalization is crucial for improving user experience in devices like Amazon's Alexa, Google Assistant, and Apple's Siri.

## Frequently asked questions

### What does How Voice Assistants Change Their Speech Based on How You Talk cover?

This patent describes a system where a voice-controlled device adjusts its text-to-speech output characteristics, like speed or tone, based on the user's speaking habits or current situation, making responses feel more natural and personalized.

### Who owns patent US 10276149?

Amazon Technologies owns this patent, granted in 2019.

### When does this patent expire?

This patent is expected to expire on December 21, 2036, when the invention enters the public domain.

### What is patent US 10276149 cited by?

This patent has been cited by 26 later patents that build on its ideas.

### What problem does this patent solve?

This patent is significant because it allows voice assistants to provide more personalized and context-aware responses, moving beyond generic, one-size-fits-all interactions. By adapting speech characteristics like speed or tone, it can make interactions feel more natural and efficient for the user. This kind of personalization is crucial for improving user experience in devices like Amazon's Alexa, Google Assistant, and Apple's Siri.

### What does this patent NOT cover?

Does not cover a system that always responds with a default speech characteristic, regardless of user input or context.

**Full plain-English explainer:** https://patentbrief.org/patent/us/10276149/dynamic-text-to-speech-output

**Original patent:** https://patents.google.com/patent/US10276149

---

_Source: PatentBrief — https://patentbrief.org. Patent facts are from public records; the plain-English explanation is PatentBrief's._