Smart Speaker Voice Technology Overview


Voice Technologies Come Together for Smart Speakers

Nuance Communications

April 2018

Company Overview

Nuance Communications, Inc.

Leader in Conversational and Cognitive Solutions to Increase Business Productivity and Amplify Intelligence

Trusted Advisor

World-Class Technology

Global Footprint

Market Focus and Expertise

- 14 billion customer engagements per year across enterprises
- 4,300 patents and applications
- 14 billion cloud transactions in auto, IoT and services
- 10,000 healthcare organizations use Nuance solutions
- 80 languages across voice, NLU and text input
- 300 million patient stories shared annually
- 160 million voice-enabled vehicles shipped globally
- 111 billion output documents managed
- 150 million voiceprints enrolled globally

Voice Technologies Overview

Technology Stack

All Comes Together for Smart Speakers

Speech Signal Enhancement

Wakeup

Voice Biometrics

ASR

NLU

Dialog & VUI

AI & Reasoning

TTS

"Hey, Speaker."

Robust Audio Capture

Audio capture in noisy environments requires advanced signal acquisition

Features

- Noisy audio handling (NR)
- 360-degree WuW coverage (ASL, Multi-source)
- Suppression of off-axis interferers (BF, SSB)
- Echo cancellation for barge-in use cases (AEC)
- Distant talk
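As an illustration of the echo cancellation (AEC) item above, here is a minimal sketch of a normalized LMS (NLMS) adaptive filter, a textbook approach to removing the loudspeaker's own playback from the microphone signal so the user can barge in. The slides do not say which algorithm Nuance uses; the function names, tap count, step size and simulated echo path below are all assumptions made for the example.

```python
import random


def nlms_echo_cancel(far_end, mic, taps=8, mu=0.5, eps=1e-8):
    """Subtract an adaptively estimated loudspeaker echo from the mic signal.

    far_end: samples sent to the loudspeaker; mic: samples captured by the mic.
    Returns (residual, weights). All parameter names are illustrative.
    """
    w = [0.0] * taps      # running estimate of the echo path
    buf = [0.0] * taps    # most recent far-end samples, newest first
    residual = []
    for x, d in zip(far_end, mic):
        buf = [x] + buf[:-1]
        echo_est = sum(wi * bi for wi, bi in zip(w, buf))
        e = d - echo_est                        # mic signal minus estimated echo
        norm = sum(b * b for b in buf) + eps    # input power normalizes the step
        w = [wi + mu * e * bi / norm for wi, bi in zip(w, buf)]
        residual.append(e)
    return residual, w


def simulate_echo(far_end, path):
    """Convolve the far-end signal with a hypothetical echo path."""
    return [
        sum(path[k] * far_end[n - k] for k in range(len(path)) if n - k >= 0)
        for n in range(len(far_end))
    ]


if __name__ == "__main__":
    rng = random.Random(0)
    far = [rng.uniform(-1.0, 1.0) for _ in range(4000)]
    mic = simulate_echo(far, [0.6, -0.3, 0.1])  # echo only, no near-end talker
    residual, w = nlms_echo_cancel(far, mic)
    print("estimated echo path:", [round(v, 3) for v in w[:3]])
```

In a real device the microphone also carries the user's speech, so adaptation is normally slowed or frozen while the user is talking; that double-talk control is omitted here.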

Self-Steering Beamformer

Dual Microphone Babble-Noise Rejection

[Figure: signal-flow diagram. Labels: MIC 1, MIC 2, Interference Cancellation, Interference Suppression, Adaptation Control, Speech, Noise.]
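To make the two-microphone idea concrete, the sketch below shows the simplest fixed beamformer, delay-and-sum. It is not the self-steering, adaptive design named on the slide: the steering delay is given by hand rather than estimated, and the scene (a talker plus a single interfering tone) is a hypothetical one constructed for the example.

```python
import math


def delay_and_sum(mic1, mic2, delay_samples):
    """Average mic1 with mic2 advanced by an integer steering delay.

    Sound from the steered direction adds coherently; sound from other
    directions adds with a phase offset and is attenuated.
    """
    out = []
    for n in range(len(mic1)):
        m = n + delay_samples
        aligned = mic2[m] if 0 <= m < len(mic2) else 0.0
        out.append(0.5 * (mic1[n] + aligned))
    return out


def two_mic_scene(num_samples=200):
    """Hypothetical scene: the talker reaches mic 2 two samples late, while
    an interfering tone (period 8 samples) reaches mic 2 two samples early."""

    def talker(n):
        return math.sin(2 * math.pi * n / 50)

    def tone(n):
        return math.sin(2 * math.pi * n / 8)

    mic1 = [talker(n) + tone(n) for n in range(num_samples)]
    mic2 = [talker(n - 2) + tone(n + 2) for n in range(num_samples)]
    clean = [talker(n) for n in range(num_samples)]
    return mic1, mic2, clean


if __name__ == "__main__":
    mic1, mic2, clean = two_mic_scene()
    out = delay_and_sum(mic1, mic2, delay_samples=2)
    # The last two samples have no aligned mic 2 data, so they are skipped.
    err = max(abs(o - c) for o, c in zip(out[:-2], clean[:-2]))
    print(f"max error vs. clean talker: {err:.2e}")
```

The interferer was chosen so that steering toward the talker places the tone exactly half a period out of phase between the two channels, which is the best case for a fixed beamformer. Adaptive interference cancellation, as labeled in the diagram, exists to handle interferers that do not line up so conveniently.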

De-reverberation

Specific Environment at Home

[Figure: Reverberation Suppression driven by Spectral Estimation of Reverberation Energy. Labels: Direct-Sound Signal, Room Reverberation.]
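One common way to realize "spectral estimation of reverberation energy" followed by suppression is to assume the room's energy decays exponentially and to predict late reverberation from earlier frames. The sketch below does this for a single frequency band. It is a generic statistical model, not necessarily the method behind the slide, and the parameter names (`t60_s`, `delay_frames`, `floor`) are assumptions.

```python
import math


def late_reverb_gains(frame_power, t60_s, frame_shift_s, delay_frames=4, floor=0.1):
    """Per-frame power gains that suppress estimated late reverberation.

    frame_power: short-time power of one frequency band, one value per frame.
    t60_s: assumed reverberation time (seconds for energy to drop by 60 dB).
    The late-reverberation power at frame t is predicted as the power
    delay_frames earlier, scaled by the exponential room decay.
    """
    delta = 3.0 * math.log(10.0) / t60_s   # decay constant implied by T60
    decay = math.exp(-2.0 * delta * delay_frames * frame_shift_s)
    gains = []
    for t, p in enumerate(frame_power):
        late = decay * frame_power[t - delay_frames] if t >= delay_frames else 0.0
        g = 1.0 - late / p if p > 0 else floor   # spectral-subtraction style gain
        gains.append(max(g, floor))              # floor limits audible artifacts
    return gains


if __name__ == "__main__":
    shift, t60 = 0.01, 0.5
    # A pure reverberant tail: power falls by 60 dB over t60 seconds.
    tail = [10 ** (-6 * t * shift / t60) for t in range(20)]
    print(late_reverb_gains(tail, t60, shift))
```

Frames that contain only a decaying tail are pushed down to the floor, while a fresh onset of direct sound, whose power is far above the predicted tail, passes with a gain close to 1.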

Gateway to Speech Experience

A hands-free, always-available technology that gives access to the next generation of intelligent agents

Wakeup

DSP/Sensor Hub Integrations

Providing persistent wakeup capability with minimal battery drain

Customizable Phrases

OEMs can define a branded wakeup phrase or let users create their own personalized phrases

Frictionless Personalization

Best-in-breed unification of core technologies enabling convenient user profile creation and speaker identification

Rich SDK and API support for a range of customizations to create specialized application experiences

Voice Biometrics

Real-World Applications

Easily coupled with a wakeup word to support one-shot device/application wakeup plus user authentication

Delivers convenient access to multiple user profiles on a single, shared device

Local and Secure

All processing and data are handled locally on the device, with no requirement for cloud services
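The one-shot "wakeup plus authentication" flow on a shared device can be sketched as follows. The example assumes that a wake-word detector has already produced a confidence score and that a speaker model has turned the same utterance into a fixed-length embedding; both components, the thresholds and the function names are hypothetical and do not come from any Nuance SDK. Everything runs locally against voiceprints stored on the device.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def identify_speaker(embedding, enrolled, threshold=0.7):
    """Return the best-matching enrolled profile name, or None if no
    voiceprint scores at or above the threshold."""
    best_name, best_score = None, threshold
    for name, voiceprint in enrolled.items():
        score = cosine(embedding, voiceprint)
        if score >= best_score:
            best_name, best_score = name, score
    return best_name


def one_shot_wakeup(wake_score, embedding, enrolled, wake_threshold=0.5):
    """Return (awake, user): the device wakes on the phrase, and user is
    set only when the same utterance also matches an enrolled voiceprint."""
    if wake_score < wake_threshold:
        return False, None
    return True, identify_speaker(embedding, enrolled)


if __name__ == "__main__":
    # Toy three-dimensional voiceprints; real embeddings are much larger.
    enrolled = {"alice": [1.0, 0.0, 0.0], "bob": [0.0, 1.0, 0.0]}
    print(one_shot_wakeup(0.9, [0.9, 0.1, 0.0], enrolled))
    print(one_shot_wakeup(0.9, [0.5, 0.5, 0.7], enrolled))
```

Returning the wake decision and the identified user separately lets an unrecognized speaker still wake the device as a guest, while personalized profiles stay reserved for enrolled users.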
