智能音箱语音技术概览
- 1、下载文档前请自行甄别文档内容的完整性,平台不提供额外的编辑、内容补充、找答案等附加服务。
- 2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
- 3、如文档侵犯您的权益,请联系客服反馈,我们会尽快为您处理(人工客服工作时间:9:00-18:30)。
Voice Technologies Come Together for Smart Speakers
Nuance Communications
April 2018
Company Overview
Nuance Communications, Inc.
Leader in Conversational and Cognitive Solutions to Increase Business Productivity and Amplify Intelligence
Trusted Advisor
World-Class Technology Global Footprint
Market Focus and Expertise 14 Billion customer engagements per year across enterprises
4,300patents and applications 14 Billion
cloud transactions in auto, IoT and services
10,000
Healthcare
organizations use Nuance solutions
80
languages across voice, NLU and text input
300 Million
patient stories shared annually
160 Million
voice-enabled vehicles shipped globally
111 Billion
output documents managed
150 Million
voiceprints enrolled globally
Voice Technologies Overview
Technology Stack
All Comes Together for Smart Speakers
Speech Signal Enhancement
Wakeup
Voice Biometrics
ASR NLU
Dialog & VUI
AI & Reasoning
TTS
Hey,
Speaker. Robust Audio Capture
Audio capture in noisy environments
requires advanced signal acquisition
Features
-Noisy audio handling (NR)
-360-degree WuW coverage (ASL,Multi-
source)
-Suppression of off-axis interferers (BF,
SSB)
-Echo cancellation for barge-in use cases
(AEC)
-Distant talk
Self-Steering Beamformer
Dual Microphone Babble-Noise Rejection
Interference Cancellation Interference Suppression
MIC 1
MIC 2
Adaptation Control
Speech
Noise
Noise
Noise
De-reverberation
Specific Environment at Home
Reverberation
Suppression
Spectral Estimation
of Reverberation
Energy Direct-Sound Signal Room Reverberation
Gateway to Speech Experience
A hands-free, always available technology enables the access to the next generation of intelligent agents
Wakeup
DSP/Sensor Hub Integrations
Providing persistent wakeup capability with minimal battery drain
Customizable Phrases
OEM can define branded wakeup phrase or enable users to create their own personalized phrases
Frictionless Personalization
Best-in-breed unification of core
technologies enabling convenient user profile creation and speaker identification Rich SDK and API support for a range of customizations to create specialized application experiences
Voice Biometrics
Real-World Applications
Easily coupled with wakeup word to support one-shot device/application wakeup + user authentication
Delivers convenient access to multiple user profiles on a single, shared device
Local and Secure
All processing and data is handled locally on the device with no requirement for cloud services