语音合成技术及应用

合集下载

相关主题

1、下载文档前请自行甄别文档内容的完整性，平台不提供额外的编辑、内容补充、找答案等附加服务。
2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
3、如文档侵犯您的权益，请联系客服反馈,我们会尽快为您处理(人工客服工作时间：9:00-18:30)。

摘要 (1)

关键词 (1)

Abstract (1)

Key words (2)

引言（或绪论） (2)

1 语音合成技术及其发展 (3)

1.1 语音合成技术 (3)

1.2 语音合成技术的发展 (4)

2 语音合成的关键技术 (5)

2.1 语音合成技术简介 (5)

2.2 TTS系统的组成 (5)

2.2.1 文本生成 (6)

2.2.2韵律的生成 (6)

2.2.3 语音生成 (6)

3 汉语语音合成技术的难点 (7)

3.1汉语语音的特征 (7)

3.2汉语语音合成的难点 (7)

4 语音合成技术的应用 (8)

5 总结 (9)

致谢 (9)

参考文献 (9)

语音合成技术及应用

电子信息工程学生刘志坚

指导教师杨尚国

摘要：现代社会已经进入数字化信息时代,网络技术和多媒体技术获得迅猛发展,计算机与人之间的交互日益频繁。如何使电脑具有类似于人一样的听、说能力,成为自90年代以来信息产业的研究热点。要建立一个具有听、说能力的计算机语音系统,必需的两项关键技术就是语音识别技术与语音合成技术。同语音识别技术相比,语音合成技术相对成熟一些,是该领域中近期最有希望产生突破性进展并形成产业化的技术,而汉语语音合成的实用化更将成为中国计算机产业的下一个亮点。介绍信息技术处理领域的一项前沿技术——语音合成技术。简述了语音合成技术的发展历史以及目前国内外在此研究领域的最新成果。讨论了在语音合成技术中用到的一些方法并对这些方法作了简单地分析。简述了语音合成技术的基本工作原理以及从文字信息到语音输出的工作流程。对于当前语音合成中热点的文本分析、韵律生成、语音合成三项关键技术进行了剖析,并针对中文的文语特点,指出了中文语音合成技术的难点所在。简介了语音合成技术的应用领域。

关键词:语音合成语音识别文语转换系统汉语文语转换系统TTS技术

Speech synthesis technique and its application

Student majoring in Electronic Information Engineering

Name liuzhijian

Tutor yangshangguo

Abstract: With the coming of the digital information era, network and multimedia technology are developing in a tremendous speed. The interaction between computer and man is increasing greatly.How to make the computer have the same listening and speaking ability as human being has becomeThe focus of research of the information industry since 1990s. To establish a computer system which has listening and speaking ability, Voice Identification and Voice Synthesis are the two key technologies. Comparing with the Voice Identification technology, Voice Synthesis technology is somewhat more mature and is the most promising technology which can bring forth breakthrough development and realize industrialization. Meanwhile, the utilization of Chinese voice synthesis will become the next hotspot of China computer industry.It recommends a forward position information disposal technology of the field, the synthetic technology of the pronunciation, sketches out the developing history of the research field and the recent achievements from China and over-seas, discusses and analyses briefly the methods used in pronunciation synthetic technology, explain the basic operation principles of the pronunciation synthetic technology and work flow from characters information to pronunciation output.This paper analyzes Text Analysis, RhythmGeneration and Speech

Generation, the three key technologies which are the hot spots of voice synthesis, and points out the difficulties that may come up according to the characteristics of Chinese language.In last,the application

field is recommended.

Keywords: voice synthesis; voice identification;text to speech system; Chinese text to speech system;TTS technology

引言通过对语音合成技术的学习和研究，掌握语音合成技术的基本理论并在此

基础上深入学习，阐述以前语音合成的方法并学习现在语音合成技术的主流方法。对此技术的应用也应知道，找到在应用时的难点。

1 语音合成技术及其发展

1.1 语音合成技术

在计算机系统中, 语音应用技术主要是指基于语音进行处理的技术, 主要包语音识别技术和语音合成技术, 是信息技术处理领域的一项前沿技术。

语音识别( SR, SpeechRecongnition) 技术是指计算机系统能够根据输入的语音识别出其代表的具体意义, 进而完成相应的功能。一般的方法是事先让用户朗读有一定数量文字、符号的文档, 通过录音装置输入到计算机, 于是计算机就准备好了用户的声音样本。以后, 当用户通过语音识别系统操作计算机时, 用户的声音通过转换装置进入计算机内部, 语音识别技术便将用户输入的声音与事先存储好的声音样本进行对比。系统根据对比结果, 输入一个它认为最“象”的声音样本序号, 就可以知道用户刚才念的声音是什么意义, 进而执行此命令。因此通过语音识别技术, 计算机可以“听”懂人类的语言。