National Laboratory of Pattern Recognition

Institute of Automation, Chinese Academy of Sciences

Text to Speech without the Text

Advanced Lecture Series in Pattern Recognition

Title: Text to Speech without the Text

Speaker: Prof. Alan W Black (Carnegie Mellon University)

Chair: Prof. Jianhua Tao

Time: July 23 (Wednesday), 2014, 16:00

Venue: No. 1 Conference Room (3rd floor), Intelligence Building

Abstract:

The quality of data-driven text to speech has moved from being just about understandable to high quality, high understandability, and near-natural quality for the world’s major languages. However, there are still issues in building speech technologies for languages beyond the world’s top 10. Once we go beyond even the top 100, many languages have ill-defined written forms and sometimes no standard writing system at all. Although some speakers of those languages may be literate in other languages, if we are to provide speech systems to everyone on the planet, we need to address speech processing in environments where written forms do not exist.

This talk will describe initial steps in building text to speech systems for languages where no written form exists, or where only a poor standard exists. We show how new symbolic representations can be derived from acoustics alone, using current and novel statistical modeling techniques. The results are sufficient to build understandable synthesizers, and we show how that representation may be used in practical speech technologies.

Biography:

Alan W Black is a Professor in the Language Technologies Institute at Carnegie Mellon University. He was born in Edinburgh, Scotland, and completed his bachelor's degree in Coventry, England, and his master's and doctorate at the University of Edinburgh. Before joining the faculty at CMU in 1999, he worked in the Centre for Speech Technology Research at the University of Edinburgh, and before that at ATR in Japan. He is one of the principal authors of the free software Festival Speech Synthesis System, the FestVox voice building tools, and CMU Flite, a small-footprint speech synthesis engine that is the basis for many research and commercial systems around the world. He also works on spoken dialog systems, the Let's Go Bus Information project, and mobile speech-to-speech translation systems. Prof. Black is an elected member of the ISCA board (2007-2015). He has over 200 refereed publications and is one of the most highly cited authors in his field.
