中国科学院自动化研究所   设为首页   加入收藏  联系我们
 
English
网站首页     实验室概况     研究队伍     组织机构     学术交流     科研成果     人才培养     开放课题     创新文化     资源共享     联系我们
    学术讲座

Text to Speech without the Text

模式识别学术大讲堂

Advanced Lecture Series in Pattern Recognition

题    目 (TITLE):Text to Speech without the Text

(SPEAKER)Prof. Alan W Black (Carnegie Mellon University)

(CHAIR)Prof. Jianhua Tao

     (TIME):July 23(Wednesday), 2014, 16:00PM

    (VENUE):No.1 Conference Room (3rd floor), Intelligence Building

报告摘要(ABSTRACT):

The quality of data driven text to speech has moved from being just about understandable to high quality, high understandability and near natural quality for the world’s major languages.  However there are still issues in building speech technologies for the language beyond the top 10 languages of the world.  Given that once we go beyond even the top 100, many of these languages have ill-defined written forms and sometimes no standard writing systems at all.  Although some of the speakers of those languages may be literate in other languages, if we are to provide speech systems to everyone on the planet we need to address speech processing in environments where written forms do not exist.

This talk will describe initial steps in building text to speech systems for languages where no written form exists, or only a poor standard exists.  We show how new symbolic representations can be derived from acoustics only, using current and novel statistical modeling techniques. The results are sufficient to build understandable synthesizers and we show how that representation may be used in practical speech technologies.

报告人简介(BIOGRAPHY):

Alan W Black is a Professor in the Language Technologies Institute at Carnegie Mellon University.  He was born in Edinburgh, Scotland, and did his bachelors in Coventry, England, and his masters and doctorate at the University of Edinburgh. Before joining the faculty at CMU in1999, he worked in the Centre for Speech Technology Research at the University of Edinburgh, and before that at ATR in Japan.  He is one of the principal authors of the free software Festival Speech Synthesis System, the FestVox voice building tools and CMU Flite, a small footprint speech synthesis engine that is the basis for many research and commercial systems around the world.  He also works in spoken dialog systems, the Lets Go Bus Information project and mobile speech-to-speech translation systems.  Prof Black is an elected member of ISCA board (2007-2015).  He has over 200 refereed publications and is one of the highest cited authors in his field.

友情链接
 
中科院自动化研究所 模式识别国家重点实验室 事业单位  京ICP备14019135号-3
NLPR, INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES