中国科学院自动化研究所   设为首页   加入收藏  联系我们
 
English
网站首页     实验室概况     研究队伍     组织机构     学术交流     科研成果     人才培养     开放课题     创新文化     资源共享     联系我们
    学术讲座

2010年3月18日:模式识别系列讲座

题    目(TITLE):Automatic Ontology Construction
讲 座 人(SPEAKER): Prof. Lu Qin, Department of Computing, the Hong Kong Polytechnic University,
主 持 人 (CHAIR):Prof. Chengqing Zong
时    间 (TIME):10:00am, March 18, 2010
地    点 (VENUE): Room 1115, 11th floor
报告摘要ABSTRACT):
An ontology representing a domain specific knowledge space is constructed through domain specific terms.  The concepts behind these terms are described by certain attributes, and the relations among the different concepts.  The discovery of new terms is most useful only if it can be used to extend the knowledge of a domain. This requires the identification of new terms and finding relationship of these new terms with existing concepts in the ontology.
This talk is composed of two part. In the first part, a new terminology extraction method will be presented. The algorithm identifies features of the relatively stable and domain independent term delimiters rather than that of the terms. For term verification, a link analysis based method is proposed to calculate the relevance between term candidates and the sentences in the domain specific corpus from which the candidates are extracted.
In the second part, a clustering based method is presented for domain relevant relation extraction including both relation type discovery and relation instance extraction. Given two raw corpora, one in the general domain, one in an application domain, domain specific verbs connecting different instances are extracted based on syntactic dependency as well as a small set of domain concept instance seeds. Relation types are then discovered based on verb clustering followed by relation instance extraction.
报告人简介(BIOGRAPHY)
Prof. Lu has over 20 years of working experience both in academic research and in industrial applications on open systems design, standardization, and natural language processing. Her earlier work on codeset announcement has been widely adopted in modern operating systems and programming languages to handle different encodings for different language environments. She has successfully helped to make structured encoding of Chinese character components which lead to a much faster encoding process for Chinese characters. She spearheaded the standardization of the Hong Kong Supplementary Character Set, the first and only commonly adopted character set for Hong Kong. She is the Rapporteur of the ISO/IEC JTC1/SC2/WG2/IRG.
Prof. Lu’s research work is mostly focused on using natural language processing method on information extraction and text mining. She has conducted extensive work on Chinese collocation extraction, terminology extraction, and ontology construction. Her research has received over 2million funding from the CERG and over 10million funding from ITf. Her leadership has also lead to the completion of a Hong Kong Jockey Club funded project, ASAB98, using text-to-speech technology to assist the visually blind to access computers and the internet. Prof. Lu received her B.S. in E.E. from Beijing Normal University, M.S. and Ph.D. in computer science from the University of Illinois at Urbana-Champaign.
 
承办单位:模式识别国家重点实验室
友情链接
 
中科院自动化研究所 模式识别国家重点实验室 事业单位  京ICP备14019135号-3
NLPR, INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES