Human Machine Speech Interaction Group


Lab Name and Affiliation

Human Machine Speech Interaction Group

National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences

Lab Director (or Principal Investigator)

Jianhua Tao (M'98) received the M.S. degree from Nanjing University, Nanjing, China, in 1996 and the Ph.D. degree from Tsinghua University, Beijing, China, in 2001.
He is currently a Professor with the National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing. His current research interests include speech synthesis and recognition, human-computer interaction, and emotional information processing. He has published more than 60 papers in major journals and proceedings, including the IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, ICASSP, Interspeech, ICME, ICPR, ICCV, and ICIP. In 2006, he was elected Vice-Chairperson of the ISCA Special Interest Group on Chinese Spoken Language Processing (SIG-CSLP) and an Executive Committee member of the HUMAINE Association. He is a Subject Editor for Speech Communication (SPEECH COMMUN), an Editorial Board member of the Journal on Multimodal User Interfaces (JMUI) and the International Journal of Synthetic Emotions (IJSE), and a Steering Committee member of the IEEE TRANSACTIONS ON AFFECTIVE COMPUTING.

Lab Introduction

The Institute of Automation of the Chinese Academy of Sciences (CASIA) has conducted research on human machine speech interaction since the 1980s. Over these thirty years, the laboratory has undertaken many projects in this field, often as the lead group, funded by the National Natural Science Foundation of China, the National High Technology 863 Program, the National Key Fundamental Research Program, and the Military Advanced Research Program. Its speech synthesis product won first prize in the international TC-STAR speech synthesis evaluation in 2007. The laboratory is one of the main contributors to the national technical standard on common technical specifications for Chinese speech synthesis and to the W3C international standard Speech Synthesis Markup Language (SSML) 1.1. It is also the only Chinese member of the European Center of Excellence in Speech Synthesis. The laboratory has published more than 100 papers in the IEEE Transactions on Audio, Speech, and Language Processing and other top international journals and conferences in speech technology. Its research and development covers acoustic modeling, rhythm modeling, text analysis, rhythm description languages, digital speech coding, multimedia, and related areas. In 2005, it developed a new generation of TTS software. Its technologies have been granted five national invention patents and three software copyrights.

Lab Contact E-mail