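Basic Information

Ya Li (李雅), female, master's supervisor, National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy of Sciences
Associate Professor and master's supervisor. Her research focuses on speech synthesis, affective computing, and human-computer interaction. She has published more than 50 papers in major domestic and international journals and conferences, including Speech Communication, INTERSPEECH, and ICASSP, and holds one granted national patent. She serves as a reviewer for international journals including IEEE ASLP, Speech Communication, and IEEE TAC. She has been invited on several occasions to serve as session chair and program committee member for major conferences in the field, such as ACII, ISCSLP, and NCMMSC. She currently leads one National Natural Science Foundation of China (NSFC) project and has participated as a key researcher in several NSFC projects and National 863 Program projects. Her work "High-Performance Speech Processing Technology with Personalized Adaptation Capability and Its Applications" received the Second Prize of the 2014 Beijing Science and Technology Award. Graduate students she supervised won the Best Student Paper award at INTERSPEECH 2016, a top conference in the speech field, and placed second in the international dimensional emotion recognition challenges AVEC 2014 and AVEC 2015.
Email: yli@nlpr.ia.ac.cn
Address: 95 Zhongguancun East Road, Haidian District, Beijing
Postal code: 100190

Research Areas

Intelligent human-machine speech interaction is a simple and convenient mode of human-computer interaction. It comprises speech recognition, emotion recognition, semantic understanding, speech synthesis, and related technologies. With the rapid development of big data and computing technology, and of deep neural networks in particular, intelligent speech interaction has achieved a series of major breakthroughs in recent years, making practical deployment possible.
Her main research areas are speech synthesis, multimodal emotion recognition, and natural human-computer dialogue.
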
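Education

September 2007 - June 2012, National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Ph.D., advisor: Jianhua Tao
September 2003 - June 2007, Department of Automation, University of Science and Technology of China, Bachelor's degree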
 
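Work Experience

November 2015 - present, National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Associate Professor
May 2014 - September 2014, School of Computer Science and Statistics, Trinity College Dublin, Ireland, Research Fellow
December 2012, The University of Tokyo, Japan, Visiting Scholar
July 2012 - October 2015, National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Assistant Professor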

Professional Service:
Standing Committee of the National Conference on Man-Machine Speech Communication (NCMMSC), Member, 2015-
NCMMSC 2015, Special Session on Affective Computing, Co-chair, 2015
Program Committee Member of ISCSLP, 2014-
Sponsorship Chair of the International Conference on Affective Computing and Intelligent Interaction 2015, 2015-
Program Chair of the AMAI workshop, 2015
Program Chair of the Multimodal Emotion Recognition Challenge, 2016-
 
 
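Patents and Awards

Patents:
1) A method for hierarchical modeling and prediction of Mandarin stress, ZL201110200330.1, Jianhua Tao, Ya Li
Awards:
1) "High-Performance Speech Processing Technology with Personalized Adaptation Capability and Its Applications", Second Prize of the 2014 Beijing Science and Technology Award, second contributor.
2) Best Student Paper at the top conference INTERSPEECH 2016, awarded to a supervised graduate student; third author.
3) Second place in the AVEC 2014 and AVEC 2015 international dimensional emotion recognition challenges.
4) "HMM-based Speech Synthesis System Using a Stress Adjustment Model", Best Student Paper nomination, 2011 National Conference on Man-Machine Speech Communication, first author.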
 
Publications
 
[1]. Ya Li, Jianhua Tao, Wei Lai, Xiaoying Xu, "Quantitative intonation modeling of interrogative sentences for Mandarin Speech Synthesis", Speech Communication, 2017.
[2]. Ya Li, Jianhua Tao, Linlin Chao, Wei Bao, Yazhu Liu, "CHEAVD: a Chinese natural emotional audio–visual database", Journal of Ambient Intelligence and Humanized Computing, 2016, DOI: 10.1007/s12652-016-0406-z.
[3]. Ya Li, Jianhua Tao, Keikichi Hirose, Xiaoying Xu, Wei Lai, "Hierarchical stress modeling and generation in Mandarin for expressive Text-to-Speech", Speech Communication, 2015, Vol. 72, pp. 59-73.
[4]. Hao Che, Ya Li*, Jianhua Tao, Zhengqi Wen, "Investigating Effect of Rich Syntactic Features on Mandarin Prosodic Phrase Boundaries Prediction", Journal of Signal Processing Systems, 2016, 82(2):263-271.
[5]. Wei Lai, Jiahong Yuan, Ya Li, Xiaoying Xu, Mark Liberman, "The rhythmic constraint of prosodic boundaries in Chinese Mandarin based on corpora of silent reading and speech perception", INTERSPEECH 2016, pp. 87-91. (Best Student Paper)
[6]. Yibin Zheng, Ya Li, Zhengqi Wen, Xingguang Ding, Jianhua Tao, "Improving Prosodic Boundaries Prediction for Mandarin Speech Synthesis by Using Enhanced Embedding Feature and Model Fusion Approaches", INTERSPEECH 2016, pp. 3201-3205, Sep 8-12, 2016.
[7]. Ya Li, J. Tao, B. Schuller, S. Shan, D. Jiang, J. Jia, "MEC 2016: The multimodal emotion recognition challenge of CCPR 2016", Chinese Conference on Pattern Recognition (CCPR), Chengdu, China, 2016, pp. 667-678.
[8]. Zhengqi Wen, Ya Li, Jianhua Tao, "The Parameterized Phoneme Identity Feature as a Continuous Real-Valued Vector for Neural Network based Speech Synthesis", INTERSPEECH 2016, pp. 2248-2252, Sep 8-12, 2016.
[9]. Yibin Zheng, Ya Li, Zhengqi Wen, Bin Liu, Jianhua Tao, "Investigating Deep Neural Network Adaptation for Generating Exclamatory and Interrogative Speech in Mandarin", 10th International Symposium on Chinese Spoken Language Processing (ISCSLP 2016), Oct 17-20, 2016.
[10]. Yibin Zheng, Ya Li, Zhengqi Wen, Bin Liu, Jianhua Tao, "Text-based sentential stress prediction using continuous lexical embedding for Mandarin speech synthesis", 10th International Symposium on Chinese Spoken Language Processing (ISCSLP 2016), Oct 17-20, 2016.
[11]. Ya Li, Nick Campbell, Jianhua Tao, "Voice Quality: Not Only About 'You' But Also About 'Your Interlocutor'", ICASSP 2015, pp. 4739-4743.
[12]. Ya Li, Yazhu Liu, Wei Bao, Linlin Chao, Jianhua Tao, "From Simulated Speech to Natural Speech, What are the Robust Features for Emotion Recognition?", The Sixth International Conference on Affective Computing and Intelligent Interaction (ACII 2015), pp. 368-373.
[13]. Ya Li, Jianhua Tao, Keikichi Hirose, Wei Lai, Xiaoying Xu, "Hierarchical stress generation with Fujisaki model in expressive speech synthesis", Speech Prosody 2014, May 20-23, pp. 1032-1036, Dublin, Ireland.
[14]. Wei Bao, Ya Li, Mingliang Gu, Jianhua Tao, Linlin Chao, Shanfeng Liu, "Combining Prosodic and Spectral Features for Mandarin Intonation Recognition", The 9th International Symposium on Chinese Spoken Language Processing (ISCSLP 2014), pp. 497-500, September 12-14, 2014, Singapore.
[15]. Wei Bao, Ya Li, Mingliang Gu, Minghao Yang, Hao Li, Linlin Chao, Jianhua Tao, "Building a Chinese Natural Emotional Audio-visual Database", 2014 International Conference on Signal Processing (ICSP 2014), pp. 583-587, Oct 19-23, Hangzhou, China.
[16]. Wei Lai, Ya Li, Hao Che, Shanfeng Liu, Jianhua Tao, Xiaoying Xu, "Final Lowering Effect in Questions and Statements of Chinese Mandarin Based on a Large-scale Natural Dialogue Corpus Analysis", Speech Prosody 2014, May 20-23, pp. 653-657, Dublin, Ireland.
[17]. Ya Li, Xuefei Liu, Xiaoying Xu, Jianhua Tao, "Assign Stress for Interrogative Sentences via Syntax Structure Mapping", Speech Prosody 2012, May 22-25.
[18]. Ya Li, Jianhua Tao, Xiaoying Xu, "Hierarchical Stress Modeling in Mandarin Text-to-Speech", INTERSPEECH 2011, Florence, Italy, pp. 2013-2016.
[19]. Ya Li, Jianhua Tao, Meng Zhang, Shifeng Pan, Xiaoying Xu, "Text-based unstressed syllable prediction in Mandarin", INTERSPEECH 2010, September 26-30, Makuhari, Chiba, Japan, pp. 1752-1755.
[20]. Ya Li, Shifeng Pan, Jianhua Tao, "HMM-based Expressive Speech Synthesis with a Flexible Mandarin Stress Adaptation Model", ICSP 2010, Beijing, Oct 24-28, pp. 625-628.
 
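Research Activities

1) Research on prosody modeling and evaluation methods for speech assessment, NSFC, 2014-2016, principal investigator
2) Research on emotional speech modeling and generation based on dimensional models, NSFC, 2013-2015, key researcher
3) Multimodal natural interaction technology for mobile terminals, National 863 Program, 2015-2017, key researcher
4) Cross-lingual and cross-cultural research on the speech production and cognition of social affect, key social science project, 2014-2018, participant
5) Adaptive speech synthesis based on cross-lingual prosody models, NSFC, 2010-2012, participant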
 
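Collaboration

Has collaborated on speech technology on multiple occasions with companies including Baidu, Samsung, Toshiba, and Tencent.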