Haizhou Li, IEEE Fellow,
Presidential Chair Professor
CUHK Email: haizhouli at cuhk dot edu dot cn
Personal: http://www.colips.org/~eleliha/

Biography
Haizhou Li is the Dean
and X. Q. Deng Presidential Chair Professor at the School of Artificial Intelligence,
The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), China. He is
also with the Department of Electrical and Computer Engineering, National
University of Singapore (NUS), Singapore.
Haizhou Li received the
B.Sc., M.Sc.,
and Ph.D. degrees in electrical and electronic engineering from South China
University of Technology, Guangzhou, China, in 1984, 1987, and 1990,
respectively. He has worked on speech and language technology in academia and
industry since 1988. As an educator, he taught in The University of Hong Kong
(1988-1989), South China University of Technology in Guangzhou, China
(1990-1994), Nanyang Technological University in Singapore (2006-2016),
University of Eastern Finland (2009), and University of New South Wales (2011).
He was a researcher at CRIN/INRIA in France (1994-1995), a Research Manager in Apple-ISS Research
Centre (1996-1998), Research Director of Lernout & Hauspie Asia Pacific
(1999-2001), Vice President of InfoTalk Corp. Ltd and
General Manager of InfoTalk Technology (Singapore)
Pte Ltd (2001-2003), the Principal Scientist and Department Head of Human
Language Technology at the Institute for Infocomm
Research (2003-2016), and the Research Director of the Institute for Infocomm Research (2014-2016), the Agency for Science,
Technology and Research, Singapore. He co-founded Baidu-I2R Research Centre in
Singapore (2012). Dr. Li was known for his technical contributions to several
award-winning speech products, such as Apple's Chinese Dictation Kits for
Macintosh (1996) and Lernout & Hauspie's Speech-Pen-Keyboard Text Entry
Solution for Asian languages (1999). He was the architect of a series of major
technology deployments that include TELEFIQS voice-automated call centre service in Singapore Changi International Airport
(2001), voiceprint engine for Lenovo A586 Smartphone (2012), and Baidu Music
Search (2013).
Dr. Li's research
interests include automatic speech recognition, natural language processing and
neuromorphic computing. He has served as the Editor-in-Chief of IEEE/ACM
TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING (2015-2018), Associate
Editor (2008-2012) and Senior Area Editor (2014-2016) of IEEE/ACM TRANSACTIONS
ON AUDIO, SPEECH AND LANGUAGE PROCESSING, Associate Editor (2012-2013) of ACM
TRANSACTIONS ON SPEECH AND LANGUAGE PROCESSING, Computer Speech and Language
(2012-2024), Springer International Journal of Social Robotics (2008-2024), and
a Member of IEEE Speech and Language Processing Technical Committee
(2013-2015), Awards Board (2021-2023), Publications Board (2015-2018), and
Conference Board (2023-2024) of IEEE Signal Processing Society. He has served
as the President of the International Speech Communication Association (ISCA,
2015-2017), the President of Asia Pacific Signal and Information Processing
Association (APSIPA, 2015-2016), the President of the Chinese and Oriental
Language Information Processing Society (COLIPS, 2015-2024), the President of
the Asian Federation of Natural Language Processing (AFNLP, 2017-2018), and the
Vice President (Conferences) of IEEE Signal Processing Society (2024-2026). He
was the General Chair of ACL 2012, INTERSPEECH 2014, IWSDS 2019, ASRU 2019,
IEEE ICASSP 2022, APSIPA ASC 2025, the Local Arrangement Chair of
SIGIR 2008, ACL-IJCNLP 2009, EMNLP 2023, and the Technical Program Chair of
ISCSLP 1998, APSIPA Annual Summit and Conference 2010, IEEE Spoken Language
Technology Workshop 2014, and IEEE ChinaSIP 2015.
Dr. Li was the
recipient of National Infocomm Awards 2002,
Institution of Engineers Singapore (IES) Prestigious Engineering Achievement
Award 2013 and 2015, President's Technology Award 2013, and MTI Innovation
Activist Gold Award 2015 in Singapore. He was named one of the two Nokia
Visiting Professors in 2009 by Nokia Foundation, IEEE Fellow in 2014 for leadership
in multilingual, speaker and language recognition, ISCA Fellow in 2018 for
contributions to multilingual speech information processing, U Bremen
Excellence Chair Professor in 2019, Fellow of the Academy of Engineering
Singapore in 2022, Fellow of Asia Pacific Artificial Intelligence Association
in 2022, and DFG Mercator Fellow in 2022. Dr. Li is a member of ACL, ACM, and
APSIPA.
Dr. Li is currently leading the following research laboratories:
· Director, Human Language Technology Lab (HLT) at National University of Singapore, Singapore
· Director, Machine Listening Lab (MLL) at University of Bremen, Germany
· Director, Shenzhen Key Laboratory of Cross-Modal Cognitive Computing (C3 Lab), China
· Director, Language, Intelligence, and Machines Centre (LIMA), Shenzhen Loop Area Institute, China
· Director, Neuromorphic Auditory Perception Lab (NAP), Program for Guangdong Innovative and Entrepreneurial Teams, Guangdong, China
Distinctions
1. Fellow, DFG Mercator Fellow, 2022
2. Fellow, Asia-Pacific Artificial Intelligence Association, 2022
3. Fellow, Academy of Engineering Singapore, 2022
4. Bremen Excellence Chair Professor, Germany, 2019
5. Fellow of the International Speech Communication Association 2018 (citation: for contributions to multilingual speech information processing)
6. First Prize at 2nd International Collegiate Competition for Brain-Inspired Computing, Beijing, China, 2018
7. A*STAR Awards 2016 (A*STAR Borderless Awards: Autonomous Vehicle Programme)
8. PS21 ExCEL Awards 2015 Innovation Champion (Bronze), Prime Minister's Office, Singapore (Citation: for efforts in practicing application research to pursue fundamental understandings of speech recognition and machine translation technologies)
9. ASEAN Outstanding Engineering Achievement Award 2015, ASEAN Federation of Engineering Organizations (Citation: in recognition of an outstanding engineering project which has made significant contributions to the country's development - Speak to Me in My Language)
10. MTI Innovation Activist Gold Award 2015, Ministry of Trade and Industry, Singapore
11. Best Technology Show and Tell Award, INTERSPEECH 2014
12. IEEE Fellow 2014 (Citation: for leadership in multilingual, speaker and language recognition.)
13. President's Technology Award 2013, Singapore (Citation: for the outstanding contributions to human language technology that have empowered the industry and benefited the Asian society. See also photo and speech by Minister S. Iswaran at National Archives of Singapore)
14. IES Prestigious Engineering Achievement Award 2013, Singapore (voiceprint technology)
15. The Most Cited Article, Speech Communication, 2007-2013
16. Distinguished Alumni Awards, South China University of Technology, 2012 (SCUT 60th Anniversary)
17. Nokia Visiting Professor 2009, Nokia Foundation
18. Achiever of the Year 2007/08, Institute for Infocomm Research, A*STAR
19. The Enterprise Challenge Awards 2004, Prime Minister's Office, Singapore
20. National Infocomm Awards 2002, Infocomm Development Authority, Singapore
Best Papers
1. Best Paper Award, Qibing Bai, Shuai Wang, Zhijun Liu, Mingyang Zhang, Wei Rao, Yannan Wang, Haizhou Li, Diffusion-Based Method with TTS Guidance for Foreign Accent Conversion, The 14th ISCA International Symposium on Chinese Spoken Language Processing, Beijing, 7-10 November, 2024
2. CVPR 2022 Best Paper Finalist, Egocentric Vision (EgoVis) 2022/2023 Distinguished Paper Award, Ego4D: Around the World in 3,000 Hours of Egocentric Video
3. Best Paper Award 2022, Peiwen Li, Enze Su, Jia Li, Siqi Cai, Longhan Xie, and Haizhou Li, ESAA: An EEG-Speech Auditory Attention Detection Database, The 25th Conference of the Oriental COCOSDA (O-COCOSDA 2022), Hanoi, Vietnam, November 24 to 26, 2022
4. Best Paper Award 2021, Chen Zhang, Luis Fernando D'Haro, Thomas Friedrichs, Haizhou Li and Yiming Chen, Investigating the Impact of Pre-trained Language Models on Dialog Evaluation, The 12th International Workshop on Spoken Dialog System Technology, 15-17 November 2021, Singapore.
5. Best Paper Award 2021, Qian Xinyuan, Bidisha Sharma, Amine El Abridi and Haizhou Li, SLoClas: A Database for Joint Sound Localization and Classification, The 24th Conference of the Oriental COCOSDA, 18-20 November 2021, Singapore.
6. IEEE Computational Intelligence Magazine Outstanding Paper Award 2019, How the Brain Formulates Memory: A Spatio-Temporal Model, Jun Hu, Huajin Tang, Kay Chen Tan and Haizhou Li, IEEE Computational Intelligence Magazine, vol. 11, no. 4, pp. 56-68, May 2016
7. Featured productive and innovative author in speech and language processing (1965-2015) by the NLP4NLP Corpus, 2019
8. AI 2000 Speech Recognition Most Influential Scholars Honorable Mention (2009-2019)
9. Poster Presentation Award, A Dual Alignment Scheme for Improved Speech-to-Singing Voice Conversion, The 9th APSIPA Annual Summit and Conference, 12-15 December, 2017, Kuala Lumpur, Malaysia
10. Best Paper Award, Computer-Assisted Pronunciation Training: From Pronunciation Scoring Towards Spoken Language Learning, Nancy F. Chen, Haizhou Li, The 8th APSIPA Annual Summit and Conference, 13-16 December, 2016, Jeju, Korea
11. IEEE Computational Intelligence Society Outstanding TNNLS Paper Award 2016, Rapid Feedforward Computation by Temporal Encoding and Learning with Spiking Neurons, Qiang Yu, Huajin Tang, Kay Chen Tan and Haizhou Li, IEEE Transactions on Neural Networks and Learning Systems, Vol. 24, No. 10, pp. 1539-1552, 2013
12. Best Paper Award, Spoken Keyword Spotting Based on DTW, Jinyong Hou, Lei Xie, Peng Yang, Xiong Xiao, Zhixiang Liang, Haihua Xu, Lei Wang, Bin Ma, Hang Lu, Eng Siong Chng, Haizhou Li, China National Conference on Man-Machine Speech Communication (NCMMSC) 2015, October 2015, Tianjin, China
13. Best Paper Award, Parallel Inference of Dirichlet Process Gaussian Mixture Models for Unsupervised Acoustic Modeling: A Feasibility Study, Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, ZeroSpeech 2015 Challenge at INTERSPEECH 2015, 6-10 September 2015, Dresden, Germany
14. Best Paper M. Anandakrishnan Award, A Cloud-based Large Vocabulary Speech Recognition System for Tamil, Sunil Sivadas, Boon Pang Lim, Thai Ngoc Thuy Hoang Helen, Muthalagu Meyyappan, Bin Ma, and Haizhou Li, 14th Tamil Internet Conference 2015, 30 May - 1 June 2015, Singapore
15. Best Paper Award, The 4th Asia Pacific Signal and Information Processing Association Annual Summit and Conference, A Study on Spoofing Attack in State-of-the-Art Speaker Verification: the Telephone Speech Case, Zhizheng Wu, Tomi Kinnunen, Eng Siong Chng, Haizhou Li, and Eliathamby Ambikairajah, in Proc. APSIPA ASC 2012, 3-6 December, 2012, Hollywood, California, USA
Best Student Papers
1. Best
Student Paper Award, The Impact of Synchronized Visual and Auditory Attention
on Human Perception, Lichuan Jiang, Jiani Zhong, Muqing Jian, Xuanzhuo
Liu, Siqi Cai, Haizhou Li, 16th International Conference on Social
Robotics, 25-28 September 2024, Shenzhen, China
2. Best
Student Paper Award, Use of Claimed Speaker Models for Replay Detection, Gajan
Suthokumar, Kaavya Sriskandaraja,
Vidhyasaharan Sethu, Chamith Wijenayake,
Eliathamby Ambikairajah, Haizhou Li, The 10th APSIPA
Annual Summit and Conference, 12-15 November 2018, Honolulu, USA
3. Best
Student Paper Award, Perceptual Evaluation of Singing Quality, Chitralekha
Gupta, Haizhou Li, Ye Wang, The 9th APSIPA Annual Summit and Conference, 12-15
December, 2017, Kuala Lumpur, Malaysia
4. IEEE
Ganesh N. Ramaswamy Memorial Student Grant 2015, Source-Specific Informative
Prior for i-Vector Extraction, Sven Shepstone, Kong Aik Lee, Haizhou Li,
Zheng-Hua Tan, and Soren Holdt Jensen, ICASSP 2015,
19-24 April 2015, Brisbane, Australia
5. IEEE
Ganesh N. Ramaswamy Memorial Student Grant 2014, Minimum Divergence Estimation
of Speaker Prior in Multi-Session PLDA Scoring, Liping
Chen, Kong Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li Rong Dai, ICASSP 2014, 4-9
May 2014, Florence, Italy
6. ISCA
International Symposium on Chinese Spoken Language Processing, Best Student Paper
Award 2010, Factor Analysis based Spatial Correlation Modeling for Speaker
Verification, Eryu Wang, Kong Aik Lee, Bin Ma,
Haizhou Li, Wu Guo, and Lirong Dai, in Proc. ISCSLP,
pp. 166 - 170, 29 November - 3 December 2010, Sun Moon Lake, Taiwan
Professional Leadership
1. Vice
President, IEEE Signal Processing Society 2024-2026
2. Member,
Awards Board, IEEE Signal Processing Society 2021-2023
3. Member,
Fellow Evaluation Committee, IEEE Signal Processing Society 2019
4. President,
Asian Federation of Natural Language Processing (AFNLP), 2017-2018
5. President,
International Speech Communication Association (ISCA), 2015-2017
6. Vice
President, Asian Federation of Natural Language Processing (AFNLP), 2015-2016
7. Member,
Publications Board, IEEE Signal Processing Society, 2015-2017
8. Vice
President, International Speech Communication Association (ISCA), 2013-2015
9. Board
Member, International Speech Communication Association (ISCA), 2009-2017
10. Board
Member, Asian Federation of Natural Language Processing (AFNLP), 2006-2012
11. Committee
Member, IEEE Speech and Language Processing Technical Committee, 2013-2015
12. Committee
Member, IEEE Singapore Computer Chapter, 2010-2011
13. Committee
Member, IEEE Singapore, Systems, Man, & Cybernetics Chapter, 2011-2014
14. President,
Teochew Doctorate Society, Singapore 2018-2022
15. President,
Chinese and Oriental Languages Information Processing Society, 2011-2022
16. President,
Asia Pacific Signal and Information Processing
Association, 2015-2016
17. President-Elect,
Asia Pacific Signal and Information Processing Association,
2013-2014
18. President
(2006-2014), Honorary President (2015-), South China University of Technology
Alumni Association (Singapore)
19. Chair,
ISCA Special Interest Group on Chinese Spoken Language Processing, ISCA,
2011-2014
20. Member
of Standing Committee, National Conference on Man-Machine Speech
Communications, China, 2006-
Editorial Services
1. Editor-in-Chief,
IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, 2015-2018
2. Guest
Associate Editor, Frontiers in Neuroscience, 2018
3. Editor,
Signal Processing Repository, IEEE Signal Processing Society, 2013-2014
4. Editor,
IEEE Speech and Language Processing Technical Committee Newsletter, 2013-2015
5. Senior
Area Editor, IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, 2014
6. Associate
Editor, IEEE TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, 2009-2012
7. Associate
Editor, Springer International Journal of Social Robotics, 2007-present
8. Associate
Editor, ACM TRANSACTIONS ON SPEECH AND LANGUAGE PROCESSING, 2011-2013
9. Associate
Editor, Journal of Multimedia, 2013-2014
10. Associate
Editor, Computer Speech and Language, 2012- present
11. Editor,
IEEE Speech and Language Processing Technical Committee Newsletter, 2013-2015
12. Guest
Editor, PROCEEDINGS OF THE IEEE, 2013 (Special Issue on Speech Information
Processing)
13. Guest
Editor, IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, 2014
(Special Issue on Continuous Space Language Modeling)
14. Guest
Editor, IEEE TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, 2013 (Special
Issue on Large Scale Optimization)
15. Guest
Editor, The Institute of the Electronics, Information and Communication
Engineers (IEICE), 2012 (Special Issue on Recent Advances in Multimedia Signal
Processing Techniques and Applications)
16. Guest
Editor, Computational Linguistics and Chinese Language Processing (CLCLP), 2007
17. Guest
Editor, International Journal of Computer Processing of Oriental Languages
(IJCPOL), 2007
18. Guest
Editor, ACM TRANSACTIONS ON ASIAN LANGUAGES INFORMATION PROCESSING, 2007
Conference Services
1. General
Chair, The 17th Asia Pacific Signal and Information Processing Association
Annual Summit and Conference, APSIPA ASC 2025, Singapore
2. Honorary
Chair, 2024 IEEE Spoken Language Technology Workshop, 2-5 December 2024, Macau,
China
3. General
Chair, International Conference on Social Robotics (ICSR-InnoBiz)
2024, Shenzhen, China
4. Local
Chair, The 2023 Conference on Empirical Methods in
Natural Language Processing (EMNLP), 6-10 December, 2023, Singapore
5. General
Chair, The 47th International Conference on Acoustics, Speech, and Signal
Processing (ICASSP), 22-27 May 2022, Singapore
6. General
Chair, The 26th International Conference on Asian Language Processing (IALP),
27-28 Oct 2022, in Singapore and Shenzhen
7. General
Chair, The 22nd Annual Meeting of the Special Interest Group on Discourse and
Dialogue, 29-31 July 2021, Singapore
8. General
Chair, The 12th International Workshop on Spoken Dialog System Technology,
15-17 November 2021, Singapore
9. General
Chair, The 24th Oriental COCOSDA, 18-20 November 2021, Singapore
10. Senior
Area Chair, ACL-IJCNLP 2021, Thailand
11. Area
Chair, The 21st Annual Conference of the International Speech Communication
Association (INTERSPEECH), 2020, Shanghai (online)
12. Member
of Best Paper Committee, EMNLP 2020, Online conference
13. Publicity
Chair, AACL-IJCNLP 2020, Online conference
14. Senior
Area Chair, EMNLP-IJCNLP 2019, Hong Kong
15. General
Chair, IEEE Workshop on Automatic Speech Recognition and Understanding 2019,
December 2019, Singapore
16. Area
Chair, The 20th Annual Conference of the International Speech Communication
Association (INTERSPEECH), 2019, Graz, Austria
17. Area
Chair, The 19th Annual Conference of the International Speech Communication
Association (INTERSPEECH), 2018, Hyderabad, India
18. Associate
Editor, the 24th International Conference on Pattern Recognition (ICPR), 20-24
August 2018, Beijing, China
19. General
Chair, The 5th International Conference on Orange Technologies (ICOT), 8-10
December 2017, Singapore
20. Area
Chair, The 18th Annual Conference of the International Speech Communication
Association (INTERSPEECH), 2017, Stockholm, Sweden
21. Area
Chair, The 17th Annual Conference of the International Speech Communication
Association (INTERSPEECH), 2016, San Francisco, USA
22. Technical
Program co-Chair, The Third IEEE China Summit and
International Conference on Signal and Information Processing (ChinaSIP), 2015, Chengdu, China
23. Area
Chair, IEEE International Conference on Acoustics, Speech and Signal Processing
(ICASSP), 2015, Brisbane, Australia
24. Area
Chair, The 8th IAPR International Conference on Biometrics, 2015, Phuket,
Thailand
25. General
Chair, The 15th Annual Conference of the International Speech Communication
Association (INTERSPEECH), 2014, Singapore
26. General
Chair, The 9th International Symposium on Chinese
Spoken Language Processing, (ISCA SIG CSLP) 2014, Singapore
27. Technical
Program Chair, IEEE Workshop on Spoken Language Technology (SLT) 2014, South
Lake Tahoe
28. Area
Chair, Conference on Empirical Methods in Natural Language Processing (EMNLP),
2013, Seattle, USA
29. Publicity
Chair, Automatic Speech Recognition and Understanding Workshop (ASRU), 2013,
Olomouc, Czech Republic
30. Publicity
Chair, 15th ACM International Conference on Multimodal Interaction (ICMI),
2013, Sydney, Australia
31. Area
Chair, IEEE China Summit & International Conference on Signal
and Information Processing (ChinaSIP), 2013, Beijing,
China
32. Area
Chair, International Conference on Pattern Recognition, 2012 (Tsukuba, Japan),
2014 (Stockholm, Sweden)
33. General
Chair, The 50th Annual Meeting of Association for Computational Linguistics
(ACL), 2012, Jeju, Korea
34. Organizing
Chair, The Speaker and Language Recognition Workshop (Odyssey), 2012
35. Posters
Chair, 5th ACM SIGGRAPH Conference and Exhibition on Computer Graphics and
Interactive Techniques in Asia, 2012, Singapore
36. Area
Coordinator, INTERSPEECH 2010, 26-30 September 2010, Makuhari,
Japan
37. Workshop
Chair, The 2nd Named Entities Workshop with Shared
Task on Machine Transliteration, ACL 2010, Sweden
38. Area
Chair, The 23rd International Conference on Computational Linguistics, COLING
2010, Beijing, China
39. Area
Chair, 2010 Conference on Empirical Methods on Natural Language Processing,
EMNLP 2010, Cambridge, Massachusetts, USA
40. Program
Chair, The 2nd Asia Pacific Signal and Information Processing Association
Annual Summit and Conference, APSIPA ASC 2010, Singapore
41. Conference
Chair, International Conference on Social Robotics 2010, Singapore
42. General
Chair, International Conference on Asian Language Processing, IALP 2009, Singapore
43. Local
Organizing Chair, The 47th Annual Meeting of ACL - 4th International Conference
on Natural Language Processing, ACL-IJCNLP 2009, Singapore
44. Workshop
Chair, The 1st Named Entities Workshop with Shared
Task on Machine Transliteration, ACL-IJCNLP 2009 Workshop, Singapore
45. Local
Arrangements Chair, The 31st SIGIR (The 31st Annual International ACM SIGIR
Conference) 2008, Singapore
46. Workshop
Chair, The 3rd IJCNLP (International Joint Conf. on Natural Language
Processing) 2008, Hyderabad
47. Chair,
The 6th SIGHAN Workshop on Chinese Language Processing, 2008, Hyderabad
48. Technical
Track Chair, 5th ACM SIGGRAPH International Conference on Virtual-Reality
Continuum and its Applications in Industry (VRCAI 2008), 8-9 December 2008,
Singapore
49. Program
Chair, Infocomm Horizons 2007
50. Senior
Researcher, Johns Hopkins University 2007 Summer Workshop on Human Language
Technology
51. Member
of Standing Committee, National Conference on Man-Machine Speech
Communications, China, 2006-
52. General
Chair, The 5th ISCSLP (International Symposium on
Chinese Spoken Language Processing), 2006
Scientific Committees
1. Member,
Academic Board, Master of Arts in Translation and
Interpretation (MTI) programme, Nanyang Technological
University, Singapore (2016-2020)
2. Member,
The Electrical and Computer Engineering Panel (2017-18), FCT, Portugal
3. Member,
The RGC Engineering Panel, University Grants Committee, Hong Kong (2017-2020)
4. Co-Chair,
A*STAR Lead User of RI Technologies Taskforce (2015)
5. Member,
National Robotics Taskforce (2014)
6. External
Reviewer, Research Grants Council of Hong Kong Government
7. Member
of Evaluation Panel, Singapore-Israel Industrial R&D Foundation
8. Member
of Organizing Committee, The Agency for Science, Technology & Research
(A*STAR) and Singapore National Academy of Science (SNAS) Young Scientist
Awards 2013, 2014
9. Chair,
Infocomms, Media & Computing Cluster Thematic
Oversight Committee, Science and Engineering Research Council, Singapore
2013-2014
10. Member
of Program Committee: INTERSPEECH, ICASSP, ASRU, ACL, EMNLP, IJCNLP, APSIPA
ASC, PACLIC, IWSLT, IWSDS, NEWS, Oriental COCOSDA, AIRS, SIGHAN, ODYSSEY, SLTU,
ICPR, Speech Prosody
Keynotes and Invited Talks
1. Attentive
Listening by Humans and Machines, The 34th ACM Multimedia, Rio de Janeiro,
Brazil, 10-14 November 2026
2. A
Computational Perspective to Language and Intelligence, 2024 International
Conference on Translation Education, Shenzhen, China, 12-14 April 2024
3. Seeing
to Hear Better, The 25th Conference of the Oriental COCOSDA, Hanoi, Vietnam,
24-26 November, 2022
4. Recent
Advances in Selective Auditory Attention, The 2020 IEEE Symposium Series on
Computational Intelligence (IEEE SSCI), Canberra, Australia, 1-4 December, 2020.
5. Speech
Processing at Cocktail Party, The 15th IEEE Conference on
Industrial Electronics and Applications (ICIEA 2020), Kristiansand, Norway,
9-13 November 2020
6. The
Story of Artificial Intelligence, The 2nd
International Conference on Intelligent Autonomous Systems, 28 February - 2
March, 2019
7. Audio-Visual Speaker Extraction, The 7th IEEE Global Conference on Signal and Information Processing (GlobalSIP), Shaw Centre, Ottawa, Ontario, Canada, 11-14 November 2019
8. Exemplar-based
Sparse Representation for Voice Conversion, the 119th audio, speech information
processing symposium, 21 December 2017, Tokyo
9. Whither
Speech Recognition? Alibaba Technology Forum, Learning from the Deep World, 18
September 2017, Singapore
10. Recent
Advances in Singing Synthesis, 5th International Conference on Statistical
Language and Speech Processing, 23-25 October 2017, Le Mans, France
11. Speech
Synthesis Perfects Everyone's Singing, International Conference on Orange
Technologies, 17-20 December 2016, Melbourne, Australia
12. Mandarin
Chinese spoken by speakers of European origin, The Fifth Conference on Natural
Language Processing and Chinese Computing & The Twenty Fourth International
Conference on Computer Processing of Oriental Languages
(NLPCC-ICCPOL 2016), December 2-6, 2016, Kunming, China
13. iCALL Mandarin Corpus, Oriental COCOSDA
(International Committee for Coordination and Standardization of Speech
Databases and Assessment Techniques), 26-28 October 2016, Bali, Indonesia
14. Voice
conversion and spoofing countermeasures for speaker verification, Odyssey 2016, The
Speaker and Language Recognition Workshop, June 21-24, Bilbao, Spain
Professional Membership
1. Member,
Association for Computing Machinery (ACM)
2. Fellow,
Institute of Electrical and Electronics Engineers (IEEE)
3. Fellow,
International Speech Communication Association (ISCA)
4. Member,
Association for Computational Linguistics (ACL)
5. Member,
Asia Pacific Signal and Information Processing
Association (APSIPA)
6. President,
Chinese and Oriental Languages Information Processing Society (COLIPS)
Journal Articles
1. Rui
Liu, Zhenqi Jia, Feilong
Bao, Haizhou Li, Retrieval-Augmented Dialogue Knowledge Aggregation for
expressive conversational speech synthesis. Inf.
Fusion 118: 102948 (2025)
2. Rui
Liu, Hongyu Yuan, Guanglai
Gao, Haizhou Li, Listening and seeing again: Generative error correction
for audio-visual speech recognition. Inf.
Fusion 120: 103077 (2025)
3. Rui
Liu, Jinhua Zhang, Haizhou Li, Hierarchical multi-source cues fusion
for mono-to-binaural based Audio Deepfake Detection. Inf.
Fusion 120: 103097 (2025)
4. Xinyuan
Qian, Jiaran Gao, Yaodan Zhang, Qiquan Zhang, Hexin Liu, Leibny Paola García-Perera, Haizhou Li,
SAV-SE: Scene-Aware Audio-Visual Speech Enhancement With Selective State Space Model. IEEE J. Sel.
Top. Signal Process. 19(4): 623-634 (2025)
5. Wenxuan Wu, Xueyuan Chen, Shuai Wang, Jiadong Wang, Lingwei Meng, Xixin Wu, Helen Meng, Haizhou Li, C2AV-TSE: Context and Confidence-Aware Audio Visual Target Speaker Extraction. IEEE J. Sel. Top. Signal Process. 19(4): 646-657 (2025)
6. Kristen Grauman et al, Ego4D: Around the World in 3,600 Hours of Egocentric Video. IEEE Trans. Pattern Anal. Mach. Intell. 47(11): 9468-9509 (2025)
7. Xinyuan
Qian, Xianghu Yue, Jiadong Wang, Huiping
Zhuang, Haizhou Li, Analytic Class Incremental Learning for Sound Source
Localization With Privacy Protection. IEEE Signal
Process. Lett. 32: 726-730 (2025)
8. Yi
Ma, Shuai Wang, Tianchi Liu, Haizhou Li, ExPO:
Explainable Phonetic Trait-Oriented Network for Speaker Verification. IEEE
Signal Process. Lett. 32: 731-735 (2025)
9. Jiqing Zhang, Malu Zhang, Yuanchen Wang, Qianhui Liu, Baocai
Yin, Haizhou Li, Xin Yang, Spiking Neural Networks with Adaptive
Membrane Time Constant for Event-Based Tracking. IEEE Trans. Image
Process. 34: 1009-1021 (2025)
10. Ruijie
Tao, Xinyuan Qian, Rohan Kumar Das, Xiaoxue Gao, Jiadong
Wang, Haizhou Li, Enhancing Real-World Active Speaker Detection with
Multi-Modal Extraction Pre-Training. IEEE Trans. Multim. 27: 2362-2373 (2025)
11. Ruihang
Ji, Dongyu Li, Shuzhi Sam Ge, Haizhou
Li, Tunnel Prescribed Control of Nonlinear Systems with Unknown Control
Directions. IEEE Trans. Neural Networks Learn.
Syst. 36(1): 1383-1395 (2025)
12. Malu
Zhang, Xiaoling Luo, Jibin Wu, Ammar Belatreche, Siqi
Cai, Yang Yang, Haizhou Li, Toward Building
Human-Like Sequential Memory Using Brain-Inspired Spiking Neural Models. IEEE Trans. Neural Networks Learn.
Syst. 36(6): 10143-10155 (2025)
13. Yan
Xiao, Yaochu Jin, Bin
Wang, Yan
Zhang, Kuangrong Hao, Haizhou
Li, Zero-Shot Relation Classification Through Inference on Category
Attributes. IEEE Trans. Neural Networks Learn.
Syst. 36(7): 13135-13148 (2025)
14. Qianhui
Liu, Meng Ge, Haizhou Li, Intelligent event-based lip-reading word
classification with spiking neural networks using spatio-temporal
attention features and triplet loss. Inf. Sci. 675: 120660 (2024)
15. Jiaqi
Yan, Qianhui Liu, Malu Zhang, Lang Feng, De Ma, Haizhou Li, Gang Pan, Efficient
spiking neural network design via neural architecture search. Neural Networks
173: 106172 (2024)
16. Xinyi
Chen, Qu Yang, Jibin Wu, Haizhou Li, Kay Chen Tan, A Hybrid Neural Coding
Approach for Pattern Recognition with Spiking Neural Networks. IEEE Trans.
Pattern Anal. Mach. Intell. 46(5): 3064-3078 (2024)
17. Shuai
Wang, Zhengyang Chen, Bing Han, Hongji
Wang, Chengdong Liang, Binbin
Zhang, Xu Xiang, Wen Ding, Johan Rohdin, Anna Silnova, Yanmin Qian, Haizhou Li,
Advancing speaker embedding learning: Wespeaker toolkit for research and production. Speech Commun. 162: 103104 (2024)
18. Jingru
Lin, Meng Ge, Wupeng Wang, Haizhou Li, Mengling Feng,
Selective HuBERT: Self-Supervised Pre-Training for
Target Speaker in Clean and Mixture Speech. IEEE Signal Process. Lett. 31:
1014-1018 (2024)
19. Duo
Ma, Xianghu Yue, Junyi Ao, Xiaoxue Gao, Haizhou Li, Text-Guided HuBERT: Self-Supervised Speech Pre-Training via Generative
Adversarial Networks. IEEE Signal Process. Lett. 31: 2055-2059 (2024)
20. Xiaoxue
Gao, Zexin Li, Yiming Chen, Cong Liu, Haizhou Li,
Transferable Adversarial Attacks Against ASR. IEEE Signal Process. Lett. 31:
2200-2204 (2024)
21. Rui
Liu, Haolin Zuo, Zheng Lian, Björn W. Schuller,
Haizhou Li, Contrastive Learning Based Modality-Invariant Feature Acquisition
for Robust Multimodal Emotion Recognition with Missing Modalities. IEEE Trans.
Affect. Comput. 15(4): 1856-1873 (2024)
22. Qu
Yang, Malu Zhang, Jibin Wu, Kay Chen Tan, Haizhou Li, LC-TTFS: Toward Lossless
Network Conversion for Spiking Neural Networks With
23. Siqi
Cai, Ran Zhang, Malu Zhang, Jibin Wu, Haizhou Li, EEG-Based Auditory Attention
Detection with Spiking Graph Convolutional Network. IEEE Trans. Cogn. Dev. Syst. 16(5): 1698-1706 (2024)
24. Koichiro
Yoshino, Yun-Nung Chen, Paul A. Crook, Satwik Kottur, Jinchao Li, Behnam Hedayatnia,
Seungwhan Moon, Zhengcong
Fei, Zekang Li, Jinchao Zhang, Yang Feng, Jie Zhou, Seokhwan Kim, Yang Liu, Di Jin, Alexandros Papangelis,
Karthik Gopalakrishnan, Dilek Hakkani-Tur, Babak Damavandi,
Alborz Geramifard, Chiori Hori, Ankit Shah, Chen
Zhang, Haizhou Li, João Sedoc, Luis F. D'Haro, Rafael
E. Banchs, Alexander Rudnicky, Overview of the Tenth
Dialog System Technology Challenge: DSTC10. IEEE ACM Trans. Audio Speech Lang.
Process. 32: 765-778 (2024)
25. Lei
Liu, Li Liu, Haizhou Li, Computation and Parameter Efficient Multi-Modal Fusion
Transformer for Cued Speech Recognition. IEEE ACM Trans. Audio Speech Lang.
Process. 32: 1559-1572 (2024)
26. Xuehao
Zhou, Mingyang Zhang, Yi Zhou, Zhizheng Wu, Haizhou Li, Accented Text-to-Speech
Synthesis with Limited Data. IEEE ACM Trans. Audio Speech Lang. Process. 32:
1699-1711 (2024)
27. Rui
Liu, Berrak Sisman, Guanglai Gao, Haizhou Li,
Controllable Accented Text-to-Speech Synthesis with Fine and Coarse-Grained
Intensity Rendering. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2188-2201
(2024)
28. Tianchi
Liu, Kong Aik Lee, Qiongqiong Wang, Haizhou Li,
Golden Gemini is All You Need: Finding the Sweet Spots for Speaker
Verification. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2324-2337
(2024)
29. Congcong Sun, Hui Tian, Peng Tian, Haizhou Li, Zhenxing Qian, Multi-Agent Deep Learning for the Detection
of Multiple Speech Steganography Methods. IEEE ACM Trans. Audio Speech Lang.
Process. 32: 2957-2972 (2024)
30. Mingyang
Zhang, Yi Zhou, Yi Ren, Chen Zhang, Xiang Yin, Haizhou
Li, RefXVC: Cross-Lingual Voice Conversion with
Enhanced Reference Leveraging. IEEE ACM Trans. Audio Speech Lang.
Process. 32: 4146-4156 (2024)
31. Wupeng
Wang, Zexu Pan, Xinke Li, Shuai
Wang, Haizhou Li, Speech Separation with Pretrained Frontend to
Minimize Domain Mismatch. IEEE ACM Trans. Audio Speech Lang. Process. 32:
4184-4198 (2024)
32. Zexu
Pan, Marvin Borsdorf, Siqi Cai, Tanja Schultz, Haizhou Li, NeuroHeed:
Neuro-Steered Speaker Extraction Using EEG Signals. IEEE ACM Trans. Audio
Speech Lang. Process. 32: 4456-4470 (2024)
33. Yicheng Gu, Xueyao
Zhang, Liumeng Xue, Haizhou Li, Zhizheng Wu, An
Investigation of Time-Frequency Representation Discriminators for High-Fidelity
Vocoders. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4569-4579 (2024)
34. Shuai
Wang, Zhengyang Chen, Kong Aik Lee, Yanmin Qian, Haizhou Li, Overview of Speaker Modeling and
Its Applications: From the Lens of Deep Speaker Representation Learning. IEEE
ACM Trans. Audio Speech Lang. Process. 32: 4971-4998 (2024)
35. Siqi
Cai, Tanja Schultz, Haizhou Li, Brain Topology Modeling With EEG-Graphs for
Auditory Spatial Attention Detection. IEEE Trans. Biomed. Eng. 71(1): 171-182
(2024)
36. Miao
Liu, Jing Wang, Xinyuan Qian, Haizhou Li, Audio-Visual Temporal Forgery
Detection Using Embedding-Level Fusion and Multi-Dimensional Contrastive Loss.
IEEE Trans. Circuits Syst. Video Technol. 34(8): 6937-6948 (2024)
37. Zhenyu
Weng, Huiping Zhuang, Fulin Luo, Haizhou Li, Zhiping
Lin, Few-Shot Contrastive Transfer Learning With
Pretrained Model for Masked Face Verification. IEEE Trans. Multim.
26: 3871-3883 (2024)
38. Xinyuan
Qian, Wei Xue, Qiquan Zhang, Ruijie Tao, Haizhou Li, Deep Cross-Modal Retrieval
Between Spatial Image and Acoustic Speech. IEEE Trans. Multim.
26: 4480-4489 (2024)
39. Siqi
Cai, Peiwen Li, Haizhou Li, A Bio-Inspired Spiking
Attentional Neural Network for Attentional Selection in the Listening Brain.
IEEE Trans. Neural Networks Learn. Syst. 35(12): 17387-17397 (2024)
40. Ruihang
Ji, Shuzhi Sam Ge, Kai Zhao, Haizhou Li, Event-Triggered Tracking Control for
Nonlinear Systems With Prescribed Performance. IEEE
Trans. Syst. Man Cybern. Syst. 54(6): 3547-3557
(2024)
41. Tao
Luo, Weng-Fai Wong, Rick Siow Mong Goh, Anh Tuan Do, Zhixian Chen, Haizhou Li,
Wenyu Jiang, Weiyun Yau, Achieving Green AI with
Energy-Efficient Deep Learning Using Neuromorphic Computing. Commun. ACM 66(7): 52-57 (2023)
42. Tingting Wang, Zexu Pan, Meng Ge, Zhen Yang,
Haizhou Li, Time-Domain Speech Separation Networks With
Graph Encoding Auxiliary. IEEE Signal Process. Lett. 30: 110-114 (2023)
43. Yi
Zhou, Zhizheng Wu, Mingyang Zhang, Xiaohai Tian, Haizhou Li, TTS-Guided
Training for Accent Conversion Without Parallel Data. IEEE Signal Process.
Lett. 30: 533-537 (2023)
44. Mingyang
Zhang, Xuehao Zhou, Zhizheng Wu, Haizhou Li, Towards Zero-Shot Multi-Speaker
Multi-Accent Text-to-Speech Synthesis. IEEE Signal Process. Lett. 30: 947-951
(2023)
45. Kun
Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li, Emotion
Intensity and its Control for Emotional Voice Conversion. IEEE Trans. Affect. Comput. 14(1): 31-48 (2023)
46. Hui
Tian, Yiqin Qiu, Wojciech Mazurczyk,
Haizhou Li, Zhenxing Qian, STFF-SM: Steganalysis
Model Based on Spatial and Temporal Feature Fusion for Speech Streams. IEEE ACM
Trans. Audio Speech Lang. Process. 31: 277-289 (2023)
47. Qiquan
Zhang, Xinyuan Qian, Zhaoheng Ni, Aaron Nicolson,
Eliathamby Ambikairajah, Haizhou Li, A Time-Frequency Attention Module for
Neural Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 31:
462-475 (2023)
48. Xinyuan
Qian, Zhengdong Wang, Jiadong Wang, Guohui Guan, Haizhou Li, Audio-Visual
Cross-Attention Network for Robotic Speaker Tracking. IEEE ACM Trans. Audio
Speech Lang. Process. 31: 550-562 (2023)
49. Chen
Zhang, Luis Fernando D'Haro, Qiquan Zhang, Thomas Friedrichs, Haizhou Li, PoE:
A Panel of Experts for Generalized Automatic Dialogue Assessment. IEEE ACM
Trans. Audio Speech Lang. Process. 31: 1234-1250 (2023)
50. Ruijie
Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li,
Self-Supervised Training of Speaker Encoder With
Multi-Modal Diverse Positive Pairs. IEEE ACM Trans. Audio Speech Lang. Process.
31: 1706-1719 (2023)
51. Yi
Zhou, Zhizheng Wu, Xiaohai Tian, Haizhou Li, Optimization of Cross-Lingual
Voice Conversion With Linguistics Losses to Reduce
Foreign Accents. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1916-1926
(2023)
52. Xiaoxue
Gao, Chitralekha Gupta, Haizhou Li, PoLyScriber:
Integrated Fine-Tuning of Extractor and Lyrics Transcriber for Polyphonic
Music. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1968-1981 (2023)
53. Zhenyu
Weng, Huiping Zhuang, Haizhou Li, Balakrishnan
Ramalingam, Rajesh Elara Mohan, Zhiping Lin, Online Multi-Face Tracking With Multi-Modality Cascaded Matching. IEEE Trans. Circuits
Syst. Video Technol. 33(6): 2738-2752 (2023)
54. Yiqin Qiu, Hui Tian, Haizhou Li, Chin-Chen
Chang, Athanasios V. Vasilakos, Separable Convolution Network With Dual-Stream Pyramid Enhanced Strategy for Speech Steganalysis.
IEEE Trans. Inf. Forensics Secur. 18: 2737-2750
(2023)
55. Jibin
Wu, Yansong Chua, Malu Zhang, Guoqi Li, Haizhou Li,
Kay Chen Tan, A Tandem Learning Rule for Effective Training and Rapid Inference
of Deep Spiking Neural Networks. IEEE Trans. Neural Networks Learn. Syst.
34(1): 446-460 (2023)
56. Xianghu
Yue, Jingru Lin, Fabian Ritter Gutierrez, Haizhou Li, Self-Supervised Learning With Segmental Masking for Speech Representation. IEEE J.
Sel. Top. Signal Process. 16(6): 1367-1379 (2022)
57. Hongqiang
Du, Lei Xie, Haizhou Li, Noise-robust voice conversion with domain adversarial
training. Neural Networks 148: 74-84 (2022)
58. Jibin
Wu, Chenglin Xu, Xiao Han, Daquan Zhou, Malu Zhang, Haizhou Li, Kay Chen Tan,
Progressive Tandem Learning for Pattern Recognition With
Deep Spiking Neural Networks. IEEE Trans. Pattern Anal. Mach. Intell. 44(11): 7824-7840 (2022)
59. Kun
Zhou, Berrak Sisman, Rui Liu, Haizhou Li, Emotional voice conversion: Theory,
databases and ESD. Speech Commun. 137: 1-18 (2022)
60. Hongning
Zhu, Kong Aik Lee, Haizhou Li, Discriminative speaker embedding with serialized
multi-layer multi-head attention. Speech Commun. 144:
89-100 (2022)
61. Tianchi
Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li, Neural Acoustic-Phonetic
Approach for Speaker Verification With Phonetic
Attention Mask. IEEE Signal Process. Lett. 29: 782-786 (2022)
62. Zexu
Pan, Xinyuan Qian, Haizhou Li, Speaker Extraction With
Co-Speech Gestures Cue. IEEE Signal Process. Lett. 29: 1467-1471 (2022)
63. Haizhou
Li, A Unique ICASSP 2022: During an Unusual Time [Conference Highlights]. IEEE
Signal Process. Mag. 39(2): 159-160 (2022)
64. Zexu
Pan, Ruijie Tao, Chenglin Xu, Haizhou Li, Selective Listening by Synchronizing
Speech With Lips. IEEE ACM Trans. Audio Speech Lang.
Process. 30: 1650-1664 (2022)
65. Rui
Liu, Berrak Sisman, Guanglai Gao, Haizhou Li,
Decoding Knowledge Transfer for Neural Text-to-Speech Training. IEEE ACM Trans.
Audio Speech Lang. Process. 30: 1789-1802 (2022)
66. Xiaoxue
Gao, Chitralekha Gupta, Haizhou Li, Automatic Lyrics Transcription of
Polyphonic Music With Lyrics-Chord Multi-Task
Learning. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2280-2294 (2022)
67. Chitralekha
Gupta, Haizhou Li, Masataka Goto, Deep Learning
Approaches in Topics of Singing Information Processing. IEEE ACM Trans. Audio
Speech Lang. Process. 30: 2422-2451 (2022)
68. Zexu
Pan, Meng Ge, Haizhou Li, USEV: Universal Speaker Extraction With
Visual Cue. IEEE ACM Trans. Audio Speech Lang. Process. 30: 3032-3045 (2022)
69. Enze Su, Siqi Cai, Longhan
Xie, Haizhou Li, Tanja Schultz, STAnet: A
Spatiotemporal Attention Network for Decoding Auditory Spatial Attention From
EEG. IEEE Trans. Biomed. Eng. 69(7): 2233-2242 (2022)
70. Siqi
Cai, Enze Su, Longhan Xie,
Haizhou Li, EEG-Based Auditory Attention Detection via Frequency and Channel
Neural Attention. IEEE Trans. Hum. Mach. Syst. 52(2): 256-266 (2022)
71. Malu
Zhang, Jiadong Wang, Jibin Wu, Ammar Belatreche,
Burin Amornpaisannon, Zhixuan
Zhang, Venkata Pavan Kumar Miriyala, Hong Qu, Yansong Chua, Trevor E. Carlson,
Haizhou Li, Rectified Linear Postsynaptic Potential Function for
Backpropagation in Deep Spiking Neural Networks. IEEE Trans. Neural Networks
Learn. Syst. 33(5): 1947-1958 (2022)
72. Jibin
Wu, Qi Liu, Malu Zhang, Zihan Pan, Haizhou Li, Kay Chen Tan, HuRAI: A brain-inspired computational model for human-robot
auditory interface. Neurocomputing 465: 103-113 (2021)
73. Rui
Liu, Berrak Sisman, Yixing Lin, Haizhou Li, FastTalker:
A neural text-to-speech architecture with shallow and group autoregression.
Neural Networks 141: 306-314 (2021)
74. Hongqiang
Du, Xiaohai Tian, Lei Xie, Haizhou Li, Factorized WaveNet
for voice conversion with limited data. Speech Commun.
130: 45-54 (2021)
75. Tharshini
Gunendradasan, Eliathamby Ambikairajah, Julien Epps, Vidhyasaharan Sethu,
Haizhou Li, An adaptive transmission line cochlear
model based front-end for replay attack detection. Speech Commun.
132: 114-122 (2021)
76. Bidisha
Sharma, Xiaoxue Gao, Karthika Vijayan, Xiaohai Tian, Haizhou Li, NHSS: A speech
and singing parallel database. Speech Commun. 133:
9-22 (2021)
77. Xinyuan
Qian, Qi Liu, Jiadong Wang, Haizhou Li, Three-Dimensional Speaker Localization:
Audio-Refined Visual Scaling Factor Estimation. IEEE Signal Process. Lett. 28:
1405-1409 (2021)
78. Rui
Liu, Berrak Sisman, Feilong Bao, Jichen Yang, Guanglai Gao, Haizhou Li, Exploiting Morphological and
Phonological Features to Improve Prosodic Phrasing for Mongolian Speech
Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 29: 274-285 (2021)
79. Mingyang
Zhang, Yi Zhou, Li Zhao, Haizhou Li, Transfer Learning from Speech Synthesis to
Voice Conversion with Non-Parallel Training Data. IEEE ACM Trans. Audio Speech
Lang. Process. 29: 1290-1302 (2021)
80. Rui
Liu, Berrak Sisman, Guanglai Gao, Haizhou Li,
Expressive TTS Training with Frame and Style Reconstruction Loss. IEEE ACM
Trans. Audio Speech Lang. Process. 29: 1806-1818 (2021)
81. Yi
Zhou, Xiaohai Tian, Haizhou Li, Language Agnostic Speaker Embedding for
Cross-Lingual Personalized Speech Generation. IEEE ACM Trans. Audio Speech
Lang. Process. 29: 3427-3439 (2021)
82. Chen Zhang, Grandee Lee, Luis Fernando D'Haro, Haizhou Li, D-Score: Holistic
Dialogue Evaluation Without Reference. IEEE ACM Trans. Audio Speech Lang.
Process. 29: 2502-2516 (2021)
83. Zihan Pan, Malu Zhang, Jibin Wu, Jiadong Wang, Haizhou Li, Multi-Tone Phase Coding
of Interaural Time Difference for Sound Source Localization with Spiking Neural
Networks. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2656-2670 (2021)
84. Chenglin
Xu, Wei Rao, Jibin Wu, Haizhou Li, Target Speaker Verification with Selective
Auditory Attention for Single and Multi-Talker Speech. IEEE ACM Trans. Audio
Speech Lang. Process. 29: 2696-2709 (2021)
85. Berrak
Sisman, Junichi Yamagishi, Simon King, and Haizhou Li, An Overview of Voice
Conversion and its Challenges: From Statistical Modeling to Deep Learning,
IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 29, pp.
132-157, 2021, doi: 10.1109/TASLP.2020.3038524
86. Rui
Liu, Berrak Sisman, Feilong Bao, Jichen Yang, Guanglai Gao and Haizhou Li, Exploiting
morphological and phonological features to improve prosodic phrasing for
Mongolian speech synthesis, IEEE/ACM Transactions on Audio, Speech, and
Language Processing, 2020, doi:
10.1109/TASLP.2020.3040523
87. Rui
Liu, Berrak Sisman, Feilong Bao, Guanglai
Gao and Haizhou Li, Modeling Prosodic Phrasing with Multi-Task Learning in Tacotron-based TTS, IEEE Signal Processing Letters, vol.
27, pp. 1470-1474, 2020
88. Yi
Zhou, Xiaohai Tian and Haizhou Li, Multi-Task WaveRNN
with an Integrated Architecture for Cross-lingual Voice Conversion, IEEE Signal
Processing Letters, vol. 27, pp. 1310-1314, 2020
89. Changhuai You and Jichen Yang, Device Feature
Extraction Based on Parallel Neural network training for replay spoofing
detection, IEEE/ACM Transactions on Audio, Speech and Language Processing, vol.
28, pp 2308-2318, 2020
90. Mingyang
Zhang, Berrak Sisman, Li Zhao and Haizhou Li, DeepConversion:
Voice conversion with limited parallel training data, Speech Communication,
vol. 122, pp. 31-43, 2020
91. Chenglin
Xu, Wei Rao, Eng Siong Chng and Haizhou Li, SpEx:
Multi-Scale Time Domain Speaker Extraction Network, IEEE/ACM Transactions on
Audio, Speech, and Language Processing, vol. 28, pp. 1370-1384, 2020
92. Malu
Zhang, Xiaoling Luo, Jibin Wu, Yi Chen, Ammar Belatreche,
Zihan Pan, Hong Qu, and Haizhou Li, An Efficient Threshold-Driven Aggregate-Label
Learning Algorithm for Multimodal Information Processing, IEEE Journal of
Selected Topics in Signal Processing, 14(3), pp. 592-602, March 2020, doi: 10.1109/JSTSP.2020.2983547
93. Malu
Zhang, Jibin Wu, Ammar Belatreche, Zihan Pan, Xiurui Xie, Yansong Chua, Guoqi
Li, Hong Qu and Haizhou Li, Supervised Learning in Spiking Neural Networks with
Synaptic Delay-Weight Plasticity, Neurocomputing, vol. 409, pp. 103-118,
October 2020
94. Jibin
Wu, Emre Yılmaz, Malu Zhang, Haizhou Li and Kay Chen
Tan, Deep Spiking Neural Networks for Large Vocabulary Automatic Speech
Recognition, Frontiers in Neuroscience, 14(199), March 2020
95. Zihan
Pan, Yansong Chua, Jibin Wu, Malu Zhang, Haizhou Li and Eliathamby
Ambikairajah, An Efficient and Perceptually Motivated Auditory Neural Encoding
and Decoding Algorithm for Spiking Neural Networks, Frontiers in Neuroscience,
13(1420), January 2020
96. Jichen
Yang, Rohan Kumar Das and Haizhou Li, Significance of Subband
Features for Synthetic Speech Detection, IEEE Transactions on Information Forensics
and Security, vol. 15, pp. 2160-2170, 2020, doi:
10.1109/TIFS.2019.2956589
97. Chitralekha
Gupta, Haizhou Li and Ye Wang, Automatic Leaderboard: Evaluation of Singing
Quality Without a Standard Reference, IEEE/ACM Transactions on Audio, Speech,
and Language Processing, vol. 28, pp. 13-26, 2020, doi:
10.1109/TASLP.2019.2947737
98. Qiang Yu, Haizhou Li, Kay Chen Tan, Spike
Timing or Rate? Neurons Learn to Make Decisions for Both Through
Threshold-Driven Plasticity, IEEE Trans. Cybernetics 49(6): 2178-2189, 2019
99. Berrak
Sisman, Mingyang Zhang, Haizhou Li, Group Sparse Representation with WaveNet Vocoder Adaptation for Spectrum and Prosody
Conversion, IEEE/ACM Transactions on Audio, Speech, and Language Processing,
27(6): 1085-1097, 2019
100. Karthika
Vijayan, Haizhou Li, Tomoki Toda, Speech-to-Singing Voice Conversion: The
Challenges and Strategies for Improving Vocal Conversion Processes, IEEE Signal
Processing Magazine. 36(1): 95-102, 2019
101. Luis
Fernando D'Haro, Rafael E. Banchs, Chiori Hori, Haizhou Li: Automatic
evaluation of end-to-end dialog systems with adequacy-fluency metrics, Computer
Speech & Language 55: 200-215, 2019
102. Chong
Zhang, Kay Chen Tan, Haizhou Li, Geok Soon Hong, A Cost-Sensitive Deep Belief
Network for Imbalanced Classification, IEEE Transactions on Neural Networks and
Learning Systems. 30(1): 109-122, 2019
103. Van
Tung Pham, Haihua Xu, Xiong Xiao, Nancy F. Chen, Eng Siong Chng, Haizhou Li,
Re-ranking spoken term detection with acoustic exemplars of keywords. Speech
Communication 104: 12-23, 2018
104. Longting Xu, Kong-Aik Lee, Haizhou Li, Zhen
Yang, Generalizing I-Vector Estimation for Rapid Speaker Recognition. IEEE/ACM
Trans. Audio, Speech & Language Processing 26(4): 749-759, 2018
105. Saad Irtza, Vidhyasaharan Sethu, Eliathamby Ambikairajah,
Haizhou Li, Using language cluster models in hierarchical language
identification. Speech Communication 100: 30-40, 2018
106. Kaavya Sriskandaraja,
Vidhyasaharan Sethu, Eliathamby Ambikairajah, Haizhou Li, Front-End for Antispoofing Countermeasures in Speaker Verification:
Scattering Spectral Decomposition, IEEE Journal of Selected Topics in Signal
Processing 11(4): 632-643, 2017
107. Hongjie
Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, Multitask Feature Learning
for Low-Resource Query-by-Example Spoken Term Detection, IEEE Journal of
Selected Topics in Signal Processing 11(8): 1329-1339, 2017
108. Xiaohai
Tian, Siu Wa Lee, Zhizheng Wu, Eng Siong Chng, Haizhou
Li, An Exemplar-Based Approach to Frequency Warping for Voice Conversion,
IEEE/ACM Trans. Audio, Speech & Language Processing 25(10): 1863-1876, 2017
109. Hongjie
Chen, Lei Xie, Cheung-Chi Leung, Xiaoming Lu, Bin Ma,
Haizhou Li, Modeling Latent Topics and Temporal Distance for Story Segmentation
of Broadcast News, IEEE/ACM Trans. Audio, Speech & Language Processing
25(1): 108-119, 2017
110. Jun
Hu, Huajin Tang, Kay Chen Tan, Haizhou Li, How the Brain Formulates Memory: A Spatio-Temporal Model, IEEE Computational Intelligence
Magazine, 11(2): 56-68, 2016
111. Xiong
Xiao, Shengkui Zhao, Duc Hoang Ha Nguyen, Xionghu Zhong, Douglas L. Jones, Eng Siong Chng, Haizhou
Li, Speech dereverberation for enhancement and recognition using dynamic
features constrained deep neural networks and feature adaptation, EURASIP
Journal Adv. Sig. Proc. 2016: 4, 2016
112. Zhizheng
Wu, Haizhou Li, On the study of replay and voice conversion attacks to
text-dependent speaker verification, Multimedia Tools Appl. 75(9): 5311-5327,
2016
113. Nancy
F. Chen, Darren Wee, Rong Tong, Bin Ma, Haizhou Li, Large-scale
characterization of non-native Mandarin Chinese spoken by speakers of European
origin: Analysis on iCALL. Speech Communication 84:
46-56, 2016
114. Sven
Ewan Shepstone, Kong-Aik Lee, Haizhou Li, Zheng-Hua Tan, Soren Holdt Jensen, Total Variability Modeling Using
Source-Specific Priors. IEEE/ACM Trans. Audio, Speech & Language Processing
24(3): 504-517, 2016
115. Duc
Hoang Ha Nguyen, Xiong Xiao, Eng Siong Chng, Haizhou Li, Feature Adaptation
Using Linear Spectro-Temporal Transform for Robust Speech Recognition. IEEE/ACM
Trans. Audio, Speech & Language Processing 24(6): 1006-1019, 2016
116. Qiang Yu, Rui Yan, Huajin Tang, Kay Chen
Tan, Haizhou Li, A Spiking Neural Network System for Robust Sequence
Recognition, IEEE Transactions on Neural Networks and Learning Systems, 27(3):
621-635, 2016, doi: 10.1109/TNNLS.2015.2416771
117. Yuma
Ueda, Longbiao Wang, Atsuhiko
Kai, Xiong Xiao, Eng Siong Chng, Haizhou Li, Single-channel Dereverberation for
Distant-Talking Speech Recognition by Combining Denoising Autoencoder and
Temporal Structure Normalization. Journal of Signal Processing Systems 82(2): 151-161,
2016
118. Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo,
Haizhou Li, Li-Rong Dai, Exploration of Local Variability in Text-Independent
Speaker Verification, Journal of Signal Processing Systems 82(2): 217-228, 2016
119. Dau-Cheng Lyu, Tien Ping Tan, Eng Siong
Chng, Haizhou Li: Mandarin-English code-switching speech corpus in South-East
Asia: SEAME. Language Resources and Evaluation 49(3): 581-600, 2015
120. Chang
Huai You, Haizhou Li, and Kong-Aik Lee, Relevance factor of maximum a
posteriori adaptation for GMM-NAP-SVM in speaker and language recognition,
Computer Speech and Language, vol.30, no.1, pp.116-134, 2015
121. Van
Hai Do, Xiong Xiao, Eng Siong Chng, and Haizhou Li, Context-dependent Phone
Mapping for Acoustic Modeling of Under-resourced Languages, International
Journal of Asian Language Processing, vol.23, no.1, pp.21-33, 2015
122. Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin
Ma, and Haizhou Li, Acoustic Segment Modeling with Spectral Clustering Methods,
IEEE/ACM Transactions on Audio, Speech and Language Processing, vol.23, no.2,
pp.264-277, 2015
123. Rafael
E. Banchs, Luis F. D'Haro, and Haizhou Li, Adequacy-Fluency Metrics: Evaluating
MT in the Continuous Space Model Framework, IEEE/ACM Transactions on Audio,
Speech and Language Processing, vol.23, no.3, pp.472-482, 2015
124. Tze Yuang Chong, Rafael E. Banchs, Eng Siong Chng, Haizhou Li,
Decoupling Word-Pair Distance and Co-occurrence Information for Effective Long
History Context Language Modeling, IEEE/ACM Transactions on Audio, Speech and
Language Processing, 23(7): 1221-1232, 2015
125. Haizhou
Li, Inaugural editorial: Embracing Opportunities for Growth, IEEE/ACM
Transactions on Audio, Speech and Language Processing, 23(1): 5-6, 2015
126. Jonathan
William Dennis, Tran Huy Dat,
Haizhou Li: Generalized Hough Transform for Speech Pattern Classification.
IEEE/ACM Transactions on Audio, Speech & Language Processing 23(11):
1963-1972, 2015
127. Zhizheng
Wu, Nicholas Evans, Tomi Kinnunen, Junichi Yamagishi, Federico Alegre, Haizhou
Li, Spoofing and countermeasures for speaker verification: a survey, Speech
Communication, vol.66, Pages 130-153, 2015
128. Van
Hai Do, Xiong Xiao, Engsiong Chng, Haizhou Li,
Cross-Lingual Phone Mapping for Large Vocabulary Speech Recognition of
Under-Resourced Languages, IEICE Transactions 97-D(2):
285-295, 2014
129. Miaolong
Yuan, Huajin Tang, Haizhou Li, Real-Time Keypoint
Recognition Using Restricted Boltzmann Machine, IEEE Trans. Neural Netw. Learning Syst. 25(11): 2119-2126, 2014
130. Zhizheng
Wu, Haizhou Li, Voice conversion versus speaker verification: an overview,
APSIPA Transactions on Signal and Information Processing, vol. 3, e17,
doi: 10.1017/ATSIP.2014.17, 2014
131. Zhizheng
Wu, Eng Siong Chng, Haizhou Li, Exemplar-based voice conversion using joint
nonnegative matrix factorization, Multimedia Tools and Applications, Springer,
2014
132. Zhizheng
Wu, Tuomas Virtanen, Eng Siong Chng, Haizhou Li,
Exemplar-based sparse representation with residual compensation for voice
conversion, IEEE/ACM Transactions on Audio, Speech and Language Processing,
vol. 22, No. 10, pp. 1506-1521, 2014
133. Anthony
Larcher, Kong Aik Lee, Bin Ma, Haizhou Li, Text-dependent speaker verification:
Classifiers, databases and RSR2015, Speech Communication, vol. 60, May 2014,
pp. 56-77
134. Qiang Yu, Huajin Tang, Kay Chen Tan, and
Haizhou Li, Precise-Spike-Driven Synaptic Plasticity: Learning
Hetero-Association of Spatiotemporal Spike Patterns, PLoS
ONE, 8(11): e78318, 2013, doi: 10.1371/journal.pone.0078318
135. Qiang Yu, Huajin Tang, Kay Chen Tan, Haizhou
Li: Rapid Feedforward Computation by Temporal Encoding and Learning With Spiking Neurons. IEEE Trans. Neural Networks and Learning
Systems, 24(10): 1539-1552, 2013
136. S. J.
Wright, D. Kanevsky, L. Deng, X. He, G. Heigold, and H. Li, Optimization Algorithms and Applications
for Speech and Language Processing, IEEE Transactions on Audio, Speech and
Language Processing, 21(11):2231-2243, 2013
137. Haipeng Wang, Cheung-Chi Leung, Tan Lee, Bin
Ma, Haizhou Li, Shifted-Delta MLP Features for Spoken Language Recognition.
IEEE Signal Process. Lett. 20(1): 15-18, 2013
138. Jun
Hu, Huajin Tang, Kay Chen Tan, Haizhou Li and Luping
Shi, A Spike-Timing Based Integrated Model for Pattern Recognition. Neural
Computation, vol. 25, no. 2, pp. 450-472, 2013
139. Raymond
W. M. Ng, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li: Spoken Language
Recognition with Prosodic Features. IEEE Transactions on Audio, Speech &
Language Processing, 21(9): 1841-1853, 2013
140. V. Hautamaki, T. Kinnunen, F. Sedlak,
Kong Aik Lee, Bin Ma, and Haizhou Li, Sparse Classifier Fusion for Speaker
Verification, IEEE Transactions on Audio, Speech and Language Processing,
21(8): 1622-1631, August 2013
141. Douglas
D. O'Shaughnessy, Li Deng, Haizhou Li: Speech Information Processing: Theory
and Applications [Scanning the Issue]. Proceedings of the IEEE vol. 101, No. 5
pp. 1034-1037, May 2013
142. Haizhou
Li, Kong Aik Lee, and Bin Ma, Spoken Language Recognition: From Fundamentals to
Practice, Proceedings of the IEEE, vol. 101, No. 5, pp. 1136 – 1159, May 2013
143. Jiali Yu, Huajin Tang, Haizhou Li, Dynamics
Analysis of a Population Decoding Model, IEEE Transactions on Neural Networks
and Learning Systems, vol. 24, No. 3, 2013
144. Jiali Yu, Huajin Tang, Haizhou Li, Luping Shi, Dynamical properties of continuous attractor
neural network with background tuning, Neurocomputing, vol. 99, pp. 439 - 447,
2013
145. Zhizheng
Wu, Tomi Kinnunen, Eng Siong Chng, Haizhou Li, Mixture of factor analyzers
using priors from non-parallel speech for voice conversion, IEEE Signal
Processing Letters, 19(12), pp. 914-917, 2012
146. Omid Dehzangi, Bin Ma, Eng-Siong Chng and Haizhou Li,
Discriminative Feature Extraction for Speech Recognition Using Continuous
Output Codes, Pattern Recognition Letters, 33 (2012), pp. 1703-1709.
147. Liyuan
Li, Shuicheng Yan, Xinguo Yu, Yeow Kee Tan, and
Haizhou Li, Robust Multiperson Detection and Tracking
for Mobile Service and Social Robots, IEEE Transactions on Systems, Man, and
Cybernetics, Part B: Cybernetics, vol. 42, No. 5, 2012
148. T.
Kinnunen, R. Saeidi, F. Sedlak, Kong Aik Lee, J.
Sandberg, M. Hansson-Sandsten, Haizhou Li,
Low-Variance Multitaper MFCC Features: a Case Study in Robust Speaker Verification, IEEE
Transactions on Audio, Speech and Language Processing, 20(7): 1990-2001,
September 2012
149. Andreea
Niculescu, Betsy van Dijk, Anton Nijholt, Haizhou Li,
See Swee Lan: Making Social Robots More Attractive: The Effects of Voice Pitch,
Humor and Empathy. International Journal of Social Robotics 5(2): 171-191
(2013)
150. Wenliang
Chen, Jun'ichi Kazama, Min Zhang, Yoshimasa Tsuruoka, Yujie Zhang, Yiou Wang,
Kentaro Torisawa, Haizhou Li, Bitext Dependency
Parsing With Auto-Generated Bilingual Treebank, IEEE
Transactions on Audio, Speech and Language Processing, 20(5): 1461-1472 (2012)
151. Xiaoxuan Wang, Lei Xie, Mimi Lu, Bin Ma, Engsiong Chng, Haizhou Li, Broadcast News Story
Segmentation Using Conditional Random Fields and Multimodal Features. IEICE
Transactions on Information and Systems, vol. E95-D, No.5, pp.1206-1215, 2012
152. Yi Ren
Leng, Tran Huy Dat, Norihide Kitaoka, Haizhou Li:
Selective Gammatone Envelope Feature for Robust Sound Event Recognition. IEICE
Transactions 95-D (5): 1229-1237, 2012
153. Rui
Yan, Keng Peng Tee, Yuanwei Chua, Haizhou Li, Huajin
Tang: Gesture Recognition Based on Localist Attractor Networks with Application
to Robot Control, IEEE Computational Intelligence Magazine, vol. 7, No. 1, pp.
64-74, 2012
154. Jin-Shea
Kuo, Haizhou Li: Learning regional transliteration variants, Information
Processing and Management, 48(1): 154-169, 2012
155. Tin
Lay Nwe, Hanwu Sun, Bin Ma, Haizhou Li, Speaker
Clustering and Cluster Purification Methods for RT07 and RT09 Evaluation
Meeting Data, IEEE Transactions on Audio, Speech and Language Processing, vol
20, No. 2, pp 461-473, 2012
156. Haizhou
Li, John-John Cabibihan,
Yeow Kee Tan: Towards an Effective Design of Social Robots, International
Journal of Social Robotics, 3(4), pp. 333-335, November 2011
157. Sakriani
Sakti, Michael Paul, Andrew Finch, Shinsuke Sakai,
Thang Tat Vu, Noriyuki Kimura, Chiori Hori, Eiichiro Sumita, Satoshi Nakamura, Jun Park, Chai Wutiwiwatchai, Bo Xu, Hammam Riza, Karunesh
Arora, Chi Mai Luong, Haizhou Li, A-STAR: Toward Translating Asian Spoken
Languages, Computer Speech and Language, vol. 27, No. 2, pp. 509 - 527, 2013
158. Huajin
Tang, Haizhou Li, Book Review: Information Theoretic Learning: Renyi's Entropy and Kernel Perspectives, IEEE Computational
Intelligence Magazine, vol. 6, No. 3, August 2011
159. Eliathamby
Ambikairajah, Haizhou Li, Liang Wang, Bo Yin, and Vidhyasaharan Sethu, Language
Identification: A Tutorial, IEEE Circuits and Systems Magazine, vol. 11, No. 2,
pp.82 - 108, 2011
160. Huajin
Tang, Haizhou Li, and Zhang Yi, Online learning and
stimulus-driven responses of neurons in visual cortex, Cognitive Neurodynamics, vol. 5, no. 1, pp. 77-85, 2011
161. Omid Dehzangi, Bin Ma, Eng-Siong Chng and Haizhou Li, Error
Corrective Fusion of Classifier Scores for Spoken Language, IEICE Transactions
on Information and Systems, Vol. E94-D, No.12, pp.2503-2512, 2011
162. Deyi
Xiong, Min Zhang, Haizhou Li, A Maximum Entropy Segmentation Model for
Statistical Machine Translation, IEEE Transactions on Audio, Speech and
Language Processing, 19 (8), November 2011
163. Huy Dat Tran,
Haizhou Li, Sound Event Recognition with Probabilistic Distance SVMs, IEEE
Transactions on Audio, Speech and Language Processing, vol. 19, No. 6, pp 1556
- 1568, 2011
164. Jonathan
Dennis, Huy Dat Tran,
Haizhou Li, Spectrogram Image Feature for Sound Event Classification in
Mismatched Conditions, IEEE Signal Processing Letters, vol. 18, No. 2, pp 130 -
133, February 2011
165. Haizhou
Li, Ma Bin, TechWare: Speaker and Spoken Language
Recognition Resources, IEEE Signal Processing Magazine, vol. 27, No. 6, pp
139-142, November 2010
166. Kong
Aik Lee, Chang Huai You, Haizhou Li, Tomi Kinnunen, and Khe
Chai Sim, Using Discrete Probabilities with Bhattacharyya Measure for SVM-based
Speaker Verification, IEEE Transactions on Audio, Speech and Language
Processing, 19(4), pp.861 - 870, May 2011
167. Deyi
Xiong, Min Zhang, Aiti Aw, Haizhou Li, Linguistically
Annotated Reordering Evaluation and Analysis, Computational Linguistics, vol.
36, No. 3, pp 535-568, 2010
168. Donglai Zhu, Bin Ma, Haizhou Li, Speaker Verification
with Feature-Space MAPLR Parameters, IEEE Transactions on Audio, Speech and
Language Processing, vol. 19, No. 3, pp 505-515, March 2011
169. Huajin
Tang, Haizhou Li, Zhang Yi, A Discrete-Time Neural Network for Optimization
Problems with Hybrid Constraints, IEEE Transactions on Neural Networks, vol.
21, no. 7, pp. 1184-1189, 2010
170. Namunu
C. Maddage, Haizhou Li, Beat Space Segmentation and Octave Scale Cepstral
Feature for Sung Language Recognition in Pop Music, ACM Transactions on
Multimedia Computing, Communications and Applications (TOMCCAP), vol. 7 Issue
4, November 2011, Article No. 37
171. Lei
Wang, Eng Siong Chng, Haizhou Li, A Tree-Construction Search Approach for
Multivariate Time Series Motifs Discovery, Pattern Recognition Letters, vol.
31, No. 9, pp 869-875, 2010
172. Huajin
Tang, Haizhou Li, and Rui Yan, Memory Dynamics in Attractor Networks with
Saliency Weights, Neural Computation, 22(7), pp. 1899-1926, July 2010
173. Chang
Huai You, Kong Aik Lee, Haizhou Li, GMM-SVM Kernel with a Bhattacharyya-Based
Distance for Speaker Recognition, IEEE Transactions on Audio, Speech and
Language Processing, vol. 18, No. 6, pp. 1300-1312, 2010
174. Tomi
Kinnunen, Haizhou Li, An Overview of Text-Independent Speaker Recognition: from
Features to Supervectors, Speech Communication 52 (1), 2010, pp. 12-40 (Speech
Communication Most Cited Article 2007-2013)
175. Xiong
Xiao, Jinyu Li, Eng Siong Chng, Haizhou Li, Chin-Hui Lee, A Study on the
Generalization Capability of Acoustic Models for Robust Speech Recognition,
IEEE Transactions on Audio, Speech and Language Processing, vol. 18, No 6,
pp. 1158-1169, 2010
176. Namunu
C. Maddage, Khe Chai Sim, Haizhou Li, Word Level
Automatic Alignment of Music and Lyrics using Vocal Synthesis, ACM Transactions
on Multimedia Computing, Communications, and Applications (TOMCCAP), vol. 6,
No. 3, 2010
177. Huy Dat Tran,
Haizhou Li, Jump Function Kolmogorov for Audio Classification in Noise-mismatch
Conditions, IEEE Transactions on Signal Processing, vol. 57, No 8, pp
2908-2918, 2009
178. Tee
Kiah Chia, Khe Chai Sim, Haizhou Li and Hwee Tou Ng,
Statistical Lattice-Based Spoken Document Retrieval, ACM Transactions on
Information Systems, vol. 28, No. 1, 2010
179. Rong
Tong, Bin Ma, Haizhou Li, and Eng Siong Chng, A Target-Oriented Phonotactic
Front-end for Spoken Language Recognition, IEEE Transactions on Audio, Speech
and Language Processing, vol. 17, No 7, pp.1335-1347, 2009
180. Donglai Zhu, Haizhou Li, Bin Ma, and Chin-Hui
Lee, Optimizing the Performance of Spoken Language Recognition with
Discriminative Training, IEEE Transactions on Audio, Speech and Language
Processing, vol. 16, No. 8, pp.1642-165, 2008
181. Chang
Huai You, Kong-Aik Lee, and Haizhou Li, An SVM Kernel with GMM-Supervector Based
on the Bhattacharyya Distance for Speaker Recognition, IEEE Signal Processing
Letters, vol. 16, No. 1, pp.49-52, 2009
182. Xiong
Xiao, Eng Siong Chng, Haizhou Li, Normalization of the Speech Modulation
Spectra for Robust Speech Recognition, IEEE Transactions on Audio, Speech and
Language Processing, vol. 16, No. 8, pp.1662-1674, 2008
183. Haizhou
Li, Jin-Shea Kuo, Jian Su, Chih-Lung Lin, Mining Live Transliterations using
Incremental Learning Algorithms, International Journal of Computer Processing Of Languages, vol. 21, No. 2, pp. 183-203, 2008
184. Khe Chai Sim and Haizhou Li, On Acoustic
Diversification Front-end for Spoken Language Identification, IEEE Transactions
on Audio, Speech and Language Processing, vol. 16, No. 5, pp.1029-1037, 2008
185. Bin
Ma, Haizhou Li, and Rong Tong, Spoken Language Recognition with Ensemble
Classifiers, IEEE Transactions on Audio, Speech and Language Processing, vol.
15, No. 7, 2007
186. Jin-shea
Kuo, Haizhou Li, and Ying-Kuei Yang, Active Learning
for Constructing Transliteration Lexicons from the Web, Journal of the American
Society for Information Science and Technology, vol. 59, No. 1, 2008
187. Xiong
Xiao, Eng Siong Chng, and Haizhou Li, Temporal structure normalization of
speech feature for robust speech recognition, IEEE Signal Processing Letters,
vol. 14, No. 7, 2007
188. Jin-Shea
Kuo, Haizhou Li, Ying-Kuei Yang, A Phonetic
Similarity Model for Automatic Extraction of Transliteration Pairs, ACM
Transactions on Asian Language Information Processing, vol. 6, Issue 2,
September, 2007
189. Tin
Lay Nwe and Haizhou Li, Exploring Vibrato-Motivated Acoustic Features for
Singer Identification, IEEE Transactions on Audio, Speech and Language
Processing, vol. 15, No. 2, 2007
190. Haizhou
Li, Bin Ma, and Chin-Hui Lee, A Vector Space Modeling Approach to Spoken
Language Identification, IEEE Transactions on Audio, Speech and Language
Processing, vol. 15, No. 1, 2007
Books
and Book Chapters
1. Haizhou
Li, Kar-Ann Toh, Liyuan Li, Advanced Topics in Biometrics, World Scientific,
2011
2. Haizhou
Li, Bin Ma, and Chin-Hui Lee, Vector-based Spoken Language Classification, in
Springer Handbook of Speech Processing, Jacob Benesty,
M. Mohan Sondhi, Arden Huang (editors), Springer 2007
3. Chin-Hui
Lee, Haizhou Li, Lin-shan Lee, Renhua Wang, and Qiang Huo (editors), Advances in
Chinese Spoken Language Processing, World Scientific, 2007
4. Shuzhi
Sam Ge, Haizhou Li, John-John Cabibihan and Yeow Kee
Tan (editors), Social Robotics, Springer Lecture Notes in Artificial
Intelligence 6414, 2010
5. Qiang Huo, Bin Ma,
Eng Siong Chng, and Haizhou Li (editors), Chinese Spoken Language Processing,
Springer Lecture Notes in Artificial Intelligence 4274, 2006
6. Yinglin Yu, Haizhou Li, Neural Networks and
Signal Analysis, South China University of Technology Press, 1996
Teaching
1. CSC3020
Machine Learning (CUHK-SZ)
2. EE2211
Introduction to Machine Learning (NUS)
3. EE2012
Analytical Methods in Electrical and Computer Engineering (NUS)
4. EE6733
Advanced Topics on Vision and Machine Learning (NUS)
Ph.D.
Students
1. Jiacheng ZHANG (CUHKSZ-SLAI), 09/2025 -
2. Shaochen ZHANG (CUHKSZ-SLAI), 09/2025 -
3. Shuhan
ZHANG (CUHKSZ-SLAI), 09/2025 -
4. Sirui LI (CUHKSZ), 09/2025 -
5. Youcun ZHENG (CUHKSZ), 09/2025 -
6. Fan BU
(CUHKSZ-SLAI), 09/2025 -
7. Chenyu
YANG (CUHKSZ), 09/2024 -
8. Kuang
WANG (CUHKSZ), 09/2024 -
9. Qibing
BAI (CUHKSZ), 09/2023 -
10. Zhijun
LIU (CUHKSZ), 09/2023 -
11. Zheyuan
LIN (CUHKSZ), 09/2023 -
12. Dedimuni Dashanka Nadeeshan
De Silva (U Bremen), 2023 -
13. Saurav
Pahuja (U Bremen), 2022 -
14. Wenxuan
WU (CUHK), 09/2022 -
15. Junyi
AO (CUHKSZ), 09/2022 -
16. Sho
INOUE (CUHKSZ), 09/2022 -
17. Mehmet
Sinan YILDIRIM (NUS), 01/2022 -
18. Jingru
LIN (NUS), 08/2022 -
19. Yidi
JIANG (NUS), Target Speech and Audio Event Detection via Multimodal Cues,
08/2021 -01/2026, thesis
20. Zeyang
SONG (NUS), Scalable and Efficient Spiking Neural Networks for Speech
Processing, 08/2021 -02/2026
21. Yi MA
(NUS), Leveraging Interpretability for Speaker Verification, 08/2020 – 12/2025,
thesis
22. Junchen
LU (NUS), Expressive Speech Synthesis, 08/2020 – 02/2026
23. Marvin
Borsdorf (U Bremen), Speech Separation for Monolingual and Multilingual
Cocktail Party Scenarios, 2025.10, web, thesis
24. Wupeng
WANG (NUS), Domain-Invariant Speech Separation in Real Scenarios, 2025.06, web,
thesis
25. Yiming
CHEN (NUS), Semi-Supervised and Adversarial Data Synthesis for Language
Modeling, 2025, web, thesis
26. Victor
Li Chuang (NUS), Towards Holistic and Proactive Conversational Recommender
Systems, 2025, web, thesis
27. Tianchi
LIU (NUS), Advances in Robust and Practical Speaker Verification, 2024, web,
thesis
28. Qu
YANG (NUS), Speech Processing Using Spiking Neural Networks, 2024, web, thesis
29. Xuehao
ZHOU (NUS), Cross-Regional Text-to-Speech Synthesis with Language and Accent
Diversity, 2024 (Huawei, Singapore) web, thesis
30. Jiadong WANG (NUS), Cross-Modality
Complementarity for Audio-Visual Speech Recognition, 2024 (TUM Germany) web, thesis
31. Xianghu YUE (NUS), Self-Supervised
Modeling for Multimodal Understanding, 2024 (NUS Singapore) web, thesis
32. Zexu PAN (NUS), Look Attentively to Hear:
Audio-Visual Speaker Extraction, 2023 (Alibaba
DAMO Academy, Singapore) web, thesis
33. Qinyi WANG (NUS), Code-Switch Detection Techniques and Language Modeling
Strategies for Automatic Speech Recognition,
2023 (Huawei, Singapore) web, thesis
34. Kun ZHOU (NUS), Emotion Modeling for Speech Generation,
2023 (Alibaba
DAMO Academy, Singapore) web, thesis
35. Chen ZHANG (NUS), Self-Supervised
Modeling for Open-Domain Dialogue Evaluation, 2023 web, thesis
36. Ruijie TAO (NUS), Audio-Visual
Active Speaker Detection and Recognition, 2023 web, thesis
37. Nana HOU (NTU), Mismatch Problem in Deep‑learning based Speech Enhancement, 2023 (Zoom, Singapore) web, thesis
38. Xiaoxue GAO (NUS), Automatic Lyrics Transcription of
Polyphonic Music, 2022 (A*STAR,
Singapore) web, thesis
39. Zihan CHEN (SUTD), Adaptive Communication-efficient
Federated Learning on Real-world Data, 2022 web, thesis
40. Yi ZHOU (NUS), Cross-Lingual Voice
Conversion, 2021 (Tomato.ai, US) web, thesis
41. Grandee LEE (NUS), Cross-Lingual
Language Modeling, Methods and Applications, 2021 (Singapore University of
Social Sciences, Singapore) web, thesis
42. Zihan PAN (NUS), Neural Encoding of
Auditory Signals in Spiking Neural Networks, 2020 (A*STAR, Singapore) web
43. Jibin WU (NUS),
Auditory information processing using spiking neural networks, 2020 (The
Hong Kong Polytechnic University, Hong Kong SAR) web, thesis
44. Chenglin
XU (NTU), Single channel multi-talker speech separation with deep learning,
2020 (Kuaishou, China) web, thesis
45. Paul
Yaozhu CHAN (NUS), The psychoacoustics and synthesis of singing harmony, 2020 (A*STAR,
Singapore) web, thesis
46. Berrak
SISMAN (NUS), Machine learning for limited data voice conversion, 2020 (University
of Texas at Dallas, US) web, thesis
47. Malu
Zhang (UESTC), On the study of spiking machine learning algorithms, 2019
(University of Electronic Science and Technology of China, China)
48. Chitralekha
GUPTA (NUS), Comprehensive evaluation of singing quality, 2019 (NUS Singapore)
web, thesis
49. Nicole
MIRNIG (Salzburg), Essential of robot feedback: On developing a taxonomy for
human-robot interaction, 2019 (University of Salzburg, Austria), thesis
50. Wenda
CHEN (UIUC), Modeling phones, keywords, topics and
intents in spoken languages, 2019 (A*STAR, Singapore) web
51. Van
Tung PHAM (NTU), Robust spoken term detection using partial search and
re-scoring hypothesized detections techniques, 2018, thesis
52. Tze Yuang CHONG (NTU), Exploiting long context using joint
distance and occurrence information for language modeling, 2018, thesis
53. Duc
Hoang Ha NGUYEN (NTU), Feature-based robust techniques for speech recognition,
2017, thesis
54. Chong
ZHANG (NUS), Computational intelligence in diagnostic and prognostic
applications, 2017, thesis
55. Van
Hai DO (NTU), Acoustic modeling for speech recognition under limited training
data conditions, 2015 (Thuyloi University,
Vietnam), thesis
56. Zhizheng
WU (NTU), Spectral mapping for voice conversion, 2015 (The Chinese
University of Hong Kong, Shenzhen), thesis
57. Trung Hieu NGUYEN (NTU), Speaker diarization
in meetings domain, 2014, thesis
58. Lei
WANG (NTU), Audio pattern discovery and retrieval, 2012, thesis
59. Rong
TONG (NTU), Towards a high performance phonotactic features for spoken language
recognition, 2012 (Singapore Institute of Technology, Singapore), thesis
60. Omid
DEHZANGHI (NTU), Discriminative feature extraction for speech recognition using
continuous output codes, 2012, thesis
61. Xiong
XIAO (NTU), Robust speech features and acoustic models for speech recognition,
2009, thesis
62. Tee
Kiah CHIA (NUS), Lattice-based statistical spoken document retrieval, 2009, thesis
63. Hendra
SETIAWAN (NUS), Reordering in statistical machine translation: a function word,
syntax-based approach, 2008, thesis
MPhil
Students
1. Rui KE
(2025-)
2. Yihang
LIN (2024-)
Project
Acknowledgement
1. 2023.01.01
- 2026.12.31: National Natural Science Foundation of China (Grant No. 62271432)
2. 2024.02.06
- 2026.02.05: Shenzhen Science and Technology Program (Shenzhen Key Laboratory,
Grant No. ZDSYS20230626091302006)
3. 2022.10.28
- 2025.10.31: Shenzhen Science and Technology Research Fund (Fundamental
Research Key Project, Grant No. JCYJ20220818103001002)
4. 2024.09.01
- 2029.08.31: Program for Guangdong Introducing Innovative and Entrepreneurial
Teams, Grant No. 2023ZT10X044
5. 2019 -
Deutsche Forschungsgemeinschaft (DFG, German Research
Foundation) under Germany's Excellence Strategy (University Allowance, EXC
2077, University of Bremen).
6. 2024 -
Hearable-centered assistance: From sensor to participation - Hearaz (GRK 2969) funded by Deutsche Forschungsgemeinschaft
(DFG, German Research Foundation)
Postal Address
1. School
of Artificial Intelligence, Shenzhen Research Institute of Big Data, The
Chinese University of Hong Kong, Shenzhen, Guangdong 518172, China
2. Department
of Electrical and Computer Engineering, National University of Singapore,
Singapore 117583
3. Machine
Listening Lab, University of Bremen, 28359 Bremen, Germany
Short Bio (for IEEE Transactions)
Haizhou
Li (Fellow, IEEE) received the B.Sc., M.Sc., and Ph.D. degrees in electrical and
electronic engineering from the South China University of Technology,
Guangzhou, China, in 1984, 1987, and 1990 respectively. He is currently a
Presidential Chair Professor and the Dean of the School of Artificial
Intelligence, The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen),
China. He is also an Adjunct Professor with the Department of Electrical and
Computer Engineering, National University of Singapore, Singapore. Prior to
that, he taught with the University of Hong Kong, Hong Kong (1988–1990), and
South China University of Technology (1990–1994). He was a Visiting Professor
with CRIN in France (1994–1995), a Research Manager with the Apple-ISS Research
Centre (1996–1998), a Research Director with Lernout & Hauspie Asia Pacific
(1999–2001), a Vice President with InfoTalk Corp.
Ltd. (2001–2003), and the Principal Scientist with the Institute for Infocomm Research, Singapore (2003–2016). His research
interests include speech information processing, natural language processing,
and neuromorphic computing. Dr. Li was an Editor-in-Chief of IEEE/ACM
Transactions on Audio, Speech and Language Processing (2015–2018), the
President of the International Speech Communication Association (2015–2017),
the President of Asia Pacific Signal and Information Processing Association
(2015–2016), the President of Asian Federation of Natural Language Processing
(2017–2018), and the Vice President of the IEEE Signal Processing Society (2024–2026).
He was the General Chair of ACL 2012, INTERSPEECH 2014, ASRU 2019 and ICASSP 2022.
Dr. Li is an ISCA Fellow, IEEE Fellow, AAIA Fellow, and a Fellow of the Academy
of Engineering Singapore. He was the recipient of the National Infocomm Award 2002, and the President's Technology Award
2013 in Singapore. He was named one of the two Nokia Visiting Professors in
2009 by the Nokia Foundation, and has been a U Bremen Excellence Chair
Professor since 2019.
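Short Bio (translated from Chinese)
Professor Haizhou Li is currently the Dean of the School of Artificial
Intelligence and X. Q. Deng Presidential Chair Professor at The Chinese
University of Hong Kong, Shenzhen. He is also an Excellence Chair Professor at
the University of Bremen, Germany. Previously, he was a tenured Professor at
the National University of Singapore, and the Principal Scientist and Research
Director of the Institute for Infocomm Research, Agency for Science, Technology
and Research, Singapore. Professor Li served as Editor-in-Chief of IEEE/ACM
Transactions on Audio, Speech and Language Processing (2015-2018), a member of
the IEEE Speech and Language Processing Technical Committee (2013-2015), a
member of the IEEE Signal Processing Society Publications Board (2015-2018), a
member of the IEEE Signal Processing Society Awards Board (2021-2023), a member
of the IEEE Signal Processing Society Conferences Board (2023-2024), and Vice
President of the IEEE Signal Processing Society (2024-2026). He was President
of the International Speech Communication Association (2015-2017), President of
the Asia Pacific Signal and Information Processing Association (2015-2016), and
President of the Asian Federation of Natural Language Processing (AFNLP,
2017-2018). He has also served as General Chair of several major conferences,
including ACL 2012, INTERSPEECH 2014, ICASSP 2022, and APSIPA ASC 2025.
Professor Li was named a Nokia Professor by the Nokia Foundation in 2009,
received the President's Technology Award, the highest science and technology
honour of the Republic of Singapore, in 2013, was elevated to IEEE Fellow in
2014, received the ASEAN Outstanding Engineering Achievement Award in 2015, and
was elected a Fellow of the Academy of Engineering Singapore in 2021.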
Resources: .PNG Files of Logos
1. Human
Language Technology Lab (HLT) logos
2. Language,
Intelligence, and Machines Centre (LIMA) logos
3. Neuromorphic
Auditory Perception Project (NAP) logos
4. Shenzhen
Key Laboratory of Cross-Modal Cognitive Computing Lab (C3Lab) logos