|
|
|
Haizhou Li received the B.Sc, M.Sc,
and Ph.D degrees in electrical and electronic engineering from South China
University of Technology, Guangzhou, China in 1984, 1987, and 1990
respectively. He is now the Executive Dean and Presidential Chair Professor at
the School of Data Science, The Chinese University of Hong Kong, Shenzhen
(CUHK-Shenzhen), China. Dr. Li is also with the Department of Electrical and
Computer Engineering, National University of Singapore (NUS), Singapore.
Dr. Li has worked on speech and
language technology in academia and industry since 1988. He has taught in The
University of Hong Kong (1988-1989), South China University of Technology in
Guangzhou, China (1990-1994), Nanyang Technological University in Singapore
(2006-2016), University of Eastern Finland (2009), and University of New South
Wales (since 2011). He was a Visiting Professor at CRIN/INRIA in France
(1994-1995). Prior to joining CUHKSZ and NUS, he was a Research Manager in
Apple-ISS Research Centre (1996-1998), Research Director of Lernout &
Hauspie Asia Pacific (1999-2001), Vice President of InfoTalk Corp. Ltd and
General Manager of InfoTalk Technology (Singapore) Pte Ltd (2001-2003), the Principal
Scientist and Department Head of Human Language Technology at the Institute for
Infocomm Research (2003-2016), and the Research Director of the Institute for
Infocomm Research (2014-2016), the Agency for Science, Technology and Research,
Singapore. He co-founded Baidu-I2R Research Centre in Singapore (2012). Dr. Li
was known for his technical contributions to several award-winning speech
products, such as Apple's Chinese Dictation Kits for Macintosh (1996) and
Lernout & Hauspie's Speech-Pen-Keyboard Text Entry Solution for Asian
languages (1999). He was the architect of a series of major technology
deployments that include TELEFIQS voice-automated call centre service in
Singapore Changi International Airport (2001), voiceprint engine for Lenovo A586
Smartphone (2012), and Baidu Music Search (2013).
Dr. Li's research interests include
automatic speech recognition, natural language processing and neuromorphic
computing. He has served as the Editor-in-Chief of IEEE/ACM TRANSACTIONS ON
AUDIO, SPEECH AND LANGUAGE PROCESSING (2015-2018), Associate Editor (2008-2012)
and Senior Area Editor (2014-2016) of IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH
AND LANGUAGE PROCESSING, Associate Editor (2012-2013) of ACM TRANSACTIONS ON
SPEECH AND LANGUAGE PROCESSING, Computer Speech and Language (2012-2024),
Springer International Journal of Social Robotics (2008-2024), and a Member of
IEEE Speech and Language Processing Technical Committee (2013-2015), Awards
Board (2021-2023), Publications Board (2015-2018), and Conference Board
(2023-2024) of IEEE Signal Processing Society. He has served as the President
of the International Speech Communication Association (ISCA, 2015-2017), the
President of Asia Pacific Signal and Information Processing Association
(APSIPA, 2015-2016), the President of the Chinese and Oriental Language
Information Processing Society (COLIPS, 2015-2024), the President of the Asian
Federation of Natural Language Processing (AFNLP, 2017-2018), the Vice
President (Conferences) of IEEE Signal Processing Society (2024-2026). He was
the General Chair of ACL 2012, INTERSPEECH 2014, IWSDS 2019, ASRU 2019, and
IEEE ICASSP 2022, the Local
Arrangement Chair of SIGIR 2008 and ACL-IJCNLP 2009, and the Technical Program
Chair of ISCSLP 1998, APSIPA Annual Summit and Conference 2010, IEEE Spoken
Language Technology Workshop 2014, and IEEE ChinaSIP 2015.
Dr. Li was the recipient of
National Infocomm Awards 2002, Institution of Engineers Singapore (IES)
Prestigious Engineering Achievement Award 2013 and 2015, President's Technology
Award 2013, and MTI Innovation Activist Gold Award 2015 in Singapore. He was
named one of the two Nokia Visiting Professors in 2009 by Nokia Foundation,
IEEE Fellow in 2014 for leadership in multilingual, speaker and language
recognition, ISCA Fellow in 2018 for contributions to multilingual speech
information processing, Bremen Excellence Chair Professor in 2019, Fellow of
the Academy of Engineering Singapore in 2022, and DFG Mercator Fellow in 2022.
Dr. Li is a member of ACL, ACM, and APSIPA.
1.
Fellow,
DFG Mercator Fellow, 2022
2.
Fellow,
Asia-Pacific Artificial Intelligence Association, 2022
3.
Fellow,
Academy of Engineering Singapore, 2022
4.
Bremen
Excellence Chair Professor, Germany, 2019
5.
Fellow
of the International Speech Communication Association 2018 (citation: for
contributions to multilingual speech information processing)
6.
First
Prize at 2nd International Collegiate Competition for Brain-Inspired Computing,
Beijing, China, 2018
7.
A*STAR
Awards 2016 (A*STAR Borderless Awards: Autonomous Vehicle Programme)
8.
PS21
ExCEL Awards 2015 Innovation Champion (Bronze), Prime Minister's Office,
Singapore (Citation: for efforts in practicing application research to pursue
fundamental understandings of speech recognition and machine translation
technologies)
9.
ASEAN
Outstanding Engineering Achievement Award 2015, ASEAN Federation of Engineering
Organizations (Citation: in recognition of an outstanding engineering project
which has made significant contributions to the country's development - Speak
to Me in My Language)
10.
MTI
Innovation Activist Gold Award 2015, Ministry of Trade and Industry, Singapore
11.
Best
Technology Show and Tell Award, INTERSPEECH 2014
12.
IEEE Fellow 2014 (Citation: for leadership in
multilingual, speaker and language recognition.)
13.
President's Technology Award 2013, Singapore (Citation: for the
outstanding contributions to human language technology that have empowered the
industry and benefited the Asian society. see
also photo and speech by Minister S.
Iswaran at
National Archives of Singapore)
14.
IES
Prestigious Engineering Achievement Award 2013, Singapore (voiceprint
technology)
15.
The
Most Cited Article, Speech Communication, 2007-2013
16.
Distinguished
Alumni Awards, South China University of Technology, 2012 (SCUT 60th
Anniversary)
17.
Nokia Visiting Professor 2009, Nokia Foundation
18.
Achiever
of the Year 2007/08, Institute for Infocomm Research, A*STAR
19.
The
Enterprise Challenge Awards 2004, Prime Minister's Office, Singapore
20.
National
Infocomm Awards 2002, Infocomm Development Authority, Singapore
2.
Best
Paper Award 2021, Chen Zhang, Luis Fernando D`Haro, Thomas Friedrichs, Haizhou
Li and Yiming Chen, Investigating the Impact of Pre-trained Language Models on
Dialog Evaluation, The 12th International Workshop on Spoken Dialog System
Technology, 15-17 November 2021, Singapore.
3.
Best
Paper Award 2021, Qian Xinyuan, Bidisha Sharma, Amine El Abridi and Haizhou Li,
SLoClas: A DATABASE FOR JOINT SOUND LOCALIZATION AND CLASSIFICATION, The 24th
Conference of the Oriental COCOSDA, 18-20 November 2021, Singapore.
4.
IEEE
Computational Intelligence Magazine Outstanding Paper Award 2019, How the Brain
Formulates Memory: A Spatio-Temporal Model, Jun Hu, Huajin Tang, Kay Chen Tan
and Haizhou Li, IEEE Computational Intelligence Magazine, vol. 11, no. 4, pp.
56-68, May 2016
5.
Featured
productive , and innovative author in speech and language
processing (1965-2015) by the NLP4NLP Corpus, 2019
6.
AI
2000 Speech Recognition Most Influential Scholars Honorable Mention (2009-2019)
7.
Poster
Presentation Award, A Dual Alignment Scheme for Improved Speech-to-Singing
Voice Conversion, The 9th APSIPA Annual Summit and Conference, 12-15 December,
2017, Kuala Lumper, Malaysia
8.
Best
Paper Award, Computer-Assisted Pronunciation Training: From Pronunciation
Scoring Towards Spoken Language Learning, Nancy F. Chen, Haizhou Li, The 8th
APSIPA Annual Summit and Conference, 13-16 December, 2016, Jeju, Korea
9.
IEEE
Computational Intelligence Society Outstanding TNNLS Paper Award 2016, Rapid
Feedforward Computation by Temporal Encoding and Learning with Spiking Neurons,
Qiang Yu, Huajin Tang, Kay Chen Tan and Haizhou Li, IEEE Transactions on Neural
Networks and Learning Systems, Vol. 24, No. 10, pp. 1539-1552, 2013
10.
Best
Paper Award, Spoken Keyword Spotting Based on DTW, Jinyong Hou, Lei Xie, Peng
Yang, Xiong Xiao, Zhixiang Liang, Haihua Xu, Lei Wang, Bin Ma, Hang Lu, Eng
Siong Chng, Haizhou Li, China National Conference on Man-Machine Speech
Communication (NCMMSC) 2015, October 2015, Tianjin China
11.
Best
Paper Award, Parallel Inference of Dirichlet Process Gaussian Mixture Models
for Unsupervised Acoustic Modeling: A Feasibility Study, Hongjie Chen,
Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, ZeroSpeech 2015 Challenge at
INTERSPEECH 2015, 6-10 September 2015, Dresden, Germany
12.
Best
Paper M. Anandakrishnan Award, A Cloud-based Large Vocabulary Speech
Recognition System for Tamil, Sunil Sivadas, Boon Pang Lim, Thai NgocThuy Hoang
Helen, Muthalagu Meyyappan, Bin Ma, and Haizhou Li, 14th Tamil Internet
Conference 2015, 30 May - 1 June 2015, Singapore
13.
Best
Paper Award, The 4th Asia Pacific Signal and Information Processing Association
Annual Summit and Conference, A Study on Spoofing Attack in State-of-the-Art
Speaker Verification: the Telephone Speech Case, Zhizheng Wu, Tomi Kinnunen,
Eng Siong Chng, Haizhou Li, and Eliathamby Ambikairajah, in Proc. APSIPA ASC
2012, 3-6 December, 2012, Hollywood, California, USA
1.
Best
Student Paper Award, Use of Claimed Speaker Models for Replay Detection, The
10th APSIPA Annual Summit and Conference, 12-15 November 2018 in Honolulu, USA
2.
Best
Student Paper Award, Perceptual Evaluation of Singing Quality, Chitralekha
Gupta, Haizhou Li, Ye Wang, The 9th APSIPA Annual Summit and Conference, 12-15
December, 2017, Kuala Lumper, Malaysia
3.
IEEE
Ganesh N. Ramaswamy Memorial Student Grant 2015, Source-Specific Informative
Prior for i-Vector Extraction, Sven Shepstone, Kong Aik Lee, Haizhou Li,
Zheng-Hua Tan, and Soren Holdt Jensen, ICASSP 2015, 19-24 April 2015, Brisbane,
Australia
4.
IEEE
Ganesh N. Ramaswamy Memorial Student Grant 2014, Minimum Divergence Estimation
of Speaker Prior in Multi-Session PLDA Scoring, Liping Chen, Kong Aik Lee, Bin
Ma, Wu Guo, Haizhou Li, Li Rong Dai, ICASSP 2014, 4-9 May 2014, Florence, Italy
5.
ISCA
International Symposium on Chinese Spoken Language Processing, Best Student
Paper Award 2010, Factor Analysis based Spatial Correlation Modeling for
Speaker Verification, Eryu Wang, Kong Aik Lee, Bin Ma, Haizhou Li, Wu Guo, and
Lirong Dai, in Proc. ISCSLP, pp. 166 - 170, 29 November - 3 December 2010, Sun
Moon Lake, Taiwan
1.
Vice
President, IEEE Signal Processing Society 2024-2026
2.
Member,
Awards Board, IEEE Signal Processing Society 2021-2023
3.
Member,
Fellow Evaluation Committee, IEEE Signal Processing Society 2019
4.
President,
Asian Federation of Natural Language Processing (AFNLP), 2017-2018
5.
President,
International Speech Communication Association (ISCA), 2015-2017
6.
Vice
President, Asian Federation of Natural Language Processing (AFNLP), 2015-2016
7.
Member,
Publications Board, IEEE Signal Processing Society, 2015-2017
8.
Vice
President, International Speech Communication Association (ISCA), 2013-2015
9.
Board
Member, International Speech Communication Association (ISCA), 2009-2017
10.
Board
Member, Asian Federation of Natural Language Processing (AFNLP), 2006-2012
11.
Committee
Member, IEEE Speech and Language Processing Technical Committee, 2013-2015
12.
Committee
Member, IEEE Singapore Computer Chapter, 2010-2011
13.
Committee
Member, IEEE Singapore, Systems, Man, & Cybernetics Chapter, 2011-2014
14.
President,
Teochew Doctorate Society, Singapore 2018-2022
15.
President,
Chinese and Oriental Languages Information Processing Society, 2011-2022
16.
President,
Asia Pacific Signal and Information Processing Association, 2015-2016
17.
President-Elect,
Asia Pacific Signal and Information Processing Association, 2013-2014
18.
President
(2006-2014), Honorary President (2015-), South China University of Technology
Alumni Association (Singapore)
19.
Chair,
ISCA Special Interest Group on Chinese Spoken Language Processing, ISCA,
2011-2014
20.
Member
of Standing Committee, National Conference on Man-Machine Speech
Communications, China, 2006-
1.
Editor-in-Chief,
IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, 2015-2018
2.
Guest
Associate Editor, Frontiers in Neuroscience, 2018
3.
Editor,
Signal Processing Repository, IEEE Signal Processing Society, 2013-2014
4.
Editor,
IEEE Speech and Language Processing Technical Committee Newsletter, 2013-2015
5.
Senior
Area Editor, IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING,
2014
6.
Associate
Editor, IEEE TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, 2009-2012
7.
Associate
Editor, Springer International Journal of Social Robotics, 2007-present
8.
Associate
Editor, ACM TRANSACTIONS ON SPEECH AND LANGUAGE PROCESSING, 2011-2013
9.
Associate
Editor, Journal of Multimedia, 2013-2014
10.
Associate
Editor, Computer Speech and Language, 2012- present
11.
Editor,
IEEE Speech and Language Processing Technical Committee Newsletter, 2013-2015
12.
Guest
Editor, PROCEEDINGS OF THE IEEE, 2013 (Special Issue on Speech Information
Processing)
13.
Guest
Editor, IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, 2014
(Special Issue on Continuous Space Language Modeling)
14.
Guest
Editor, IEEE TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, 2013
(Special Issue on Large Scale Optimization)
15.
Guest
Editor, The Institute of the Electronics, Information and Communication
Engineers (IEICE), 2012 (Special Issue on Recent Advances in Multimedia Signal
Processing Techniques and Applications)
16.
Guest
Editor, Computational Linguistics and Chinese Language Processing (CLCLP), 2007
17.
Guest
Editor, International Journal of Computer Processing of Oriental Languages
(IJCPOL), 2007
18.
Guest
Editor, ACM TRANSACTIONS ON ASIAN LANGUAGES INFORMATION PROCESSING, 2007
1.
Local
Chair, The 2023 Conference on Empirical Methods in Natural Language Processing
(EMNLP), 6-10 December, 2023, Singapore
2.
General
Chair, The 47th International Conference on Acoustics, Speech, and Signal
Processing (ICASSP), 22-27 May 2022, Singapore
3.
General
Chair, The 26th International Conference on Asian Language Processing (IALP),
27-28 Oct 2022, in Singapore and Shenzhen
4.
General
Chair, The 22nd Annual Meeting of the Special Interest Group on Discourse and
Dialogue, 29-31 July 2021, Singapore
5.
General
Chair, The 12th International Workshop on Spoken Dialog System Technology,
15-17 November 2021, Singapore
6.
General
Chair, The 24th Oriental COCOSDA, 18-20 November 2021, Singapore
7.
Senior
Area Chair, ACL-IJCNLP 2021, Thailand
8.
Area
Chair, The 21th Annual Conference of the International Speech Communication
Association (INTERSPEECH), 2020, Shanghai (online)
9.
Member
of Best Paper Committee, EMNLP 2020, Online conference
10.
Publicity
Chair, AACL-IJCNLP 2020, Online conference
11.
Senior
Area Chair, EMNLP-IJCNLP 2019, Hong Kong
12.
General
Chair, IEEE Workshop on Automatic Speech Recognition and Understanding 2019,
December 2019, Singapore
13.
Area
Chair, The 20th Annual Conference of the International Speech Communication
Association (INTERSPEECH), 2019, Graz, Austria
14.
Area
Chair, The 19th Annual Conference of the International Speech Communication
Association (INTERSPEECH), 2018, Hyderabad, India
15.
Associate
Editor, the 24th International Conference on Pattern Recognition (ICPR), 20-24
August 2018, Beijing, China
16.
General
Chair, The 5th International Conference on Orange Technologies (ICOT), 8-10
December 2017, Singapore
17.
Area
Chair, The 18th Annual Conference of the International Speech Communication
Association (INTERSPEECH), 2017 , Stockholm, Sweden
18.
Area
Chair, The 17th Annual Conference of the International Speech Communication
Association (INTERSPEECH), 2016 , San Francisco, USA
19.
Technical
Program co-Chair, The Third IEEE China Summit and International Conference on
Signal and Information Processing (ChinaSIP), 2015, Chengdu, China
20.
Area
Chair, IEEE International Conference on Acoustics, Speech and Signal Processing
(ICASSP), 2015, Brisbane, Australia
21.
Area
Chair, The 8th IAPR International Conference on Biometrics, 2015, Phuket,
Thailand
22.
General
Chair, The 15th Annual Conference of the International Speech Communication
Association (INTERSPEECH), 2014 , Singapore
23.
General
Chair, The 9th International Symposium on Chinese Spoken Language Processing,
(ISCA SIG CSLP) 2014, Singapore
24.
Technical
Program Chair, IEEE Workshop on Spoken Language Technology (SLT) 2014, South
Lake Tahoe
25.
Area
Chair, Conference on Empirical Methods in Natural Language Processing (EMNLP),
2013, Seattle, USA
26.
Publicity
Chair, Automatic Speech Recognition and Understanding Workshop (ASRU), 2013,
Olomouc, Czech Republic
27.
Publicity
Chair, 15th ACM International Conference on Multimodal Interaction (ICMI),
2013, Sydney, Australia
28.
Area
Chair, IEEE China Summit & International Conference on Signal and
Information Processing (ChinaSIP), 2013, Beijing, China
29.
Area
Chair, International Conference on Pattern Recognition, 2012 (Tsukuba, Japan),
2014 (Stockholm, Sweden)
30.
General
Chair, The 50th Annual Meeting of Association for Computational Linguistics
(ACL), 2012, Jeju, Korea
31.
Organizing
Chair, The Speaker and Language Recognition Workshop (Odyssey), 2012
32.
Posters
Chair, 5th ACM SIGGRAPH Conference and Exhibition on Computer Graphics and
Interactive Techniques in Asia, 2012, Singapore
33.
Area
Coordinator, INTERSPEECH 2010, 26-30 September 2010, Makuhari, Japan
34.
Workshop
Chair, The 2nd Named Entities Workshop with Shared Task on Machine
Transliteration, ACL 2010, Sweden
35.
Area
Chair, The 23rd International Conference on Computational Linguistics, COLING
2010, Beijing China
36.
Area
Chair, 2010 Conference on Empirical Methods on Natural Language Processing,
EMNLP 2010, Cambridge, Massachusetts, USA
37.
Program
Chair, The 2nd Asia Pacific Signal and Information Processing Association
Annual Summit and Conference, APSIPA ASC 2010, Singapore
38.
Conference
Chair, International Conference on Social Robotics 2010, Singapore
39.
General
Chair, International Conference on Asian Language Processing, IALP 2009,
Singapore
40.
Local
Organizing Chair, The 47th Annual Meeting of ACL - 4th International Conference
on Natural Language Processing, ACL-IJCNLP 2009, Singapore
41.
Workshop
Chair, The 1st Named Entities Workshop with Shared Task on Machine
Transliteration, ACL-IJCNLP 2009 Workshop, Singapore
42.
Local
Arrangements Chair, The 31st SIGIR (The 31st Annual International ACM SIGIR
Conference) 2008, Singapore
43.
Workshop
Chair, The 3rd IJCNLP (International Joint Conf. on Natural Language
Processing) 2008, Hyderabad
44.
Chair,
The 6th SIGHAN Workshop on Chinese Language Processing, 2008, Hyderabad,
45.
Technical
Track Chair, 5th ACM SIGGRAPH International Conference on Virtual-Reality
Continuum and its Applications in Industry (VRCAI 2008), 8-9 December 2008,
Singapore
46.
Program
Chair, Infocomm Horizons 2007
47.
Senior
Researcher, Johns Hopkins University 2007 Summer Workshop on Human Language
Technology
48.
Member
of Standing Committee, National Conference on Man-Machine Speech
Communications, China, 2006-
49.
General
Chair, The 5th ISCSLP (International Symposium on Chinese Spoken Language
Processing), 2006
1.
Member,
Academic Board, Master of Arts in Translation and Interpretation (MTI)
programme, Nanyang Technological University, Singapore (2016-2020)
2.
Member,
The Electrical and Computer Engineering Panel (2017-18), FCT, Portugal
3.
Member,
The RGC Engineering Panel, University Grants Committee, Hong Kong (2017-2020)
4.
Co-Chair,
A*STAR Lead User of RI Technologies Taskforce (2015)
5.
Member,
National Robotics Taskforce (2014)
6.
External
Reviewer, Research Grants Council of Hong Kong Government
7.
Member
of Evaluation Panel, Singapore-Israel Industrial R&D Foundation
8.
Member
of Organizing Committee, The Agency for Science, Technology & Research
(A*STAR) and Singapore National Academy of Science (SNAS) Young Scientist
Awards 2013, 2014
9.
Chair,
Infocomms, Media & Computing Cluster Thematic Oversight Committee, Science
and Engineering Research Council, Singapore 2013-2014
10.
Member
of Program Committee: INTERSPEECH, ICASSP, ASRU, ACL, EMNLP, IJCNLP, APSIPA
ASC, PACLIC, IWSLT, IWSDS, NEWS, Oriental COCOSDA, AIRS, SIGHAN, ODYSSEY, SLTU,
ICPR, Speech Prosody
1.
Seeing
to Hear Better, The 25th Conference of the Oriental COCOSDA, Hanoi,
Vietnam, 24-26 November, 2022
2.
Recent
Advances in Selective Auditory Attention, The 2020 IEEE Symposium Series on
Computational Intelligence (IEEE SSCI), Canberra, Australia, 1-4 December,
2020.
3.
Speech Processing at Cocktail Party, The 15th IEEE Conference on
Industrial Electronics and Applications (ICIEA 2020), Kristiansand, Norway, 9-13
November 2020
4.
The
Story of Artificial Intelligence, The 2nd International Conference on
Intelligent Autonomous Systems, 28 February - 2 March, 2019
5.
Exemplar-based
Sparse Representation for Voice Conversion, the 119th audio, speech
information processing symposium, 21 December 2017, Tokyo
6.
Whither
Speech Recognition? Alibaba Technology Forum, Learning from the Deep World, 18
September 2017, Singapore
7.
Recent
Advances in Singing Synthesis, 5th International Conference on Statistical
Language and Speech Processing, 23-25 October 2017, Le Mans, France
8.
Speech
Synthesis Perfects Everyone's Singing, International Conference on Orange
Technologies, 17-20 December 2016, Melbourne, Australia
9.
Mandarin
Chinese spoken by speakers of European origin, The Fifth Conference on Natural
Language Processing and Chinese Computing & The Twenty Fourth International
Conference on Computer Processing of Oriental Languages
(NLPCC-ICCPOL 2016), December 2-6, 2016, Kunming, China
10.
iCALL
Mandarin Corpus, Oriental COCOSDA (International Committee for
Coordination and Standardization of Speech Databases and Assessment
Techniques), 26-28 October 2016, Bali, Indonesia
11.
Voice
conversion and spoofing countermeasures for speaker verification, Odyssey
2016, The Speaker
and Language Recognition Workshop, June 21-24, Bilbao, Spain
1.
Member,
Association for Computing Machinery (ACM)
2.
Fellow,
Institute of Electrical and Electronics Engineers (IEEE)
3.
Fellow,
International Speech Communication Association (ISCA)
4.
Member,
Association for Computational Linguistics (ACL)
5.
Member,
Asia Pacific Signal and Information Processing Association (APSIPA)
6.
President,
Chinese and Oriental Languages Information Processing Society (COLIPS)
1.
Tao
Luo, Weng-Fai Wong, Rick Siow Mong Goh, Anh Tuan Do, Zhixian Chen, Haizhou Li,
Wenyu Jiang, Weiyun Yau, Achieving Green AI with Energy-Efficient Deep Learning
Using Neuromorphic Computing. Commun. ACM 66(7): 52-57 (2023)
2.
Tingting
Wang, Zexu Pan, Meng Ge, Zhen Yang, Haizhou Li, Time-Domain Speech Separation
Networks With Graph Encoding Auxiliary. IEEE Signal Process. Lett. 30: 110-114
(2023)
3.
Yi
Zhou, Zhizheng Wu, Mingyang Zhang, Xiaohai Tian, Haizhou Li, TTS-Guided
Training for Accent Conversion Without Parallel Data. IEEE Signal Process.
Lett. 30: 533-537 (2023)
4.
Mingyang
Zhang, Xuehao Zhou, Zhizheng Wu, Haizhou Li, Towards Zero-Shot Multi-Speaker
Multi-Accent Text-to-Speech Synthesis. IEEE Signal Process. Lett. 30: 947-951
(2023)
5.
Kun
Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li, Emotion
Intensity and its Control for Emotional Voice Conversion. IEEE Trans. Affect.
Comput. 14(1): 31-48 (2023)
6.
Hui
Tian, Yiqin Qiu, Wojciech Mazurczyk, Haizhou Li, Zhenxing Qian, STFF-SM:
Steganalysis Model Based on Spatial and Temporal Feature Fusion for Speech
Streams. IEEE ACM Trans. Audio Speech Lang. Process. 31: 277-289 (2023)
7.
Qiquan
Zhang, Xinyuan Qian, Zhaoheng Ni, Aaron Nicolson, Eliathamby Ambikairajah,
Haizhou Li, A Time-Frequency Attention Module for Neural Speech Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process. 31: 462-475 (2023)
8.
Xinyuan
Qian, Zhengdong Wang, Jiadong Wang, Guohui Guan, Haizhou Li, Audio-Visual
Cross-Attention Network for Robotic Speaker Tracking. IEEE ACM Trans. Audio
Speech Lang. Process. 31: 550-562 (2023)
9.
Chen
Zhang, Luis Fernando D'Haro, Qiquan Zhang, Thomas Friedrichs, Haizhou Li, PoE:
A Panel of Experts for Generalized Automatic Dialogue Assessment. IEEE ACM
Trans. Audio Speech Lang. Process. 31: 1234-1250 (2023)
10.
Ruijie
Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li,
Self-Supervised Training of Speaker Encoder With Multi-Modal Diverse Positive
Pairs. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1706-1719 (2023)
11.
Yi
Zhou, Zhizheng Wu, Xiaohai Tian, Haizhou Li, Optimization of Cross-Lingual
Voice Conversion With Linguistics Losses to Reduce Foreign Accents. IEEE ACM
Trans. Audio Speech Lang. Process. 31: 1916-1926 (2023)
12.
Xiaoxue
Gao, Chitralekha Gupta, Haizhou Li, PoLyScriber: Integrated Fine-Tuning of
Extractor and Lyrics Transcriber for Polyphonic Music. IEEE ACM Trans. Audio
Speech Lang. Process. 31: 1968-1981 (2023)
13.
Zhenyu
Weng, Huiping Zhuang, Haizhou Li, Balakrishnan Ramalingam, Rajesh Elara Mohan,
Zhiping Lin, Online Multi-Face Tracking With Multi-Modality Cascaded Matching.
IEEE Trans. Circuits Syst. Video Technol. 33(6): 2738-2752 (2023)
14.
Yiqin
Qiu, Hui Tian, Haizhou Li, Chin-Chen Chang, Athanasios V. Vasilakos, Separable
Convolution Network With Dual-Stream Pyramid Enhanced Strategy for Speech
Steganalysis. IEEE Trans. Inf. Forensics Secur. 18: 2737-2750 (2023)
15.
Jibin
Wu, Yansong Chua, Malu Zhang, Guoqi Li, Haizhou Li, Kay Chen Tan, A Tandem
Learning Rule for Effective Training and Rapid Inference of Deep Spiking Neural
Networks. IEEE Trans. Neural Networks Learn. Syst. 34(1): 446-460 (2023)
16.
Xianghu
Yue, Jingru Lin, Fabian Ritter Gutierrez, Haizhou Li, Self-Supervised Learning
With Segmental Masking for Speech Representation. IEEE J. Sel. Top. Signal
Process. 16(6): 1367-1379 (2022)
17.
Hongqiang
Du, Lei Xie, Haizhou Li, Noise-robust voice conversion with domain adversarial
training. Neural Networks 148: 74-84 (2022)
18.
Jibin
Wu, Chenglin Xu, Xiao Han, Daquan Zhou, Malu Zhang, Haizhou Li, Kay Chen Tan,
Progressive Tandem Learning for Pattern Recognition With Deep Spiking Neural
Networks. IEEE Trans. Pattern Anal. Mach. Intell. 44(11): 7824-7840 (2022)
19.
Kun
Zhou, Berrak Sisman, Rui Liu, Haizhou Li, Emotional voice conversion: Theory,
databases and ESD. Speech Commun. 137: 1-18 (2022)
20.
Hongning
Zhu, Kong Aik Lee, Haizhou Li, Discriminative speaker embedding with serialized
multi-layer multi-head attention. Speech Commun. 144: 89-100 (2022)
21.
Tianchi
Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li, Neural Acoustic-Phonetic
Approach for Speaker Verification With Phonetic Attention Mask. IEEE Signal
Process. Lett. 29: 782-786 (2022)
22.
Zexu
Pan, Xinyuan Qian, Haizhou Li, Speaker Extraction With Co-Speech Gestures Cue.
IEEE Signal Process. Lett. 29: 1467-1471 (2022)
23.
Haizhou
Li, A Unique ICASSP 2022: During an Unusual Time [Conference Highlights]. IEEE
Signal Process. Mag. 39(2): 159-160 (2022)
24.
Zexu
Pan, Ruijie Tao, Chenglin Xu, Haizhou Li, Selective Listening by Synchronizing
Speech With Lips. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1650-1664
(2022)
25.
Rui
Liu, Berrak Sisman, Guanglai Gao, Haizhou Li, Decoding Knowledge Transfer for
Neural Text-to-Speech Training. IEEE ACM Trans. Audio Speech Lang. Process. 30:
1789-1802 (2022)
26.
Xiaoxue
Gao, Chitralekha Gupta, Haizhou Li, Automatic Lyrics Transcription of
Polyphonic Music With Lyrics-Chord Multi-Task Learning. IEEE ACM Trans. Audio
Speech Lang. Process. 30: 2280-2294 (2022)
27.
Chitralekha
Gupta, Haizhou Li, Masataka Goto, Deep Learning Approaches in Topics of Singing
Information Processing. IEEE ACM Trans. Audio Speech Lang. Process. 30:
2422-2451 (2022)
28.
Zexu
Pan, Meng Ge, Haizhou Li, USEV: Universal Speaker Extraction With Visual Cue.
IEEE ACM Trans. Audio Speech Lang. Process. 30: 3032-3045 (2022)
29.
Enze
Su, Siqi Cai, Longhan Xie, Haizhou Li, Tanja Schultz, STAnet: A Spatiotemporal
Attention Network for Decoding Auditory Spatial Attention From EEG. IEEE Trans.
Biomed. Eng. 69(7): 2233-2242 (2022)
30.
Siqi
Cai, Enze Su, Longhan Xie, Haizhou Li, EEG-Based Auditory Attention Detection
via Frequency and Channel Neural Attention. IEEE Trans. Hum. Mach. Syst. 52(2):
256-266 (2022)
31.
Malu
Zhang, Jiadong Wang, Jibin Wu, Ammar Belatreche, Burin Amornpaisannon, Zhixuan
Zhang, Venkata Pavan Kumar Miriyala, Hong Qu, Yansong Chua, Trevor E. Carlson,
Haizhou Li, Rectified Linear Postsynaptic Potential Function for
Backpropagation in Deep Spiking Neural Networks. IEEE Trans. Neural Networks
Learn. Syst. 33(5): 1947-1958 (2022)
32.
Jibin
Wu, Qi Liu, Malu Zhang, Zihan Pan, Haizhou Li, Kay Chen Tan, HuRAI: A
brain-inspired computational model for human-robot auditory interface.
Neurocomputing 465: 103-113 (2021)
33.
Rui
Liu, Berrak Sisman, Yixing Lin, Haizhou Li, FastTalker: A neural text-to-speech
architecture with shallow and group autoregression. Neural Networks 141:
306-314 (2021)
34.
Hongqiang
Du, Xiaohai Tian, Lei Xie, Haizhou Li, Factorized WaveNet for voice conversion
with limited data. Speech Commun. 130: 45-54 (2021)
35.
Tharshini
Gunendradasan, Eliathamby Ambikairajah, Julien Epps, Vidhyasaharan Sethu,
Haizhou Li, An adaptive transmission line cochlear model based front-end for
replay attack detection. Speech Commun. 132: 114-122 (2021)
36.
Bidisha
Sharma, Xiaoxue Gao, Karthika Vijayan, Xiaohai Tian, Haizhou Li, NHSS: A speech
and singing parallel database. Speech Commun. 133: 9-22 (2021)
37.
Xinyuan
Qian, Qi Liu, Jiadong Wang, Haizhou Li, Three-Dimensional Speaker Localization:
Audio-Refined Visual Scaling Factor Estimation. IEEE Signal Process. Lett. 28:
1405-1409 (2021)
38.
Rui
Liu, Berrak Sisman, Feilong Bao, Jichen Yang, Guanglai Gao, Haizhou Li,
Exploiting Morphological and Phonological Features to Improve Prosodic Phrasing
for Mongolian Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 29:
274-285 (2021)
39.
Mingyang
Zhang, Yi Zhou, Li Zhao, Haizhou Li, Transfer Learning From Speech Synthesis to
Voice Conversion With Non-Parallel Training Data. IEEE ACM Trans. Audio Speech
Lang. Process. 29: 1290-1302 (2021)
40.
Rui
Liu, Berrak Sisman, Guanglai Gao, Haizhou Li, Expressive TTS Training With
Frame and Style Reconstruction Loss. IEEE ACM Trans. Audio Speech Lang.
Process. 29: 1806-1818 (2021)
41.
Yi
Zhou, Xiaohai Tian, Haizhou Li, Language Agnostic Speaker Embedding for
Cross-Lingual Personalized Speech Generation. IEEE ACM Trans. Audio Speech
Lang. Process. 29: 3427-3439 (2021)
42.
Chen Zhang, Grandee
Lee, Luis
Fernando D'Haro,
Haizhou Li, D-Score: Holistic Dialogue Evaluation Without Reference. IEEE ACM
Trans. Audio Speech Lang. Process. 29: 2502-2516 (2021)
43.
Zihan Pan, Malu
Zhang, Jibin
Wu, Jiadong
Wang, Haizhou Li, Multi-Tone
Phase Coding of Interaural Time Difference for Sound Source Localization With
Spiking Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 29:
2656-2670 (2021)
44.
Chenglin Xu, Wei Rao, Jibin Wu, Haizhou Li, Target Speaker Verification with
Selective Auditory Attention for Single and Multi-Talker Speech. IEEE ACM Trans. Audio Speech Lang. Process.
29: 2696-2709 (2021)
45.
Berrak
Sisman, Junichi Yamagishi, Simon King, and Haizhou Li, An Overview of Voice
Conversion and its Challenges: From Statistical Modeling to Deep Learning, IEEE/ACM
Transactions on Audio, Speech, and Language Processing, vol. 29, pp. 132-157,
2021, doi:
10.1109/TASLP.2020.3038524
46.
Rui
Liu, Berrak Sisman, Feilong Bao, Jichen Yang, Guanglai Gao and Haizhou Li,
Exploiting morphological and phonological features to improve prosodic phrasing
for Mongolian speech synthesis, IEEE/ACM Transactions on Audio, Speech, and
Language Processing, 2020, doi: 10.1109/TASLP.2020.3040523
47.
Rui
Liu, Berrak Sisman, Feilong Bao, Guanglai Gao and Haizhou Li, Modeling Prosodic
Phrasing with Multi-Task Learning in Tacotron-based TTS, IEEE Signal Processing
Letters, vol. 27, pp. 1470-1474, 2020
48.
Yi
Zhou, Xiaohai Tian and Haizhou Li, Multi-Task WaveRNN with an Integrated
Architecture for Cross-lingual Voice Conversion, IEEE Signal Processing
Letters, vol. 27, pp. 1310-1314, 2020
49.
Changhuai
You and Jichen Yang, Device Feature Extraction Based on Parallel Neural network
training for replay spoofing detection, IEEE/ACM Transactions on Audio, Speech
and Language Processing, vol. 28, pp 2308-2318, 2020
50.
Mingyang
Zhang, Berrak Sisman, Li Zhao and Haizhou Li, DeepConversion: Voice conversion
with limited parallel training data, Speech Communication, vol. 122, pp. 31-43,
2020
51.
Chenglin
Xu, Wei Rao, Eng Siong Chng and Haizhou Li, SpEx: Multi-Scale Time Domain
Speaker Extraction Network, IEEE/ACM Transaction on Audio, Speech, and Language
Processing, vol. 28, pp. 1370-1384, 2020
52.
Malu
Zhang, Xiaoling Luo, Jibin Wu, Yi Chen, Ammar Belatreche, Zihan Pan, Hong Qu,
and Haizhou Li, An Efficient Threshold-Driven Aggregate-Label Learning
Algorithm for Multimodal Information Processing, IEEE Journal of Selected
Topics in Signal Processing, 14(3), pp. 592-602, March 2020, doi:
10.1109/JSTSP.2020.2983547
53.
Malu
Zhang, Jibin Wu, Ammar Belatreche, Zihan Pan, Xiurui Xie, Yansong Chua, Guoqi
Li, Hong Qu and Haizhou Li, Supervised Learning in Spiking Neural Networks with
Synaptic Delay-Weight Plasticity, Neurocomputing, vol. 409, pp. 103-118, October
2020
54.
Jibin
Wu, Emre Yılmaz, Malu Zhang, Haizhou Li and Kay Chen Tan, Deep Spiking Neural
Networks for Large Vocabulary Automatic Speech Recognition, Frontiers in
Neuroscience, 14(199), March 2020
55.
Zihan
Pan, Yansong Chua, Jibin Wu, Malu Zhang, Haizhou Li and Eliathamby
Ambikairajah, An Efficient and Perceptually Motivated Auditory Neural Encoding
and Decoding Algorithm for Spiking Neural Networks, Frontiers in Neuroscience,
13(1420), January 2020
56.
Jichen
Yang, Rohan Kumar Das and Haizhou Li, Significance of Subband Features for
Synthetic Speech Detection, IEEE Transactions on Information Forensics and
Security, vol. 15, pp. 2160-2170, 2020, doi: 10.1109/TIFS.2019.2956589
57.
Chitralekha
Gupta, Haizhou Li and Ye Wang, Automatic Leaderboard: Evaluation of Singing
Quality Without a Standard Reference, IEEE/ACM Transactions on Audio, Speech,
and Language Processing, vol. 28, pp. 13-26, 2020, doi:
10.1109/TASLP.2019.2947737
58.
Qiang
Yu, Haizhou Li, Kay Chen Tan, Spike Timing or Rate? Neurons Learn to Make
Decisions for Both Through Threshold-Driven Plasticity, IEEE Trans. Cybernetics
49(6): 2178-2189, 2019
59.
Berrak
Sisman, Mingyang Zhang, Haizhou Li, Group Sparse Representation with WaveNet
Vocoder Adaptation for Spectrum and Prosody Conversion, IEEE/ACM Transactions
on Audio, Speech, and Language Processing, IEEE/ACM Trans. Audio, Speech &
Language Processing 27(6): 1085-1097 (2019)
60.
Karthika
Vijayan, Haizhou Li, Tomoki Toda, Speech-to-Singing Voice Conversion: The
Challenges and Strategies for Improving Vocal Conversion Processes, IEEE Signal
Processing Magazine. 36(1): 95-102, 2019
61.
Luis
Fernando D'Haro, Rafael E. Banchs, Chiori Hori, Haizhou Li: Automatic
evaluation of end-to-end dialog systems with adequacy-fluency metrics, Computer
Speech & Language 55: 200-215, 2019
62.
Chong
Zhang, Kay Chen Tan, Haizhou Li, Geok Soon Hong, A Cost-Sensitive Deep Belief
Network for Imbalanced Classification, IEEE Transactions on Neural Networks and
Learning Systems. 30(1): 109-122, 2019
63.
Van
Tung Pham, Haihua Xu, Xiong Xiao, Nancy F. Chen, Eng Siong Chng, Haizhou Li,
Re-ranking spoken term detection with acoustic exemplars of keywords. Speech
Communication 104: 12-23, 2018
64.
Longting
Xu, Kong-Aik Lee, Haizhou Li, Zhen Yang, Generalizing I-Vector Estimation for
Rapid Speaker Recognition. IEEE/ACM Trans. Audio, Speech & Language Processing
26(4): 749-759, 2018
65.
Saad
Irtza, Vidhyasaharan Sethu, Eliathamby Ambikairajah, Haizhou Li, Using language
cluster models in hierarchical language identification. Speech Communication
100: 30-40, 2018
66.
Kaavya
Sriskandaraja, Vidhyasaharan Sethu, Eliathamby Ambikairajah, Haizhou Li,
Front-End for Antispoofing Countermeasures in Speaker Verification: Scattering
Spectral Decomposition, IEEE Journal of Selected Topics in Signal Processing
11(4): 632-643, 2017
67.
Hongjie Chen,
Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, Multitask Feature Learning for
Low-Resource Query-by-Example Spoken Term Detection, IEEE Journal of Selected
Topics in Signal Processing 11(8): 1329-1339, 2017
68.
Xiaohai Tian,
Siu Wa Lee, Zhizheng Wu, Eng Siong Chng, Haizhou Li, An Exemplar-Based Approach
to Frequency Warping for Voice Conversion, IEEE/ACM Trans. Audio, Speech &
Language Processing 25(10): 1863-1876, 2017
69.
Hongjie Chen,
Lei Xie, Cheung-Chi Leung, Xiaoming Lu, Bin Ma, Haizhou Li, Modeling Latent
Topics and Temporal Distance for Story Segmentation of Broadcast News, IEEE/ACM
Trans. Audio, Speech & Language Processing 25(1): 108-119, 2017
70.
Jun Hu, Huajin
Tang, Kay Chen Tan, Haizhou Li, How the Brain Formulates Memory: A
Spatio-Temporal Model, IEEE Computational Intelligence Magazine, 11(2): 56-68,
2016
71.
Xiong Xiao,
Shengkui Zhao, Duc Hoang Ha Nguyen, Xionghu Zhong, Douglas L. Jones, Eng Siong
Chng, Haizhou Li, Speech dereverberation for enhancement and recognition using
dynamic features constrained deep neural networks and feature adaptation,
EURASIP Journal Adv. Sig. Proc. 2016: 4, 2016
72.
Zhizheng Wu,
Haizhou Li, On the study of replay and voice conversion attacks to
text-dependent speaker verification, Multimedia Tools Appl. 75(9): 5311-5327,
2016
73.
Nancy F. Chen,
Darren Wee, Rong Tong, Bin Ma, Haizhou Li, Large-scale characterization of
non-native Mandarin Chinese spoken by speakers of European origin: Analysis on
iCALL. Speech Communication 84: 46-56, 2016
74.
Sven Ewan
Shepstone, Kong-Aik Lee, Haizhou Li, Zheng-Hua Tan, Soren Holdt Jensen, Total
Variability Modeling Using Source-Specific Priors. IEEE/ACM Trans. Audio,
Speech & Language Processing 24(3): 504-517, 2016
75.
Duc Hoang Ha
Nguyen, Xiong Xiao, Eng Siong Chng, Haizhou Li, Feature Adaptation Using Linear
Spectro-Temporal Transform for Robust Speech Recognition. IEEE/ACM Trans.
Audio, Speech & Language Processing 24(6): 1006-1019, 2016
76.
Qiang Yu, Rui
Yan, Huajin Tang, Kay Chen Tan, Haizhou Li, A Spiking Neural Network System for
Robust Sequence Recognition, IEEE Transactions on Neural Networks and Learning
Systems, 27(3): 621-635, 2016, doi: 10.1109/TNNLS.2015.2416771
77.
Yuma Ueda,
Longbiao Wang, Atsuhiko Kai, Xiong Xiao, Eng Siong Chng, Haizhou Li,
Single-channel Dereverberation for Distant-Talking Speech Recognition by
Combining Denoising Autoencoder and Temporal Structure Normalization. Signal
Processing Systems 82(2): 151-161, 2016
78.
Liping Chen,
Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li-Rong Dai, Exploration of Local
Variability in Text-Independent Speaker Verification, Signal Processing Systems
82(2): 217-228, 2016
79.
Dau-Cheng Lyu,
Tien Ping Tan, Eng Siong Chng, Haizhou Li: Mandarin-English code-switching
speech corpus in South-East Asia: SEAME. Language Resources and Evaluation
49(3): 581-600, 2015
80.
Chang Huai You,
Haizhou Li, and Kong-Aik Lee, Relevance factor of maximum a posteriori
adaptation for GMM-NAP-SVM in speaker and language recognition, Computer Speech
and Language, vol.30, no.1, pp.116-134, 2015
81.
Van Hai Do,
Xiong Xiao, Eng Siong Chng, and Haizhou Li, Context-dependent Phone Mapping for
Acoustic Modeling of Under-resourced Languages, International Journal of Asian
Language Processing, vol.23, no.1, pp.21-33, 2015
82.
Haipeng Wang,
Tan Lee, Cheung-Chi Leung, Bin Ma, and Haizhou Li, Acoustic Segment Modeling
with Spectral Clustering Methods, IEEE/ACM Transactions on Audio, Speech and
Language Processing, vol.23, no.2, pp.264-277, 2015
83.
Rafael E.
Banchs, Luis F. D'Haro, and Haizhou Li, Adequacy-Fluency Metrics: Evaluating MT
in the Continuous Space Model Framework, IEEE/ACM Transactions on Audio, Speech
and Language Processing, vol.23, no.3, pp.472-482, 2015
84.
Tze Yuang Chong,
Rafael E. Banchs, Eng Siong Chng, Haizhou Li, Decoupling Word-Pair Distance and
Co-occurrence Information for Effective Long History Context Language Modeling,
IEEE/ACM Transactions on Audio, Speech and Language Processing, 23(7):
1221-1232, 2015
85.
Haizhou Li,
Inaugural editorial: Embracing Opportunities for Growth, IEEE/ACM Transactions
on Audio, Speech and Language Processing, 23(1): 5-6, 2015
86.
Jonathan William
Dennis, Tran Huy Dat, Haizhou Li: Generalized Hough Transform for Speech
Pattern Classification. IEEE/ACM Transactions on Audio, Speech & Language
Processing 23(11): 1963-1972, 2015
87.
Zhizheng Wu,
Nicholas Evans, Tomi Kinnunen, Junichi Yamagishi, Federico Alegre, Haizhou Li,
Spoofing and countermeasures for speaker verification: a survey, Speech
Communication, vol.66, Pages 130-153, 2015
88.
Van Hai Do,
Xiong Xiao, Engsiong Chng, Haizhou Li, Cross-Lingual Phone Mapping for Large
Vocabulary Speech Recognition of Under-Resourced Languages, IEICE Transactions
97-D(2): 285-295, 2014
89.
Miaolong Yuan,
Huajin Tang, Haizhou Li, Real-Time Keypoint Recognition Using Restricted
Boltzmann Machine, IEEE Trans. Neural Netw. Learning Syst. 25(11): 2119-2126,
2014
90.
Zhizheng Wu,
Haizhou Li, Voice conversion versus speaker verification: an overview, APSIPA
Transactions on Signal and Information Processing, vol.3, e17
doi:10.1017/ATSIP.2014.17, 2014
91.
Zhizheng Wu, Eng
Siong Chng, Haizhou Li, Exemplar-based voice conversion using joint nonnegative
matrix factorization, Multimedia Tools and Applications, Springer, 2014
92.
Zhizheng Wu,
Tuomas Virtanen, Eng Siong Chng, Haizhou Li, Exemplar-based sparse
representation with residual compensation for voice conversion, IEEE/ACM
Transactions on Audio, Speech and Language Processing, vol. 22, No. 10, pp.
1506-1521, 2014
93.
Anthony Larcher,
Kong Aik Lee, Bin Ma, Haizhou Li, Text-dependent speaker verification:
Classifiers, databases and RSR2015, Speech Communication, vol. 60, May 2014, pp.
56-77
94.
Qiang Yu, Huajin
Tang, Kay Chen Tan, and Haizhou Li, Precise-Spike-Driven Synaptic Plasticity:
Learning Hetero-Association of Spatiotemporal Spike Patterns, PLoS ONE, 8(11):
e78318, 2013, doi: 10.1371/journal.pone.0078318
95.
Qiang Yu, Huajin
Tang, Kay Chen Tan, Haizhou Li: Rapid Feedforward Computation by Temporal
Encoding and Learning With Spiking Neurons. IEEE Trans. Neural Networks
Learning System, 24(10): 1539-1552, 2013
96.
S. J. Wright, D.
Kanevsky, L. Deng, X. He, G. Heigold, and H. Li, Optimization Algorithm and
Applications for Speech and Language Processing, IEEE Transactions on Audio,
Speech and Language Processing, 21(11):2231-2243, 2013
97.
Haipeng Wang,
Cheung-Chi Leung, Tan Lee, Bin Ma, Haizhou Li, Shifted-Delta MLP Features for
Spoken Language Recognition. IEEE Signal Process. Lett. 20(1): 15-18, 2013
98.
Jun Hu, Huajin
Tang, Kay Chen Tan, Haizhou Li and Luping Shi, A Spike-Timing Based Integrated
Model for Pattern Recognition. Neural Computation, vol. 25, no. 2, pp. 450-472,
2013
99.
Raymond W. M.
Ng, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li: Spoken Language Recognition
with Prosodic Features. IEEE Transactions on Audio, Speech & Language
Processing, 21(9): 1841-1853, 2013
100. V. Hautamaki, T. Kinnunen, F. Sedlak, Kong Aik Lee, Bin Ma, and Haizhou
Li, Sparse Classifier Fusion for Speaker Verification, IEEE Transactions on
Audio, Speech and Language Processing, 21(8): 1622-1631, August 2013
101. Douglas D. O'Shaughnessy, Li Deng, Haizhou Li: Speech Information
Processing: Theory and Applications [Scanning the Issue]. Proceedings of the
IEEE vol. 101, No. 5 pp. 1034-1037, May 2013
102. Haizhou Li, Kong Aik Lee, and Bin Ma, Spoken Language Recognition: From
Fundamentals to Practice, Proceedings of the IEEE, vol. 101, No. 5, pp. 1136 –
1159, May 2013
103. Jiali Yu, Huajin Tang, Haizhou Li, Dynamics Analysis of a Population
Decoding Model, IEEE Transactions on Neural Networks and Learning Systems, vol.
24, No. 3, 2013
104. Jiali Yu, Huajin Tang, Haizhou Li, Luping Shi, Dynamical properties of
continuous attractor neural network with background tuning, Neurocomputing, vol.
99, pp. 439 - 447, 2013
105. Zhizheng Wu, Tomi Kinnunen, Eng Siong Chng, Haizhou Li, Mixture of
factor analyzers using priors from non-parallel speech for voice conversion,
IEEE Signal Processing Letters, 19(12), pp. 914-917, 2012
106. Omid Dehzangi, Bin Ma, Eng-Siong Chng and Haizhou Li, Discriminative
Feature Extraction for Speech Recognition Using Continuous Output Codes,
Pattern Recognition Letters, 33 (2012), pp. 1703-1709.
107. Liyuan Li, Shuicheng Yan, Xinguo Yu, Yeow Kee Tan, and Haizhou Li,
Robust Multiperson Detection and Tracking for Mobile Service and Social Robots,
IEEE Transactions on Systems, Man, and Cybernetics - PART B: CYBERNETICS, vol. 42,
No. 5, 2012
108. T. Kinnunen, R. Saeidi, F. Sedlak, Kong Aik Lee, J. Sandberg, M.
Hansson-Sandsten, Haizhou Li, Low-Variance Multitaper MFCC Features: a Case
Study in Robust Speaker Verification, IEEE Transactions on Audio, Speech and
Language Processing, 20(7): 1990-2001, September 2012
109. Andreea Niculescu, Betsy van Dijk, Anton Nijholt, Haizhou Li, See Swee
Lan: Making Social Robots More Attractive: The Effects of Voice Pitch, Humor
and Empathy. International Journal of Social Robotics 5(2): 171-191 (2013)
110. Wenliang Chen, Jun'ichi Kazama, Min Zhang, Yoshimasa Tsuruoka, Yujie
Zhang, Yiou Wang, Kentaro Torisawa, Haizhou Li, Bitext Dependency Parsing With
Auto-Generated Bilingual Treebank, IEEE Transactions on Audio, Speech and
Language Processing, 20(5): 1461-1472 (2012)
111. Xiaoxuan Wang, Lei Xie, Mimi Lu, Bin Ma, Engsiong Chng, Haizhou Li,
Broadcast News Story Segmentation Using Conditional Random Fields and
Multimodal Features. IEICE Transactions on Information and Systems, vol. E95-D,
No.5, pp.1206-1215, 2012
112. Yi Ren Leng, Tran Huy Dat, Norihide Kitaoka, Haizhou Li: Selective
Gammatone Envelope Feature for Robust Sound Event Recognition. IEICE
Transactions 95-D (5): 1229-1237, 2012
113. Rui Yan, Keng Peng Tee, Yuanwei Chua, Haizhou Li, Huajin Tang: Gesture
Recognition Based on Localist Attractor Networks with Application to Robot
Control, IEEE Computational Intelligence Magazine, vol. 7, No. 1, pp. 64-74,
2012
114. Jin-Shea Kuo, Haizhou Li: Learning regional transliteration variants,
Information Processing and Management, 48(1): 154-169, 2012
115. Tin Lay Nwe, Hanwu Sun, Bin Ma, Haizhou Li, Speaker Clustering and
Cluster Purification Methods for RT07 and RT09 Evaluation Meeting Data, IEEE
Transactions on Audio, Speech and Language Processing, vol 20, No. 2, pp
461-473, 2012
116. Haizhou Li , John-John Cabibihan, Yeow Kee Tan: Towards an Effective
Design of Social Robots, International Journal of Social Robotics, 3(4), pp.
333-335, November 2011
117. Sakriani Sakti, Michael Paul, Andrew Finch, Shinsuke Sakai, Thang Tat
Vu, Noriyuki Kimura, Chiori Hori, Eiichiro Sumita, Satoshi Nakamura, Jun Park,
Chai Wutiwiwatchai, Bo Xu, Hammam Riza, Karunesh Arora, Chi Mai Luong, Haizhou
Li, A-STAR: Toward Translating Asian Spoken Languages, Computer Speech and
Language, vol. 27, No. 2, pp. 509 - 527, 2013
118. Huajin Tang, Haizhou Li, Book Review: Information Theoretic Learning:
Renyi's Entropy and Kernel Perspectives, IEEE Computational Intelligence
Magazine, vol. 6, No. 3, August 2011
119. Eliathamby Ambikairajah, Haizhou Li, Liang Wang, Bo Yin, and
Vidhyasaharan Sethu, Language Identification: A Tutorial, IEEE Circuits and
Systems Magazine, vol. 11, No. 2, pp.82 - 108, 2011
120. Huajin Tang, Haizhou Li, and Zhang Yi, Online learning and
stimulus-driven responses of neurons in visual cortex, Cognitive Neurodynamics,
vol. 5, no. 1, pp. 77-85, 2011
121. Omid Dehzangi, Bin Ma, Eng-Siong Chng and Haizhou Li, Error Corrective
Fusion of Classifier Scores for Spoken Language, IEICE Transactions on
Information and Systems, Vol. E94-D, No.12, pp.2503-2512, 2011
122. Deyi Xiong, Min Zhang, Haizhou Li, A Maximum Entropy Segmentation Model
for Statistical Machine Translation, IEEE Transactions on Audio, Speech and
Language Processing, 19 (8), November 2011
123. Huy Dat Tran, Haizhou Li, Sound Event Recognition with Probabilistic
Distance SVMs, IEEE Transactions on Audio, Speech and Language Processing, vol.
19, No. 6, pp 1556 - 1568, 2011
124. Jonathan Dennis, Huy Dat Tran, Haizhou Li, Spectrogram Image Feature for
Sound Event Classification in Mismatched Conditions, in Signal Processing
Letters, vol. 18, No. 2, pp 130 - 133, February 2011
125. Haizhou Li, Ma Bin, TechWare: Speaker and Spoken Language Recognition
Resources, IEEE Signal Processing Magazine, vol. 27, No. 6, pp 139-142,
November 2010
126. Kong Aik Lee, Chang Huai You, Haizhou Li, Tomi Kinnunen, and Khe Chai
Sim, Using Discrete Probabilities with Bhattacharyya Measure for SVM-based
Speaker Verification, IEEE Transactions on Audio, Speech and Language
Processing, 19(4), pp.861 - 870, May 2011
127. Deyi Xiong, Min Zhang, Aiti Aw, Haizhou Li, Linguistically Annotated
Reordering Evaluation and Analysis, Computational Linguistics, vol. 36, No. 3,
pp 535-568, 2010
128. Donglai Zhu, Bin Ma, Haizhou Li, Speaker Verification with Feature-Space
MAPLR Parameters, IEEE Transactions on Audio, Speech and Language Processing, vol.
19, No. 3, pp 505-515, March 2011
129. Huajin Tang, Haizhou Li, Zhang Yi, A Discrete-Time Neural Network for
Optimization Problems with Hybrid Constraints, IEEE Transactions on Neural
Networks, vol. 21, no. 7, pp. 1184-1189, 2010
130. Namunu C. Maddage, Haizhou Li, Beat Space Segmentation and Octave Scale
Cepstral Feature for Sung Language Recognition in Pop Music, ACM Transactions
on Multimedia Computing, Communications and Applications (TOMCCAP), vol. 7
Issue 4, November 2011, Article No. 37
131. Lei Wang, Eng Siong Chng, Haizhou Li, A Tree-Construction Search
Approach for Multivariate Time Series Motifs Discovery, Pattern Recognition
Letters, vol. 31, No. 9, pp 869-875, 2010
132. Huajin Tang, Haizhou Li, and Rui Yan, Memory Dynamics in Attractor
Networks with Saliency Weights, Neural Computation, 22(7), pp. 1899-1926, July
2010
133. Chang Huai You, Kong Aik Lee, Haizhou Li, GMM-SVM Kernel with a
Bhattacharyya-Based Distance for Speaker Recognition, IEEE Transactions on
Audio, Speech and Language Processing, vol. 18, No. 6, pp1300-1312, 2010
134. Tomi Kinnunen, Haizhou Li, An Overview of Text-Independent Speaker Recognition:
from Features to Supervectors, Speech Communication 52 (1), 2010, pp. 12-40
(Speech Communication Most Cited Article 2007-2013)
135. Xiong Xiao, Jinyu Li, Eng Siong Chng, Haizhou Li, Chin-Hui Lee, A Study
on the Generalization Capability of Acoustic Models for Robust Speech
Recognition, IEEE Transactions on Audio, Speech and Language Processing, vol. 18,
No 6, pp1158-1169, 2010
136. Namunu C. Maddage, Khe Chai Sim, Haizhou Li, Word Level Automatic
Alignment of Music and Lyrics using Vocal Synthesis, ACM Transactions on
Multimedia Computing, Communications, and Applications (TOMCCAP), vol. 6, No.
3, 2010
137. Huy Dat Tran, Haizhou Li, Jump Function Kolmogorov for Audio
Classification in Noise-mismatch Conditions, IEEE Transactions on Signal
Processing, vol. 57, No 8, pp 2908-2918, 2009
138. Tee Kiah Chia, Khe Chai Sim, Haizhou Li and Hwee Tou Ng, Statistical
Lattice-Based Spoken Document Retrieval, ACM Transactions on Information
Systems, vol. 28, No. 1, 2010
139. Rong Tong, Bin Ma, Haizhou Li, and Eng Siong Chng, A Target-Oriented
Phonotactic Front-end for Spoken Language Recognition, IEEE Transactions on
Audio, Speech and Language Processing, vol. 17, No 7, pp.1335-1347, 2009
140. Donglai Zhu, Haizhou Li, Bin Ma, and Chin-Hui Lee, Optimizing the
Performance of Spoken Language Recognition with Discriminative Training, IEEE
Transactions on Audio, Speech and Language Processing, vol. 16, No. 8,
pp.1642-165, 2008
141. Chang Hui You, Kong-Aik Lee, and Haizhou Li, An SVM Kernel with
GMM-Supervector Based on the Bhattacharyya Distance for Speaker Recognition,
IEEE Signal Processing Letters, vol. 16, No. 1, pp.49-52, 2009
142. Xiong Xiao, Eng Siong Chng, Haizhou Li, Normalization of the Speech
Modulation Spectra for Robust Speech Recognition, IEEE Transactions on Audio,
Speech and Language Processing, vol. 16, No. 8, pp.1662-1674, 2008
143. Haizhou Li, Jin-Shea Kuo, Jian Su, Chih-Lung Lin, Mining Live
Transliterations using Incremental Learning Algorithms, International Journal
of Computer Processing Of Languages, vol. 21, No. 2, pp. 183-203, 2008
144. Khe Chia Sim and Haizhou Li, On Acoustic Diversification Front-end for
Spoken Language Identification, IEEE Transactions on Audio, Speech and Language
Processing, vol. 16, No. 5, pp.1029-1037, 2008
145. Bin Ma, Haizhou Li, and Rong Tong, Spoken Language Recognition with
Ensemble Classifiers, IEEE Transactions on Audio, Speech and Language
Processing, vol. 15, No. 7, 2007
146. Jin-shea Kuo, Haizhou Li, and Ying-Kuei Yang, Active Learning for
Constructing Transliteration Lexicons from the Web, Journal of the American
Society for Information Science and Technology, vol. 59, No. 1, 2008
147. Xiong Xiao, Eng Siong Chng, and Haizhou Li, Temporal structure
normalization of speech feature for robust speech recognition, IEEE Signal
Processing Letters, vol. 14, No. 7, 2007
148. Jin-Shea Kuo, Haizhou Li, Ying-Kuei Yang, A Phonetic Similarity Model
for Automatic Extraction of Transliteration Pairs, ACM Transactions on Asian
Language Information Processing, vol. 6, Issue 2, September, 2007
149. Tin Lay Nwe and Haizhou Li, Exploring Vibrato-Motivated Acoustic
Features for Singer Identification, IEEE Transactions on Audio, Speech and
Language Processing, vol. 15, No. 2, 2007
150. Haizhou Li, Bin Ma, and Chin-Hui Lee, A Vector Space Modeling Approach
to Spoken Language Identification, IEEE Transactions on Audio, Speech and
Language Processing, vol. 15, No. 1, 2007
1.
Haizhou
Li, Kar-Ann Toh, Liyuan Li, Advanced Topics in Biometrics, World Scientific,
2011
2.
Haizhou
Li, Bin Ma, and Chin-Hui Lee, Vector-based Spoken Language Classification, in
Springer Handbook of Speech Processing, Jacob Benesty, M. Mohan Sondhi, Arden
Huang (editors), Springer 2007
3.
Chin-Hui
Lee, Haizhou Li, Lin-shan Lee, Renhua Wang, and Qiang Huo (editors), Advances
in Chinese Spoken Language Processing, World Scientific, 2007
4.
Shuzhi
Sam Ge, Haizhou Li, John-John Cabibihan and Yeow Kee Tan (editors), Social
Robotics, Springer Lecture Notes in Artificial Intelligence 6414, 2010
5.
Qiang
Huo, Bin Ma, Eng Siong Chng, and Haizhou Li (editors), Chinese Spoken Language
Processing, Springer Lecture Notes in Artificial Intelligence 4274, 2006
6.
Yinglin
Yu, Haizhou Li, Neural Networks and Signal Analysis, South China University of
Technology Press, 1996
1.
CSC3020
Machine Learning (CUHK-SZ)
2.
EE2211
Introduction to Machine Learning (NUS)
3.
EE2012
Analytical Methods in Electrical and Computer Engineering (NUS)
4.
EE6733
Advanced Topics on Vision and Machine Learning (NUS)
1.
Kun ZHOU, Emotion Modeling for Speech
Generation, 2023
2.
Chen ZHANG, Self-Supervised Modeling for
Open-Domain Dialogue Evaluation, 2023
3.
Ruijie
TAO, Audio-Visual Active Speaker Detection and Recognition, 2023
4.
Xiaoxue GAO, Automatic Lyrics Transcription Of
Polyphonic Music, 2022
5.
Zihan CHEN, Adaptive Communication-efficient
Federated Learning On Real-world Data, 2022
6.
Yi ZHOU, Cross-Lingual Voice Conversion,
2021
7.
Grandee LEE, Cross-Lingual Language Modeling,
Methods and Applications, 2021
8.
Zihan
PAN, Neural Encoding of Auditory Signals in Spiking Neural Networks, 2020
9.
Jibin WU Auditory information processing
using spiking neural networks, 2020
10. Chenglin
XU,
Single channel multi-talker speech separation with deep learning, 2020
11. Paul
Yaozhu CHAN, The psychoacoustics and synthesis of singing harmony, 2020
12. Berrak
SISMAN, Machine learning for limited data voice conversion, 2020
13. Chitralekha
GUPTA, Comprehensive evaluation of singing quality, 2019
14. Nicole
MIRNIG, Essential of robot feedback: On developing a taxonomy for human-robot
interaction, 2019
15. Wenda
CHEN, Modeling phones, keywords, topics and intents in spoken
languages, 2019
16. Van
Tung PHAM, Robust spoken term detection using partial search and
re-scoring hypothesized detections techniques, 2018
17. Tze
Yuang CHONG, Exploiting long context using joint distance and
occurrence information for language modeling, 2018
18. Duc
Hoang Ha NGUYEN, Feature-based robust techniques for speech recognition,
2017
19. Chong
ZHANG, Computational intelligence in diagnostic and prognostic
applications, 2017
20. Van
Hai DO, Acoustic modeling for speech recognition under limited
training data conditions, 2015
21. Zhizheng WU,
Spectral mapping for voice conversion, 2015
22. Trung
Hieu NGUYEN, Speaker diarization in meetings domain, 2014
23. Lei
WANG, Audio pattern discovery and retrieval, 2012
24. Rong
TONG, Towards a high performance phonotactic features for spoken
language recognition, 2012
25. Omid DEHZANGHI,
Discriminative feature extraction for speech recognition using continuous
output codes, 2012
26. Xiong
XIAO, Robust speech features and acoustic models for speech
recognition, 2009
27. Tee
Kiah CHIA, Lattice-based statistical spoken document retrieval, 2009
28. Hendra
SETIAWAN, Reordering in statistical machine translation: a function
word, syntax-based approach, 2008