Human Language Technology

Journal Articles

2020

  • Jichen Yang, Rohan Kumar Das and Haizhou Li, “Significance of Subband Features for Synthetic Speech Detection”, IEEE Transactions on Information Forensics and Security, 15(1), December 2020, pp. 2160-2170.
  • Chitralekha Gupta, Haizhou Li and Ye Wang, “Automatic Leaderboard: Evaluation of Singing Quality Without a Standard Reference,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28(1), December 2020, pp. 13-26.

2019

  • Qiang Yu, Haizhou Li and Kay Chen Tan, “Spike Timing or Rate? Neurons Learn to Make Decisions for Both Through Threshold-Driven Plasticity”, IEEE Trans. Cybernetics, 49(6), June 2019, pp. 2178-2189.
  • Berrak Sisman, Mingyang Zhang and Haizhou Li, “Group Sparse Representation with WaveNet Vocoder Adaptation for Spectrum and Prosody Conversion”, IEEE/ACM Trans. Audio, Speech & Language Processing 27(6), June 2019, pp. 1085-1097.
  • Jichen Yang* and Rohan Kumar Das*, “Low Frequency Frame-wise Normalization over Constant-Q Transform for Playback Speech Detection”, in Digital Signal Processing, Elsevier, vol. 89, June 2019, pp. 30-39. (*equal contribution)
  • Emre Yılmaz, Vikramjit Mitra, Ganesh Sivaraman and Horacio Franco, “Articulatory and Bottleneck Features for Speaker-Independent ASR of Dysarthric Speech,” Computer Speech & Language, vol. 58, Nov. 2019, pp. 319-334.

  • Karthika Vijayan, Haizhou Li and Tomoki Toda, “Speech-to-Singing Voice Conversion: The Challenges and Strategies for Improving Vocal Conversion Processes,” IEEE Signal Processing Magazine, 36(1), January 2019, pp. 95-102.
  • Luis Fernando D’Haro, Rafael E. Banchs, Chiori Hori and Haizhou Li, “Automatic evaluation of end-to-end dialog systems with adequacy-fluency metrics,” Computer Speech & Language, vol. 55, March 2019, pp. 200-215.
  • Chong Zhang, Kay Chen Tan, Haizhou Li and Geok Soon Hong, “A Cost-Sensitive Deep Belief Network for Imbalanced Classification”, IEEE Transactions on Neural Networks and Learning Systems, 30(1), January 2019, pp. 109-122.
  • Maulik Madhavi and Hemant Patil, “Vocal Tract Length Normalization using a Gaussian mixture model framework for query-by-example spoken term detection,” Computer Speech & Language, vol. 58, November 2019, pp. 175-202.
  • Rohan Kumar Das, Sarfaraz Jelil and S. R. M. Prasanna, “Exploring Text- constraint Models and Source Information for Long-enrollment with Short-test Speaker Verification”, in Circuits, Systems and Signal Processing, Springer, vol. 38, Issue 4, April 2019, pp. 1175-1792.
  • Rohan Kumar Das and S. R. M. Prasanna, “Investigating Text-independent Speaker Verification Systems Under Varied Data Conditions”, in Circuits, Systems and Signal Processing, Springer, vol. 38, Issue 8,  August 2019, pp. 3778-3801.
  • Malu Zhang, Hong Qu, Ammar Belatreche, Yi Chen, and Zhang Yi, “A Highly Effective and Robust Membrane Potential-Driven Supervised Learning Method for Spiking Neurons,” IEEE Transactions on Neural Networks and Learning Systems, 30(1), January 2019, pp. 123-137.

2018

  • Chitralekha Gupta, Haizhou Li, and Ye Wang, “A Technical Framework for Automatic Perceptual Evaluation of Singing Quality”, APSIPA Transactions on Singnal and Information Processing, E10(7), September 2018, pp. 1-11.
  • Van Tung Pham, Haihua Xu, Xiong Xiao, Nancy F. Chen, Eng Siong Chng and Haizhou Li, “Re-ranking spoken term detection with acoustic exemplars of keywords”, Speech Communication, vol 104, November 2018, pp. 12-23.
  • L. Xu, Kong-Aik Lee, Haizhou Li and Zhen Yang, “Generalizing I-Vector Estimation for Rapid Speaker Recognition”, IEEE/ACM Trans. Audio, Speech & Language Processing, 26(4), April 2018, pp. 749-759.
  • Saad Irtza, Vidhyasaharan Sethu, Eliathamby Ambikairajah and Haizhou Li, “Using language cluster models in hierarchical language identification”, Speech Communication, 100, June 2018, pp. 30-40.
  • Chong Zhang, Kay Chen Tan, Haizhou Li, and Geok Soon Hong, “A cost-sensitive deep belief network for imbalanced classification,” IEEE Transactions on Neural Networks and Learning Systems, 30(1), January 2018, pp. 1-14.
  • Jibin Wu, Yansong Chua, Malu Zhang, Haizhou Li, and Kay Chen Tan, “A Spiking Neural Network Framework for Robust Sound Classification,” Frontiers in Neuroscience, 12, November 2018, p. 836.

2017

  • Kaavya Sriskandaraja, Vidhyasaharan Sethu, Eliathamby Ambikairajah and Haizhou Li, “Front-End for Antispoofing Countermeasures in Speaker Verification: Scattering Spectral Decomposition”, IEEE Journal of Selected Topics in Signal Processing, 11(4), June 2017, pp. 632-643.
  • Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma and Haizhou Li, “Multitask Feature Learning for Low-Resource Query-by-Example Spoken Term Detection”, IEEE Journal of Selected Topics in Signal Processing, 11(8), December 2017, pp. 1329-1339.
  • Xiaohai Tian, Siu Wa Lee, Zhizheng Wu, Eng Siong Chng and Haizhou Li, “An Exemplar-Based Approach to Frequency Warping for Voice Conversion, IEEE/ACM Trans. Audio, Speech & Language Processing”, 25(10), October 2017, pp. 1863-1876.
  • Hongjie Chen, Lei Xie, Cheung-Chi Leung, Xiaoming Lu, Bin Ma and Haizhou Li, “Modeling Latent Topics and Temporal Distance for Story Segmentation of Broadcast News”, IEEE/ACM Trans. Audio, Speech & Language Processing, 25(1), January 2017, pp. 112-123.
  • Kaavya Sriskandaraja, Vidhyasaharan Sethu, Eliathamby Ambikairajah and Haizhou Li, “Front-End for Antispoofing Countermeasures in Speaker Verification: Scattering Spectral Decomposition”, IEEE Journal of Selected Topics in Signal Processing, 11(4), June 2017, pp. 632-643.
  • Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma and Haizhou Li, “Multitask Feature Learning for Low-Resource Query-by-Example Spoken Term Detection”, IEEE Journal of Selected Topics in Signal Processing, 11(8), December 2017, pp. 1329-1339.
  • Xiaohai Tian, Siu Wa Lee, Zhizheng Wu, Eng Siong Chng and Haizhou Li, “An Exemplar-Based Approach to Frequency Warping for Voice Conversion”, IEEE/ACM Trans. Audio, Speech & Language Processing 25(10), October 2017, pp. 1863-1876.

2016

  • Xiong Xiao, Shengkui Zhao, Duc Hoang Ha Nguyen, Xionghu Zhong, Douglas L. Jones, Eng Siong Chng and Haizhou Li, “Speech dereverberation for enhancement and recognition using dynamic features constrained deep neural networks and feature adaptation”, EURASIP J. Adv. Sig. Proc., January 2016.  
  • Zhizheng Wu and Haizhou Li, “On the study of replay and voice conversion attacks to text-dependent speaker verification”, Multimedia Tools Applications, 75(9), May 2016, pp. 5311-5327.
  • Nancy F. Chen, Darren Wee, Rong Tong, Bin Ma and Haizhou Li, “Large-scale characterization of non-native Mandarin Chinese spoken by speakers of European origin: Analysis on iCALL”, Speech Communication 84, November 2016, pp. 46-56.
  • Sven Ewan Shepstone, Kong-Aik Lee, Haizhou Li, Zheng-Hua Tan and Søren Holdt Jensen, “Total Variability Modeling Using Source-Specific Priors”, IEEE/ACM Trans. Audio, Speech & Language Processing 24(3), March 2016, pp. 504-517.
  • Duc Hoang Ha Nguyen, Xiong Xiao, Eng Siong Chng and Haizhou Li, “Feature Adaptation Using Linear Spectro-Temporal Transform for Robust Speech Recognition”, IEEE/ACM Trans. Audio, Speech & Language Processing, 24(6), June 2016, pp. 1006-1019.
  • Qiang Yu, Rui Yan, Huajin Tang, Kay Chen Tan and Haizhou Li, “A Spiking Neural Network System for Robust Sequence Recognition” IEEE Trans. Neural Networks and Learning Systems, 27(3), March 2016, pp. 621-635.
  • Yu, Rui Yan, Huajin Tang, Kay Chen Tan and Haizhou Li, “A Spiking Neural Network System for Robust Sequence Recognition”, IEEE Transactions on Neural Networks and Learning Systems, 27(3), March 2016, pp. 621-635.
  • Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li and Li-Rong Dai, “Exploration of Local Variability in Text-Independent Speaker Verification”, Journal of Signal Processing Systems, 82(2), February 2016, pp. 217-228.
  • Jun Hu, Huajin Tang, Kay Chen Tan and Haizhou Li, “How the Brain Formulates Memory: A Spatio-Temporal Model Research Frontier”, IEEE Computational Intelligence Magazine, 11(2), May 2016, pp. 56-68.

2015

  • Jonathan Dennis, Huy Dat Tran and Haizhou Li, “Generalized Hough Transform for Speech Pattern Classification”, IEEE/ACM Transactions on Audio, Speech and Language Processing, 23(11), November 2015, pp. 1963-1972.
  • Chang Huai You, Haizhou Li, and Kong-Aik Lee, “Relevance factor of maximum a posteriori adaptation for GMM-NAP-SVM in speaker and language recognition”, Journal of  Computer Speech and Language, 30(1), March 2015, pp. 116-134.
  • Dau-Cheng Lyu, Tien Ping Tan Eng Chng and Haizhou Li, “Mandarin-English code-switching speech corpus in South-East Asia”, Language Resources and Evaluation, 49(3), September 2015, pp. 581-600.
  • Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, and Haizhou Li, “Acoustic Segment Modeling with Spectral Clustering Methods”, IEEE/ACM Transactions on Audio, Speech and Language Processing, 23(2), February 2015, pp. 264-277.
  • Van Hai Do, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Context-dependent Phone Mapping for Acoustic Modeling of Under-resourced Languages”, International Journal of Asian Language Processing, 23(1), 2015, pp. 21-33.
  • Haizhou Li, Marcello Federico, Xiaodong He, Helen M. Meng, and Isabel Trancoso, “Introduction to the Special Section on Continuous Space and Related Methods in Natural Language Processing”, IEEE/ACM Transactions on Audio, Speech and Language Processing, 23(3),March 2015, pp. 427-430.
  • Tze Yuang Chong, Rafael E. Banchs, Eng Chng and Haizhou Li, “Decoupling Word-Pair Distance and Co-occurrence Information for Effective Long History Context Language Modeling”, IEEE/ACM Transactions on Audio, Speech and Language Processing, 23(7), July 2015, pp. 1221-1232.
  • Rafael E. Banchs, Luis F. D’Haro, and Haizhou Li, “Adequacy-Fluency Metrics: Evaluating MT in the Continuous Space Model Framework”, IEEE/ACM Transactions on Audio, Speech and Language Processing, 23(3), March 2015, pp. 472-482.
  • Zhizheng Wu, Nicholas Evans, Tomi Kinnunen, Junichi Yamagishi, Federico Alegre, and Haizhou Li, “Spoofing and countermeasures for speaker  verification: a survey”, Speech Communication, 66(c), February 2015, pp. 130-153.
  • Haizhou Li, Inaugural editorial: Embracing Opportunities for Growth, IEEE/ACM Transactions on Audio, Speech and Language Processing, 23(1), January 2015, pp. 5-6.
  • Zhizheng Wu, Eng Siong Chng, and Haizhou Li, “Exemplar-based voice conversion using joint nonnegative matrix factorization”, Multimedia Tools and Applications, Springer, 74(22), November 2015, pp. 9943-9958.

2014

  • Yuma Ueda, Longbiao Wang, Atsuhiko Kai, Xiong Xiao, Engsiong Chng and Haizhou Li, “Single-channel Dereverberation for Distant-Talking Speech Recognition by Combining Denoising Autoencoder and Temporal Structure Normalization”, The 9th International Symposium on Chinese Spoken Language Processing, Singapore, October 2014, pp. 379-383. 
  • Van Hai Do, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Cross-lingual phone mapping for large vocabulary speech recognition of under-resourced languages”, IEICE Transactions on Information and Systems, 97-D(2), February 2014, pp. 285-295.
  • Miaolong Yuan, Huajin Tang, and Haizhou Li, “Real-Time Keypoint Recognition Using Restricted Boltzmann Machine,” IEEE Transactions on Neural Networks and Learning Systems, 25(11), November 2014, pp. 2119-2126.
  • Zhizheng Wu and Haizhou Li, “Voice conversion versus speaker verification: an overview”, APSIPA Transactions on Signal and Information Processing, 3(e17), December 2014, pp. 1-16.
  • Zhizheng Wu, Tuomas Virtanen, Eng Siong Chng, and Haizhou Li, “Exemplar-based sparse representation with residual compensation for voice conversion”, IEEE/ACM Transactions on Audio, Speech and Language Processing, 22(10), October 2014, pp. 1506-1521.
  • Anthony Larcher, Kong Aik Lee, Bin Ma, and Haizhou Li, “Text-dependent speaker verification: Classifiers, databases and RSR2015”, Speech Communication, 60, May 2014, pp. 56-77.

2013

  • S. J. Wright, D. Kanevsky, Li Deng, Xiaodong He, G. Heigold, and Haizhou Li, “Optimization Algorithm and Applications for Speech and Language  Processing”, IEEE Transactions on Audio, Speech and Language Processing, 21(11), November 2013, pp. 2231-2243.
  • Raymond W. M. Ng, Tan Lee, Cheung-Chi Leung, Bin Ma, and Haizhou Li, “Spoken Language Recognition With Prosodic Features”, IEEE Transactions on Audio, Speech and Language Processing, 21(9), September 2013, pp. 1841-1853.
  • Ville Hautamäki, Tomi Kinnunen, Filip Sedlak, Kong Aik Lee, Bin Ma, and Haizhou Li, “Sparse Classifier Fusion for Speaker Verification”, IEEE Transactions on Audio, Speech and Language Processing, 21(8), August 2013, pp. 1622-1631.
  • Qiang Yu, Huajin Tang, Kay Chen Tan, and Haizhou Li, “Precise-Spike-Driven Synaptic Plasticity: Learning Hetero-Association of Spatiotemporal Spike Patterns”, PLoS ONE, 8(11), November 2013, pp. 1-16.
  • Haizhou Li, Kong Aik Lee, and Bin Ma, “Spoken Language Recognition: From Fundamentals to Practice”, Proceedings of the IEEE, 101(5), May 2013, pp. 1136-1159.
  • Douglas D. O’Shaughnessy, Li Deng, and Haizhou Li, “Speech Information Processing: Theory and Applications”, Proceedings of the IEEE, 101(5), May 2013, pp. 1034-1037.
  • Jiali Yu, Huajin Tang, and Haizhou Li, “Dynamics Analysis of a Population Decoding Model”, IEEE Transactions on Neural Networks and Learning Systems, 24(3), March 2013, pp. 498-503.
  • Qiang Yu, Huajin Tang, Kay Chen Tan, and Haizhou Li, “Rapid Feedforward Computation by Temporal Encoding and Learning With Spiking Neurons”, IEEE Transactions on Neural Networks and Learning Systems, 24(10), October 2013, pp. 1539-1552.
  • Haipeng Wang, Cheung-Chi Leung, Tan Lee, Bin Ma, and Haizhou Li, “Shifted-Delta MLP Features for Spoken Language Recognition”, IEEE Signal  Processing Letters, 20(1), January 2013, pp. 15-18.
  • Andreea Niculescu, Betsy van Dijk, Anton Nijholt, Haizhou Li, and See Swee Lan, “Making Social Robots More Attractive: The Effects of Voice Pitch, Humor and Empathy”, International Journal of Social Robotics, 5(2), April 2013, pp. 171-191.
  • Jiali Yu, Huajin Tang, and Haizhou Li, “Continuous attractors of discrete-time recurrent neural networks”, Neural Computing and Applications, 23(1), July 2013, pp. 89-96.
  • Jiali Yu, Huajin Tang, Haizhou Li, and Luping Shi, “Dynamical properties of continuous attractor neural network with background tuning”, Neurocomputing, 99(1), January 2013, pp. 439-447.
  • Jun Hu, Huajin Tang, Kay Chen Tan, Haizhou Li, and Luping Shi, “A Spike-Timing-Based Integrated Model for Pattern Recognition”, Neural Computation, 25(2), February 2013, pp. 450-472.
  • Sakriani Sakti, Michael Paul, Andrew Finch, Shinsuke Sakai, Thang Tat Vu, Noriyuki Kimura, Chiori Hori, Eiichiro Sumita, Satoshi Nakamura, Jun Park, Chai Wutiwiwatchai, Bo Xu, Hammam Riza, Karunesh Arora, Chi Mai Luong, and Haizhou Li, “A-STAR: Toward Translating Asian Spoken Languages”, Computer  Speech and Language, 27(2), February 2013, pp. 509-527.

2012

  • Zhizheng Wu, Tomi Kinnunen, Eng Siong Chng, and Haizhou Li, “Mixture of factor analyzers using priors from non-parallel speech for voice conversion”, IEEE Signal Processing Letters, 19(12), December 2012, pp. 914-917.
  • Omid Dehzangi, Bin Ma, Eng-Siong Chng, and Haizhou Li, “Discriminative Feature Extraction for Speech Recognition Using Continuous Output Codes”, Pattern Recognition Letters, 33(13), October 2012, pp. 1703-1709.
  • Liyuan Li, Shuicheng Yan, Xinguo Yu, Yeow Kee Tan, and Haizhou Li, “Robust Multiperson Detection and Tracking for Mobile Service and Social Robots”, IEEE Transactions on Systems, Man, and Cybernetics – part B: Cybernetics, 42(5), October 2012, pp. 1398-1412.
  • Tomi Kinnunen, Rahim Saeidi, Filip Sedlak, Kong Aik Lee, Johan Sandberg, Maria Hansson-Sandsten, and Haizhou Li, ”Low-Variance Multitaper MFCC Features: a Case Study in Robust Speaker Verification”, IEEE Transactions on Audio, Speech and Language Processing, 20(7),  September 2012, pp. 1990-2001.
  • Andreea Niculescu, Betsy van Dijk, Anton Nijholt, Haizhou Li, and Swee Lan See, “Making social robots more attractive: the effects of voice pitch, humor and empathy”, International Journal of Social Robotics, 5(2), April 2012, pp. 171-191.
  • Wenliang Chen, Jun’ichi Kazama, Min Zhang, Yoshimasa Tsuruoka, Yujie Zhang, Yiou Wang, Kentaro Torisawa, and Haizhou Li, “Bitext dependency parsing with auto-generated bilingual treebank”, IEEE Transactions on Audio, Speech and Language Processing, 20(5), July 2012, pp. 1461-1472.
  • Xiaoxuan Wang, Lei Xie, Mimi Lu, Bin Ma, Engsiong Chng, and Haizhou Li, “Broadcast news story segmentation using conditional random fields and  multimodal features”, IEICE Transactions on Information and Systems, E95-D(5), May 2012, pp. 1206-1215.
  • Yi Ren Leng, Tran Huy Dat, Norihide Kitaoka, and Haizhou Li, “Selective gammatone envelope feature for robust sound event recognition”, IEICE Transactions, 95-D(5), May 2012, pp. 1229-1237.
  • Rui Yan, Keng Peng Tee, Yuanwei Chua, Haizhou Li, and Huajin Tang, “Gesture Recognition Based on Localist Attractor Networks with Application to Robot Control”, IEEE Computational Intelligence Magazine, 7(1), February 2012, pp. 64-74.
  • Keng Peng Tee, Rui Yan, Yuanwei Chua, Zhiyong Huang, and Haizhou Li, “Modular IK: a Robust Inverse Kinematic Algorithm for Gesture Imitation in an  Upper-Body Humanoid Robot”, International Journal of Humanoid Robotics, 9(2), June 2012.
  • Jin-Shea Kuo and Haizhou Li, “Learning regional transliteration variants”, Information Processing and Management, 48(1), January 2012, pp. 154-169.
  • Tin Lay Nwe, Hanwu Sun, Bin Ma, and Haizhou Li, “Speaker Clustering and Cluster Purification Methods for RT07 and RT09 Evaluation Meeting Data”, IEEE Transactions on Audio, Speech and Language Processing, 20(2), February 2012, pp. 461-473.
  • Haizhou Li, “FOREWORD – Special Section on Recent Advances in Multimedia Signal Processing Techniques and Applications”, IEICE Transactions on  Information and Systems, 95-D(5), May 2012, pp. 1181-1181.

2011

  • Haizhou Li, John-John Cabibihan, and Yeow Kee Tan, “Towards an Effective Design of Social Robots”, International Journal of Social Robotics, vol. 3, no. 4, November 2011, pp. 333-335.
  • Huajin Tang and Haizhou Li, “Book Review: Information Theoretic Learning: Renyi’s Entropy and Kernel Perspectives”, IEEE Computational Intelligence Magazine, vol. 6, no. 3, August 2011, pp. 60-62.
  • Eliathamby Ambikairajah, Haizhou Li, Liang Wang, Bo Yin, and Vidhyasaharan Sethu, “Language Identification: A Tutorial”, IEEE Circuits and Systems Magazine, vol. 11, no. 2, June 2011, pp. 82-108.
  • Huajin Tang Haizhou Li, and Zhang Yi, “Online learning and stimulus-driven responses of neurons in visual cortex”, Cognitive Neurodynamics, vol. 5, no. 1, March 2011, pp. 77-85.
  • Omid Dehzangi, Bin Ma, Eng-Siong Chng, and Haizhou Li, “Error Corrective Fusion of Classifier Scores for Spoken Language”, IEICE Transactions on Information and Systems, vol. E94-D, no.12, December 2011, pp. 1994-1997.
  • Deyi Xiong, Min Zhang, and Haizhou Li, “A Maximum Entropy Segmentation Model for Statistical Machine Translation”, IEEE Transactions on Audio, Speech and Language Processing, vol. 19, no. 8, November 2011, pp. 2494-2505.
  • Huy Dat Tran and Haizhou Li, “Sound Event Recognition with Probabilistic Distance SVMs”, IEEE Transactions on Audio, Speech and Language Processing, vol. 19, no. 6, August 2011, pp. 1556-1568.
  • Jonathan Dennis, Huy Dat Tran, and Haizhou Li, “Spectrogram Image Feature for Sound Event Classification in Mismatched Conditions”, IEEE Signal Processing Letters, vol. 18, no. 2, February 2011, pp. 130-133.
  • Kong Aik Lee, Chang Huai You, Haizhou Li, Tomi Kinnunen, and Khe Chai Sim, “Using Discrete Probabilities with Bhattacharyya Measure for SVM-based Speaker Verification”, IEEE Transactions on Audio, Speech and Language Processing, vol. 19, no. 4, May 2011, pp. 861-870.
  • Donglai Zhu, Bin Ma, and Haizhou Li, “Speaker Verification with Feature-Space MAPLR Parameters”, IEEE Transactions on Audio, Speech and Language Processing, vol. 19, no. 3, March 2011, pp. 505-515.
  • Namunu C. Maddage and Haizhou Li, “Beat Space Segmentation and Octave Scale Cepstral Feature for Sung Language Recognition in Pop Music”, ACM Transactions on Multimedia Computing, Communications and Applications (TOMCCAP), vol. 7, no. 4, Article 37, November 2011, pp. 1-20.

2010

  • Haizhou Li and Ma Bin, “TechWare: Speaker and Spoken Language Recognition Resources”, IEEE Signal Processing Magazine, vol. 27, no. 6, November 2010, pp. 139-142.
  • Deyi Xiong, Min Zhang, Aiti Aw, and Haizhou Li, “Linguistically Annotated Reordering Evaluation and Analysis”, Computational Linguistics, vol. 36, no. 3, September 2010, pp. 535-568.
  • Huajin Tang, Haizhou Li, and Zhang Yi, “A Discrete-Time Neural Network for Optimization Problems with Hybrid Constraints”, IEEE Transactions on Neural Networks, vol. 21, no. 7, July 2010, pp. 1184-1189.
  • Lei Wang, Eng Siong Chng, and Haizhou Li, “A Tree-Construction Search Approach for Multivariate Time Series Motifs Discovery”, Pattern Recognition Letters, vol. 31, no. 9, July 2010, pp. 869-875.
  • Huajin Tang, Haizhou Li, and Rui Yan, “Memory Dynamics in Attractor Networks with Saliency Weights”, Neural Computation, vol. 22, no. 7, July 2010, pp. 1899-1926.
  • Chang Huai You, Kong Aik Lee, and Haizhou Li, “GMM-SVM Kernel with a Bhattacharyya-Based Distance for Speaker Recognition”, IEEE Transactions on Audio, Speech and Language Processing, vol. 18, no. 6, August 2010, pp. 1300-1312.
  • Tomi Kinnunen and Haizhou Li, “An Overview of Text-Independent Speaker Recognition: from Features to Supervectors”, Speech Communication, vol. 52,  no. 1, January 2010, pp. 12-40. (Speech Communication Most Cited Article since 2007)
  • Xiong Xiao, Jinyu Li, Eng Siong Chng, Haizhou Li, and Chin-Hui Lee, “A Study on the Generalization Capability of Acoustic Models for Robust Speech Recognition”, IEEE Transactions on Audio, Speech and Language Processing, vol. 18, no. 6, August 2010, pp. 1158-1169.
  • Namunu C. Maddage, Khe Chai Sim, and Haizhou Li, “Word Level Automatic Alignment of Music and Lyrics using Vocal Synthesis”, ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP), vol. 6, no. 3, Article 19, August 2010. pp. 1-16.
  • Tee Kiah Chia, Khe Chai Sim, Haizhou Li, and Hwee Tou Ng, “Statistical Lattice-Based Spoken Document Retrieval”, ACM Transactions on Information  Systems, vol. 28, no. 1, Article 2, January 2010, pp. 1-30.

2009

  • Huy Dat Tran and Haizhou Li, “Jump Function Kolmogorov for Audio Classification in Noise-mismatch Conditions”, IEEE Transactions on Signal Processing, vol. 57, no. 8, August 2009, pp. 2908-2918.
  • Rong Tong, Bin Ma, Haizhou Li, and Eng Siong Chng, “A Target-Oriented Phonotactic Front-end for Spoken Language Recognition”, IEEE Transactions on  Audio, Speech and Language Processing, vol. 17, no. 7, September 2009, pp. 1335-1347.
  • Chang Hui You, Kong-Aik Lee, and Haizhou Li, “An SVM Kernel with GMM-Supervector Based on the Bhattacharyya Distance for Speaker  Recognition”, IEEE Signal Processing Letters, vol. 16, no. 1, January 2009, pp. 49-52.

2008

  • Donglai Zhu, Haizhou Li, Bin Ma, and Chin-Hui Lee, “Optimizing the Performance of Spoken Language Recognition with Discriminative Training”, IEEE Transactions on Audio, Speech and Language Processing, vol. 16, no. 8, November 2008, pp. 1642-165.
  • Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Normalization of the Speech Modulation Spectra for Robust Speech Recognition”, IEEE Transactions on Audio, Speech and Language Processing, vol. 16, no. 8, November 2008, pp. 1662-1674.
  • Haizhou Li, Jin-Shea Kuo, Jian Su, and Chih-Lung Lin, “Mining Live Transliterations using Incremental Learning Algorithms”, International Journal of Computer Processing of Languages, vol. 21, no. 2, June 2008, pp. 183-203.
  • Khe Chia Sim and Haizhou Li, “On Acoustic Diversification Front-end for Spoken Language Identification”, IEEE Transactions on Audio, Speech and Language Processing, vol. 16, no. 5, July 2008, pp. 1029-1037.
  • Jin-shea Kuo, Haizhou Li, and Ying-Kuei Yang, “Active Learning for Constructing Transliteration Lexicons from the Web”, Journal of the American Society for Information Science and Technology, vol. 59, no. 1, January 2008, pp. 126-135.

2007

  • Bin Ma, Haizhou Li, and Rong Tong, “Spoken Language Recognition with Ensemble Classifiers”, IEEE Transactions on Audio, Speech and Language Processing, vol. 15, no. 7, September 2007, pp. 2053-2062.
  • Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Temporal Structure Normalization of Speech Feature for Robust Speech Recognition”, IEEE Signal Processing Letters, vol. 14, no. 7, July 2007, pp. 500-503.
  • Jin-Shea Kuo, Haizhou Li, and Ying-Kuei Yang, “A Phonetic Similarity Model for Automatic Extraction of Transliteration Pairs”, ACM Transactions on Asian  Language Information Processing, vol. 6, no. 2, Article 6, September 2007, pp. 1-24.
  • Tin Lay and Haizhou Li, “Exploring Vibrato-Motivated Acoustic Features for Singer Identification”, IEEE Transactions on Audio, Speech and Language Processing, vol. 15, no. 2, February 2007, pp. 519-530.
  • Haizhou Li, Bin Ma, and Chin-Hui Lee, “A Vector Space Modeling Approach to Spoken Language Identification”, IEEE Transactions on Audio, Speech and  Language Processing, vol. 15, no. 1, January 2007, pp. 271-284.

2006

  • Minghui Dong, Kim-Teng Lua, and Haizhou Li, “A Unit Selection-based Speech Synthesis Approach for Mandarin Chinese”, Journal of Chinese Language and Computing, vol. 16, no. 1, March 2006, pp. 1-10.
  • Bin Ma and Haizhou Li, “A Comparative Study of Four Language Identification Systems”, Computational Linguistics and Chinese Language Processing, vol. 11, no. 2, June 2006, pp. 159-182.

1995

  • Jian Su, K. T. Ng, Haizhou Li, and Jean-Paul Haton, “Nonparametric Distance Measures of Speaker Verification”, IET Electronics Letters, vol. 31, no. 9, April 1995, pp. 700-701.
  • Haizhou Li, Jian Su, Jean-Paul Haton, “Short-Timed Speech Dynamics for Speaker Recognition”, IET Electronics Letters, vol. 31, no. 17, August 1995, pp. 1416-1418.