Speech and audio processing | Research

Research areas

Spoken language technology: automatic speech recognition, speaker recognition, language identification, keyword search
Speech and audio signal processing: speech enhancement, sound source separation, pitch estimation, music transcription
Rehabilitation technologies for communication disorders: hearing aids and cochlear implants, pathological speech analysis, speech and language assessment

Current Projects

Unsupervised speech modeling for low-resource languages
To be added.
Objective assessment of pathological voices
To be added.
Acoustical analysis of aphasia speech
To be added.
Computer-assisted assessment technology of speech, hearing and language disabilities
To be added.
Multi-pitch estimation of piano music
To be added.
Music chord recognition
To be added.

Team

Publications

Publications in 2025 2025
Publications in 2024 2024
1. Jingyu Li, Aemon Yat Fei Chiu and Tan Lee,"An Investigation of Reprogramming for Cross-Language Adaptation in Speaker Verification Systems," in Proc. ISCSLP 2024, pp.390-394, Beijing, China, Nov. 7-10, 2024.
2. Yujia Xiao, Xi Wang, Xu Tan, Lei He, Xinfa Zhu, Sheng Zhao and Tan Lee,"Contrastive Context-Speech Pretraining for Expressive Text-to-Speech Synthesis," in Proceedings of the 32nd ACM International Conference on Multimedia, pp. 2099-2107, Melbourne, Australia, Oct. 28 - Nov. 1, 2024.
3. S.-I. Ng, C.W.-Y. Ng, J. Wang and Tan Lee,"Automatic Detection of Speech Sound Disorder in Cantonese-Speaking Pre-School Children," in IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 32, pp. 4355-4368, 2024.
4. Wei Liu, Jingyong Hou, Dong Yang, Muyong Cao and Tan Lee,"LUPET: Incorporating Hierarchical Information Path into Multilingual ASR", in Proc. Interspeech 2024, Kos, Greece, pp. 3979-3983, Kos, Greece, Sep. 1-5, 2024.
5. Wei Liu, Jingyong Hou, Dong Yang, Muyong Cao and Tan Lee,"A Parameter-efficient Language Extension Framework for Multilingual ASR", in Proc. Interspeech 2024, Kos, Greece, pp. 3929-3933, Kos, Greece, Sep. 1-5, 2024.
6. Dehua Tao, Tan Lee, Harold Chiu and Sarah Luk,"Learning Representation of Therapist Empathy in Counseling Conversation Using Siamese Hierarchical Attention Network", in Proc. Interspeech 2024, Kos, Greece, pp. 1085-1089, Kos, Greece, Sep. 1-5, 2024.
7. J. Li and Tan Lee,"Efficient Black-Box Speaker Verification Model Adaptation With Reprogramming And Backend Learning", 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), pp. 12732-12736, Seoul, Korea, April 14-19, 2024.
8. D. Tao, Tan Lee, H. Chui and S. Luk,"Modeling Intrapersonal and Interpersonal Influences for Automatic Estimation of Therapist Empathy in Counseling Conversation", 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), pp. 12692-12696, Seoul, Korea, April 14-19, 2024.
9. W. Liu, Y. Qin, Z. Peng and Tan Lee,"Sparsely Shared Lora on Whisper for Child Speech Recognition", 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), pp. 11751-11755, Seoul, Korea, April 14-19, 2024.
10. Y. Tian, J. Li and Tan Lee,"Creating Personalized Synthetic Voices from Articulation Impaired Speech Using Augmented Reconstruction Loss", 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), pp. 11501-11505, Seoul, Korea, April 14-19, 2024.
Publications in 2023 2023
1. Y. Tian, W. Liu and Tan Lee,"Diffusion-Based Mel-Spectrogram Enhancement for Personalized Speech Synthesis with Found Data," in 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Taipei, Taiwan, Dec. 16-20, 2023.
2. Si-Ioi Ng, Cymie Wing-Yee Ng and Tan Lee,"A Study on Using Duration and Formant Features in Automatic Detection of Speech Sound Disorder in Children," in Proc.Interspeech 2023, Dublin, Ireland, pp. 4643-4647, Aug. 20-24, 2023.
3. Dehua Tao, Tan Lee, Harold Chui and Sarah Luk,"A Study on Prosodic Entrainment in Relation to Therapist Empathy in Counseling Conversation," in Proc.Interspeech 2023, Dublin, Ireland, pp. 3662-3666, Aug. 20-24, 2023.
4. Wei Liu, Zhiyuan Peng and Tan Lee,"CoMFLP: Correlation Measure Based Fast Search on ASR Layer Pruning," in Proc.Interspeech 2023, Dublin, Ireland, pp. 3282-3286, Aug. 20-24, 2023.
5. Jingyu Li, Wei Liu, Zhaoyang Zhang, Jiong Wang and Tan Lee,"Model Compression for DNN-based Speaker Verification Using Weight Quantization," in Proc.Interspeech 2023, Dublin, Ireland, pp. 1988-1992, Aug. 20-24, 2023.
6. Yusheng Tian, Guangyan Zhang and Tan Lee,"Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models," in Proc.Interspeech 2023, Dublin, Ireland, pp. 4893–4897, Aug. 20-24, 2023.
7. Yujia Xiao, Shaofei Zhang, Xi Wang, Xu Tan, Lei He, Sheng Zhao, Frank K. Soong and Tan Lee,"ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading," in Proc.Interspeech 2023, Dublin, Ireland, pp. 4883-4887, Aug. 20-24, 2023.
8. Monira Islam and Tan Lee,"Functional Connectivity Analysis in Multi-channel EEG for Emotion Detection with Spatial-Temporal Features and 3D CNN," 45th Annual International Conference of the IEEE Engineering in Medicine Biology Society (EMBC), NSW, Sydney, Australia, 24-27 July, 2023.
9. Jonathan H.N. Lee, Eddie S.K. Chong, Harold Chui, Tan Lee, Sarah Luk, Dehua Tao and Nicolette W.T. Lee,"A curvilinear association between therapists’ use of discourse particles and therapist empathy in psychotherapy," Journal of Counseling Psychology, 70(5), pp.562-570, July 2023.
10. Jingyu Li, Yusheng Tian and Tan Lee,"Convolution-Based Channel-Frequency Attention for Text-Independent Speaker Verification," 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023), Rhodes Island, Greece, June 4-10, 2023.
11. Wei Liu, Kaiqi Fu, Xiaohai Tian, Shuju Shi, Wei Li, Zejun Ma and Tan Lee,"Leveraging phone-level linguistic-acoustic similarity for utterance-level pronunciation scoring," 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023), Rhodes Island, Greece, June 4-10, 2023.
12. Wei Liu, Kaiqi Fu, Xiaohai Tian, Shuju Shi, Wei Li, Zejun Ma and Tan Lee,"An ASR-Free Fluency Scoring Approach with Self-Supervised Learning," 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023), Rhodes Island, Greece, June 4-10, 2023.
13. Guangyan Zhang, Ying Qin, Wenjie Zhang, Jialun Wu, Mei Li, Yutao Gai, Feijun Jiang and Tan Lee,"iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis Based on Disentanglement Between Prosody and Timbre," IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 31, pp.1693-1705, April 2023.
Publications in 2022 2022
1. Jonathan H.N. Lee, Harold Chui, Tan Lee, Sarah Luk, Dehua Tao and Nicolette W.T. Lee,"Formality in psychotherapy: How are therapists’ and clients’ use of discourse particles related to therapist empathy?" Frontiers in Psychiatry, Dec., 2022.
2. Dehua Tao, Harold Chui, Sarah Luk and Tan Lee,"CUEMPATHY: A Counseling Speech Dataset for Psychotherapy Research," in Proc. ISCSLP 2022, pp.354-358, Singapore, Dec. 11-14, 2022.
3. Zhiyuan Peng, Huanji He, Ke Ding, Tan Lee and Guanglu Wan,"Label-free Knowledge Distillation with Contrastive Loss for Light-weight Speaker Recognition," in Proc. ISCSLP 2022, pp.324-328, Singapore, Dec. 11-14, 2022.
4. Daxin Tan, Liqun Deng, Nianzu Zheng, Y.T. Yeung, Xin Jiang, Xiao Chen and Tan Lee,"Correct Speech: A Fully Automated System for Speech Correction and Accent Reduction," in Proc. ISCSLP 2022, pp.81-85, Singapore, Dec. 11-14, 2022.
5. Monira Islam and Tan Lee,"Wavelet based Emotion Detection from Multi-channel EEG using a Hybrid CNN-LSTM Model," in Proc. TENCON 2022 - 2022 IEEE Region 10 Conference (TENCON), IEEE, Hong Kong, Nov. 01-04, 2022.
6. Dehua Tao, Tan Lee, Harold Chui and Sarah Luk,"Hierarchical Attention Network for Evaluating Therapist Empathy in Counseling Session," in Proc.Interspeech 2022, pp.2008-2012, Incheon, Korea, Sept. 18-22, 2022.
7. Dehua Tao, Tan Lee, Harold Chui and Sarah Luk,"Characterizing Therapist's Speaking Style in Relation to Empathy in Psychotherapy," in Proc.Interspeech 2022, pp.2003-2007, Incheon, Korea, Sept. 18-22, 2022.
8. Si-Ioi Ng, Cymie Wing-Yee Ng, Jiarui Wang and Tan Lee,"Automatic Detection of Speech Sound Disorder in Child Speech Using Posterior-based Speaker Representations," in Proc.Interspeech 2022, pp.2853-2857, Incheon, Korea, Sept. 18-22, 2022.
9. Jingyu Li, Wei Liu and Tan Lee,"EDITnet: A Lightweight Network for Unsupervised Domain Adaptation in Speaker Verification," in Proc.Interspeech 2022, pp.3694-3698, Incheon, Korea, Sept. 18-22, 2022.
10. Jonathan Him Nok Lee, Dehua Tao, Harold Chui, Tan Lee, Sarah Luk , Nicolette Wing Tung Lee and Koonkan Fung,"Durational Patterning at Discourse Boundaries in Relation to Therapist Empathy in Psychotherapy," in Proc.Interspeech 2022, pp.5248-5252, Incheon, Korea, Sept. 18-22, 2022.
11. Yusheng Tian, Jingyu Li and Tan Lee,"Transport-Oriented Feature Aggregation for Speaker Embedding Learning," in Proc.Interspeech 2022, pp.316-320, Incheon, Korea, Sept. 18-22, 2022.
12. Zhiyuan Peng, Xuanji He, Ke Ding, Tan Lee and Guanglu Wan,"Unifying Cosine and PLDA Back-ends for Speaker Verification," in Proc.Interspeech 2022, pp.336-340, Incheon, Korea, Sept. 18-22, 2022.
13. Guangyan Zhang, Kaitao Song, Xu Tan, Daxin Tan, Yuzi Yan, Yanqing Liu, Gang Wang, Wei Zhou, Tao Qin, Tan Lee and Sheng Zhao,"Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech," in Proc.Interspeech 2022, pp.456-460, Incheon, Korea, Sept. 18-22, 2022.
14. Daxin Tan, Guangyan Zhang and Tan Lee,"Environment Aware Text-to-Speech Synthesis," in Proc.Interspeech 2022, pp.481-485, Incheon, Korea, Sept. 18-22, 2022.
15. Monira Islam, Tan Lee,"MEMD-HHT based Emotion Detection from EEG using 3D CNN," 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC 2022), pp. 294-297, Scottish Event Campus, Glasgow, UK, July 11-15, 2022.
16. Monira Islam, Tan Lee,"Multivariate Empirical Mode Decomposition of EEG for Mental State Detection at Localized Brain Lobes," 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC 2022), pp. 3759-3762, Scottish Event Campus, Glasgow, UK, July 11-15, 2022.
17. Rui-Si Ma, Si-Ioi Ng, Tan Lee, Yi-Jian Yang and Raymond Kim-Wai Sum,"Validation of a Speech Database for Assessing College Students' Physical Competence under The Concept of Physical Literacy," International Journal of Environmental Research and Public Health 2022, vol. 19, no. 12, 7046, 2022.
18. Si-Ioi Ng, Rui-Si Ma, Tan Lee and Raymond Kim-Wai Sum,"Acoustical Analysis of Speech Under Physical Stress in Relation to Physical Activities and Physical Literacy," Proc. Speech Prosody 2022, pp. 200-204, Lisbon, Portugal, Portugal, May 23-26, 2022.
19. G.Y. Zhang, Y.C. Leng, D. Tan, Y. Qin, K.T. Song, X. Tan, S. Zhao and Tan Lee,"A Study on the Efficacy of Model Pre-Training In Developing Neural Text-to-Speech System," 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022), pp. 6087-6091, Singapore, May 22-27, 2022.
20. Shuiyang Mao, P.C. Ching and Tan Lee,"Enhancing Segment-Based Speech Emotion Recognition by Iterative Self-Learning," in IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 30, pp. 123-134, 2022.
Publications in 2021 2021
1. Hei-Yi Mak and Tan Lee,"Low-resource NMT: a case study on the written and spoken Languages in Hong Kong,” in Proceedings of the 5th International Conference on Natural Language Processing and Information Retrieval (NLPIR), Sanya, China, pp.81-87, Dec. 17-20, 2021.
2. Jingyu Li, Si-Ioi Ng and Tan Lee,"Improving Text-Independent Speaker Verification With Auxiliary Speakers Using Graph," in 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Cartagena, pp. 198-205, Dec. 13-17, 2021.
3. W. Liu and Tan Lee,"Utterance-level neural confidence measure for end-to-end children speech recognition," in 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Cartagena, pp. 449-456, Dec. 13-17, 2021.
4. D. Tan, L. Deng, Y.T. Yeung, X. Jiang, X, Chen and Tan Lee,"EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional Fusion," in 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Cartagena, pp. 626-633, Dec. 13-17, 2021.
5. D. Tan, and Tan Lee,"Fine-Grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement," in Proc. Interspeech 2021, Brno, Czechia, pp. 4683-4687, Aug 30-Sept. 1, 2021.
6. Z. Peng, X. Li and Tan Lee,"Pairing Weak with Strong: Twin Models for Defending Against Adversarial Attack on Speaker Verification," in Proc. Interspeech 2021, Brno, Czechia, pp. 4284-4288, Aug 30-Sept. 1, 2021.
7. G. Zhang, Y. Qin, D. Tan and Tan Lee,"Applying the Information Bottleneck Principle to Prosodic Representation Learning," in Proc. Interspeech 2021, Brno, Czechia, pp. 3156-3160, Aug 30-Sept. 1, 2021.
8. S.-I. Ng, C.W.-Y Li, and Tan Lee,"Detection of Consonant Errors in Disordered Speech Based on Consonant-Vowel Segment Embedding," in Proc. Interspeech 2021, Brno, Czechia, pp. 2931-2935, Aug 30-Sept. 1, 2021.
9. Xurong Xie, Xunying Liu, Tan Lee and Lan Wang,"Bayesian learning for deep neural network adaptation," in IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 29, pp. 2096-2110, May 2021.
10. G.Y. Zhang, S.R. Qiu, Ying Qin and Tan Lee,"Estimating Mutual Information in Prosody Representation for Emotional Prosody Transfer in Speech Synthesis," in Proc. ISCSLP 2021, Hong Kong, Jan 24-26, 2021.
11. Ying Qin, Yao Qian, A. Loukina, P. Lange, A. Misra, K. Evanini and Tan Lee,"Automatic Detection of Word-Level Reading Errors in Non-native English Automatic Detection of Word-Level Reading Errors in Non-native English," in Proc. ISCSLP 2021, Hong Kong, Jan 24-26, 2021.
Publications in 2020 2020
1. Si-Ioi Ng and Tan Lee,"Automatic Detection of Phonological Errors in Child Speech Using Siamese Recurrent Autoencoder," in Proc. Interspeech 2020, Shanghai, China, pp. 4476-4480, Oct 25-29, 2020.
2. Guangyan Zhang, Ying Qin and Tan Lee,"Learning Syllable-Level Discrete Prosodic Representation for Expressive Speech Generation," in Proc. Interspeech 2020, Shanghai, China, pp. 3426-3430, Oct 25-29, 2020.
3. Shuiyang Mao, P.C. Ching, C.-C. Jay Kuo and Tan Lee,"Advancing Multiple Instance Learning with Attention Modeling for Categorical Speech Emotion Recognition," in Proc. Interspeech 2020, Shanghai, China, pp. 2357-2361, Oct 25-29, 2020.
4. Shuiyang Mao, P.C. Ching and Tan Lee,"EigenEmo: Spectral Utterance Representation Using Dynamic Mode Decomposition for Speech Emotion Classification," in Proc. Interspeech 2020, Shanghai, China, pp. 2352-2356, Oct 25-29, 2020.
5. Jingyu Li and Tan Lee,"Text-Independent Speaker Verification with Dual Attention Network," in Proc. Interspeech 2020, Shanghai, China, pp. 956-960, Oct 25-29, 2020.
6. Shuiyang Mao, P.C. Ching and Tan Lee,"Emotion Profile Refinery for Speech Emotion Classification," in Proc. Interspeech 2020, Shanghai, China, pp. 531-535, Oct 25-29, 2020.
7. Si-Ioi Ng, Cymie Wing-Yee Ng, Jiarui Wang, Tan Lee, Kathy Y.S. Lee and Michael C.F. Tong,"CUCHILD: A Large-Scale Cantonese Corpus of Child Speech for Phonology and Articulation Assessment," in Proc. Interspeech 2020, Shanghai, China, pp. 424-428, Oct 25-29, 2020.
8. Ying Qin, Y. Wu, Tan Lee and A.P.H. Kong,"An End-to-End Approach to Automatic Speech Assessment for Cantonese-speaking People with Aphasia," Journal of Signal Processing Systems, vol.92, No.8, pp. 819-830, Aug. 2020.
9. Y.Z. Wu and Tan Lee,"Time-Frequency Feature Decomposition Based On Sound Duration For Acoustic Scene Classification," 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), pp. 716-720, Virtual Barcelona, Spain, May 4-8, 2020.
10. Matthew K.-H. Ma, Tan Lee, Manson C.-M. Fong and William S.Y. Wang,"Resting-State EEG-Based Biometrics With Signals Features Extracted By Multivariate Empirical Mode Decomposition," 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), pp. 991-995, Virtual Barcelona, Spain, May 4-8, 2020.
11. Z.Y. Peng, S.Y. Feng and Tan Lee,"Mixture Factorized Auto-Encoder For Unsupervised Hierarchical Deep Factorization Of Speech Signal," 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), pp. 6769-6773, Virtual Barcelona, Spain, May 4-8, 2020.
12. Y. Qin, Tan Lee and A.P.H. Kong,"Automatic Assessment of Speech Impairment in Cantonese-speaking People with Aphasia," IEEE Journal of Selected Topic in Signal Processing (JSTSP), vol. 14, no. 2, pp. 331-345, Feb. 2020.
Publications in 2019 2019
1. Shuiyang Mao, P.C. Ching, Tan Lee, "Deep Learning of Segment-Level Feature Representation with Multiple Instance Learning for Utterance-Level Speech Emotion Recognition ," in Proc. Interspeech 2019, Graz, Austria, pp. 1686-1690, Sept 15-19, 2019.
2. Jiarui Wang, Ying Qin, Zhiyuan Peng, Tan Lee, "Child Speech Disorder Detection with Siamese Recurrent Network using Speech Attribute Features," in Proc. Interspeech 2019, Graz, Austria, pp. 3885-3889, Sept 15-19, 2019.
3. Xurong Xie, Xunying Liu, Tan Lee, Lan Wang, "Fast DNN Acoustic Model Speaker Adaptation by Learning Hidden Unit Contribution Features ," in Proc. Interspeech 2019, Graz, Austria, pp. 759-763, Sept 15-19, 2019.
4. Ying Qin, Tan Lee, Anthony Pak Hin Kong, "Automatic Assessment of Language Impairment Based on Raw ASR Output," in Proc. Interspeech 2019, Graz, Austria, pp. 3078-3082, Sept 15-19, 2019.
5. Siyuan Feng, Tan Lee, Zhiyuan Peng, "Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling," in Proc. Interspeech 2019, Graz, Austria, pp. 1093-1097, Sept 15-19, 2019.
6. Siyuan Feng, Tan Lee, "Improving Unsupervised Subword Modeling via Disentangled Speech Representation Learning and Transformation," in Proc. Interspeech 2019, Graz, Austria, pp. 281-285, Sept 15-19, 2019.
7. Siyuan Feng, Tan Lee, "Exploiting Cross-Lingual Speaker and Phonetic Diversity for Unsupervised Subword Modeling," IEEE/ACM Trans. Audio, Speech, Lang. Process., vol 27, no. 12, pp. 2000-2011, 2019.
8. Yuanyuan Liu, Tan Lee, Thomas Law, Kathy Yuet-Sheung Lee, "Acoustical Assessment of Voice Disorder With Continuous Speech Using ASR Posterior Features," IEEE/ACM Trans. Audio, Speech, Lang. Process., vol 27, no. 6, pp. 1047-1059, 2019.
9. Shuiyang Mao, Dehua Tao, Guangyan Zhang, P.C. Ching, Tan Lee, "REVISITING HIDDEN MARKOV MODELS FOR SPEECH EMOTION RECOGNITION," in Proc. ICASSP 2019, Brighton, UK, pp. 6715-6719, May 12-17, 2019.
10. Xurong Xie, Xunying Liu, Tan Lee, Lan Wang, "BLHUC: BAYESIAN LEARNING OF HIDDEN UNIT CONTRIBUTIONS FOR DEEP NEURAL NETWORK SPEAKER ADAPTATION," in Proc. ICASSP 2019, Brighton, UK, pp. 5711-5715, May 12-17, 2019.
11. Yuzhong Wu, Tan Lee, "ENHANCING SOUND TEXTURE IN CNN-BASED ACOUSTIC SCENE CLASSIFICATION," in Proc. ICASSP 2019, Brighton, UK, pp. 815-819, May 12-17, 2019.
12. Ying Qin, Tan Lee, Anthony Pak Hin Kong, "COMBINING PHONE POSTERIORGRAMS FROM STRONG AND WEAK RECOGNIZERS FOR AUTOMATIC SPEECH ASSESSMENT OF PEOPLE WITH APHASIA," in Proc. ICASSP 2019, Brighton, UK, pp. 6420-6424, May 12-17, 2019.
13. Zhiyuan Peng, Siyuan Feng, Tan Lee, "ADVERSARIAL MULTI-TASK DEEP FEATURES AND UNSUPERVISED BACK-END ADAPTATION FOR LANGUAGE RECOGNITION," in Proc. ICASSP 2019, Brighton, UK, pp. 5961-5965, May 12-17, 2019.
Publications in 2018 2018
1. Shuiyang Mao and P.C. Ching, "An Effective Discriminative Learning Approach for Emotion-Specific Features Using Deep Neural Networks," in Proc. ICONIP 2018, Siem Reap, Cambodia, pp. 50-61, Dec 13-16, 2018.
2. Si Ioi Ng, Dehua Tao, Jiarui Wang, Yi Jiang, Wing Yee Ng, Tan Lee, "An Automated Assessment Tool for Child Speech Disorders," in Proc. ISCSLP 2018, Taipei, Taiwan, pp. 493-494, Nov 26-29, 2018.
3. Yuanyuan Liu, Tan Lee, P. C. Ching, Thomas K. T. Law and Kathy Y. S. Lee, "Prediction of Voice Disorder Severity: Contributions from Sustained Vowels and Continuous Speech," in Proc. ISCSLP 2018, Taipei, Taiwan, pp. 290-294, Nov 26-29, 2018.
4. Yuanyuan Liu, Ying Qin, Siyuan Feng, Tan Lee and P.C. Ching, "Disordered Speech Assessment Using Kullback-Leibler Divergence Features with Multi-Task Acoustic Modeling," in Proc. ISCSLP 2018, Taipei, Taiwan, pp. 61-65, Nov 26-29, 2018.
5. Jiarui Wang, Si Ioi Ng, Dehua Tao, Wing Yee Ng and Tan Lee, "A Study on Acoustic Modeling for Child Speech Based on Multi-Task Learning," in Proc. ISCSLP 2018, Taipei, Taiwan, pp. 389-393, Nov 26-29, 2018.
6. Xurong Xie, Xunying Liu, Tan Lee and Lan Wang, "Investigation of Stacked Deep Neural Networks and Mixture Density Networks for Acoustic-to-Articulatory Inversion," in Proc. ISCSLP 2018, Taipei, Taiwan, pp. 36-40, Nov 26-29, 2018.
7. Ying Qin, Tan Lee, Yuzhong Wu and Anthony Pak Hin Kong, "An End-to-End Approach to Automatic Speech Assessment for People with Aphasia," in Proc. ISCSLP 2018, Taipei, Taiwan, pp 66-70, Nov 26-29, 2018.
8. Man-Ling Sung, Siyuan Feng and Tan Lee, "Unsupervised Pattern Discovery from Thematic Speech Archives based on Multilingual Bottleneck Features," in Proc. APSIPA ASC 2018, Honolulu, USA, pp. 1448-1455, Nov 12-15, 2018.
9. Hansjörg Mixdorff, Albert Rilliard, Tan Lee, Matthew K. H. Ma and Angelika Hönemann, "Cross-cultural (a)symmetries in audio-visual attitude perception," in Proc. Interspeech 2018, Hyderabad, India, pp. 426-430 Sept 2-6, 2018.Paper
10. Ying Qin, Tan Lee, Siyuan Feng and Anthony Pak Hin Kong, "Automatic speech assessment for people with aphasia using TDNN-BLSTM with multi-task learning," in Proc. Interspeech 2018, Hyderabad, India, pp 3418-3422, Sept 2-6, 2018.Paper
11. Siyuan Feng and Tan Lee, "Improving cross-lingual knowledge transferability using multilingual TDNN-BLSTM with language-dependent pre-final layer," in Proc. Interspeech 2018, Hyderabad, India, pp. 2439-2443, Sept 2-6, 2018.Paper
12. Siyuan Feng and Tan Lee, "Exploiting speaker and phonetic diversity of mismatched language resources for unsupervised subword modeling," in Proc. Interspeech 2018, Hyderabad, India, pp. 2673-2677, Sept 2-6, 2018.Paper
13. Hong Zhang, Mark Liberman and Tan Lee, "Information structure and prosodic prominence: how does sentence final particle affect Cantonese intonation?," in Proc. Speech Prosody 2018, Poznań, Poland, pp. 903-907, June 13-16, 2018.
14. Tan Lee, Matthew K. H. Ma, Albert Rilliard, Hansjörg Mixdorff and Angelika Hönemann, "Free labeling of audio-visual attitudinal expressions in Cantonese," in Proc. Speech Prosody 2018, Poznań, Poland, pp. 483-487, June 13-16, 2018.
15. Lei Xie, Tan Lee and Man-Wai Mak, "Guest editorial: Advances in deep learning for speech processing," Journal of Signal Processing Systems, pp. 1-3, 2018.
16. Ying Qin, Tan Lee and Anthony Pak Hin Kong, "Automatic speech assessment for aphasic patients based on syllable-level embedding and supra-segmental duration features," in Proc. ICASSP 2018, Calgary, Canada, pp. 5994-5998, April 15-20, 2018. Paper
17. Yuzhong Wu and Tan Lee, "Reducing model complexity for DNN based large-scale audio classification," in Proc. ICASSP 2018, Calgary, Canada, pp. 331-335, April 15-20, 2018. arXiv Preprint Version
Publications in 2017 2017
1. Anna Chi Shan Kam, John Ka Keung Sung, Tan Lee, Terence Ka Cheong Wong and Andrew van Hasselt,"Improving mobile phone speech recognition by personalized amplification: Application in people with normal hearing and mild-to-moderate hearing loss," Ear and Hearing, Vol.38, No.2, 2017.
2. Hansjörg Mixdorff, Angelika Hönemann, Albert Rilliard, Tan Lee and Matthew K. H. Ma, "Audio-visual expressions of attitude: How many different attitudes can perceivers decode?," Speech Communication, vol. 95, pp.114 - 126, December 2017. Online Version
3. Hansjörg Mixdorff, Angelika Hönemann, Albert Rilliard, Tan Lee and Matthew Ma, "Cross-Language perception of audio-visual attitudinal expressions," in Proc. AVSP 2017, Stockholm, SWEDEN, August 25-26, 2017. Paper
4. Siyuan Feng and Tan Lee, "On the linguistic relevance of speech units learned by unsupervised acoustic modeling," in Proc. Interspeech 2017, Stockholm, SWEDEN, pp. 2068-2072, August 20-24, 2017. Paper
5. Xurong Xie, Xunying Liu, Tan Lee and Lan Wang, "RNN-LDA clustering for feature based DNN adaptation," in Proc. Interspeech 2017, Stockholm, SWEDEN, pp. 2396-2400, August 20-24, 2017. Paper
6. Yuanyuan Liu, Tan Lee, P. C. Ching, Thomas K. T. Law and Kathy Y. S. Lee,"Acoustic assessment of disordered voice with continuous speech based on utterance-level ASR posterior features, " in Proc. Interspeech 2017, Stockholm, SWEDEN, pp. 2680-2684, August 20-24, 2017. Paper
7. Lufei Gao, Li Su, Yi-Hsuan Yang, Tan Lee, "Polyphonic piano note transcription with non-negative matrix factorization of differential spectrogram," in Proc. ICASSP 2017, New Orleans, USA, pp. 291-295, March 4-9, 2017.
8. Raymond W. M. Ng, Alvin C.M. Kwan, Tan Lee, Thomas Hain, "SHEFCE: A Cantonese-English bilingual speech corpus for pronunciation assessment," in Proc. ICASSP 2017, New Orleans, USA, pp. 5825-5829, March 4-9, 2017.
Publications in 2016 2016
1. Anna Chi Shan Kam, John Ka Keung Sung, Tan Lee, Terence Ka Cheong Wong, Andrew van Hasselt, "Improving mobile phone speech recognition by personalized amplification: application in people with normal hearing and mild-to-moderate hearing loss," in Proc. Ear and Hearing, Online version available from november 2, 2016.
2. Ying Qin, Tan Lee, Anthony Pak Hin Kong, Sam Po Law, "Towards automatic assessment of aphasia speech using automatic speech recognition techniques," in Proc. International Symposium on Chinese Spoken Language Processing, Tianjin, China, October, 17-20, 2016.
3. Siyuan Feng, Tan Lee, Haipeng Wang, "Exploiting language-mismatched phoneme recognizers for unsupervised acoustic modeling," in Proc. International Symposium on Chinese Spoken Language Processing, Tianjin, China, October, 17-20, 2016.
4. Tan Lee, Yuanyuan Liu, Yu Ting Yeung, Thomas K. T. Law, Kathy Y. S. Lee, "Predicting severity of voice disorder from DNN-HMM acoustic posteriors," in Proc. Interspeech 2016, San Francisco, USA, September 8-12, 2016.
5. Jen-Tzung Chien, Pei-Wen Huang, Tan Lee, "Hybrid accelerated optimization for speech recognition," in Proc. Interspeech 2016, San Francisco, USA, September 8-12, 2016.
6. Tan Lee, Yuanyuan Liu, Pei-Wen Huang, Jen-Tzung Chien, Wang Kong Lam, Yu Ying Yeung, Thomas K. T. Law, Kathy Y. S. Lee, Anthony Pak Hin Kong and Sam Po Law, "Automatic speech recognition for acoustical analysis and assessment of Cantonese pathological voice and speech," in Proc. ICASSP 2016, Shanghai, China, March 20-25, 2016.
Publications in 2015 2015
1. Yu Ting Yeung, Tan Lee, Cheung-Chi Leung, "Supervised single-microphone multi-talker speech separation with conditional random fields," IEEE/ACM Trans. on Audio, Speech, and Language Processing, vol.23, no.12, pp.2334-2342, December 2015.
2. Tan Lee, Wang Kong Lam, Anthony Pak Hin Kong, Sam Po Law, "Analysis of intonation patterns in cantonese aphasia speech," in Proc. International Conference Oriental COCOSDA (O-COCOSDA/CASLRE), pp. 86-89, Shanghai, China, October 28-30, 2015.
3. Lufei Gao and Tan Lee, "Multi-pitch estimation based on sparse representation with pre-screened dictionary," in Proc. IEEE 17th International Workshop on Multimedia Signal Processing (MMSP), Xiamen, China, October 19-21, 2015.
4. Tan Lee, Anthony Pak Hin Kong, Wang-Kong Lam," in Proc.Measuring prosodic deficits in oral discourse by speakers with fluent aphasia," Frontiers in Psychology, Conference Abstract of Academy of Aphasia 53rd Annual Meeting, September 2015.
5. Shing Yu, Tan Lee, Manwa L. Ng "Surface electromyographic activity of extrinsic laryngeal muscles in Cantonese tone production," Journal of Signal Processing Systems First online: 11 July 2015.
6. Chun Hoy Wong, Tan Lee, Yu Ting Yeung, P. C. Ching, "Modeling temporal dependency for robust estimation of LP model parameters in speech enhancement," in Proc. Interspeech 2015, Dresden, Germany, September 6-10, 2015.
7. Huijun Ding, Tan Lee, Ing Yann Soon, Chai Kiat Yeo, Peng Dai and Guo Dan, "Objective measures for quality assessment of noise-suppressed speech," Speech Communication, vol. 71, pp. 62-63, July 2015.
8. Feng Huang, Tan Lee, W. Bastiaan Kleijn and Ying-Yee Kong, "A method of speech periodicity enhancement using transform-domain signal decomposition," Speech Communication, vol. 67, pp.102-112, March 2015.
9. Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li, "Acoustic segment modeling with spectral clustering methods," IEEE/ACM Trans. on Audio, Speech and Language Processing , vol 23, no. 2, pp. 264-277, February 2015.
Publications in 2014 2014
1. Chi Shan Kam, John Ka Keung Sung, Tan Lee, Terence Ka Cheong Wong and C. A. van Hasselt, "Improving mobile phone perception by implementing automated customized enhanced technology - application in people with and without hearing loss," in Proc. Hong Kong Speech and Hearing Symposium, October 2014.
2. Nan Yan, Manwa L. Ng, and Tan Lee, "Improving the sound quality of an electronic voice box," in Proc. Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), Siem Reap, Cambodia, December 9-12, 2014
3. Wang-Kong Lam and Tan Lee, "Correcting chord classification errors based on tonal organization information of classical," in Proc. IEEE International Symposium on Multimedia (ISM 2014), Taichung, Taiwan, December 10-12, 2014.
4. Wang-Kong Lam and Tan Lee, "Automatic key partition based on tonal organization information of classical music," in Proc. 15th International Society for Music Information Retrieval Conference (ISMIR 2014), Taipei, Taiwan, October 27-31, 2014.
5. Yu Ting Yeung, Tan Lee, and Cheng Chi Leung, "Large-margin conditional random fields for single-microphone speech separation," in Proc. Interspeech 2014, pp.983-987, Singapore, September 14-18, 2014.
6. Haipeng Wang, Tan Lee, Cheng Chi Leung, Bin Ma and Haizhou Li, "A graph-based Gaussian component clustering approach to unsupervised acoustic modeling," in Proc. Interspeech 2014, pp. 875-879, Singapore, September 14-18, 2014.
7. Feng Huang and Tan Lee, "Multipitch tracking based on linear programming relaxation and sparsity-based pitch candidate estimation," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP), pp. 331-335, Singapore, September 12-14, 2014.
8. Shing Yu, Tan Lee, and Manwa L. Ng, "Surface electromyographic activity of non-laryngeal neck muscles in cantonese tone production," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP), pp. 304-307, Singapore, September 12-14, 2014.
9. Tan Lee, Shing Yu, Meng Yuan, Terence Ka Cheong Wong, and Ying-Yee Kong, "The effect of enhancing temporal periodicity cues on cantonese tone recognition by cochlear implantees," International journal of audiology (online version) , vol.53, no.8, pp. 546-557, August 2014.
10. Anna Chi Shan Kam, Kwok Shun Leung, John Ka Keung Sung, Tan Lee, and Charles A. van Hasselt, "Evaluation of a self-administered tinnitus measurement system," in Proc. 8th International TRI Tinnitus Conference , March 2014.
Publications in 2013 2013
1. Feng Huang and Tan Lee, "Pitch estimation in noisy speech using accumulated peak spectrum and sparse estimation technique," IEEE/ACM Trans. on Audio Speech and Language Processing, vol.21, no.1, pp.99-109, Jan. 2013.
2. Haipeng Wang, Cheung-Chi Leung, Tan Lee, Bin Ma and Haizhou Li, "Shifted-delta MLP features for spoken language recognition," IEEE Signal Processing Letters, vol. 20, pp. 15-18, Jan 2013.
3. Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, and Haizhou Li, "Using parallel tokenizers with DTW matrix combination for low-resource spoken term detection," in Proc. ICASSP 2013, Vancouver, Canada, pp. 8545-8549, May 26-31, 2013.
4. Yu Ting Yeung, Tan Lee, Cheung-Chi Leung, "Using dynamic conditional random field on single-microphone speech separation," in Proc. ICASSP 2013, Vancouver, Canada, pp. 146-150, May 26-31, 2013.
5. Feng Huang, Yu Ting Yeung, Tan Lee, "Evaluation of pitch estimation algorithms on separated speech," in Proc. ICASSP 2013, Vancouver, Canada, pp. 6807-6811, May 26-31, 2013.
6. Yu Ting Yeung, Tan Lee, "Structured mean field method for single-microphone speech separation with factorial hidden markov model," in Proc. IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP 2013), pp. 122-126, Beijing, China, July 6-10, 2013.
7. Wang-Kong Lam, Tan Lee, "Chord classification of multi-instrumental music using exemplar-based sparse representation," in Proc. IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP 2013), pp. 113-117, Beijing, China, July 6-10, 2013.
8. Raymond W. M. Ng, Tan Lee, Cheung-Chi Leung, Bin Ma, and Haizhou Li, "Spoken language recognition with prosodic features," IEEE/ACM Trans. on Audio, Speech and Language Processing, vol.21, no.9, pp.1841-1853, September 2013.
9. Chi-Fong Chan, Shing Yu, Tan Lee, Manwa L. Ng, John Ka Keung Sung, "Investigation of pitch-related activities in surface electromyography (SEMG) of non-laryngeal neck muscles," in Proc. 6th WACBE World Congress on Bioengineering,pp.431-439, Beijing, China, Aug 5-8, 2013.
10. Huijun Ding, Tan Lee, Guo Dan," in Proc.Correlation analysis on objective evaluation and perceptual judgments for noise-suppressed speech signals with Chinese language," in Proc. 6th WACBE World Congress on Bioengineering,pp.517-522, Beijing, China, Aug 5-8, 2013.
11. Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, and Haizhou Li, "Unsupervised mining of acoustic subword units with segment-level Gaussian posteriorgrams," in Proc. Interspeech, Lyon, France, 25-29 August, 2013.
12. Haipeng Wang and Tan Lee, "The CUHK spoken web search system for mediaeval 2013," in Proc. MediaEval 2013 Workshop, Barcelona, Spain, 18-19 October, 2013. Available at: http://ceur-ws.org/vol-1043/mediaeval2013_submission_68.pdf.
13. Meng Yuan, Y. Sun, H. Feng, and Tan Lee, "A speech enhancement method for cochlear implant listeners," in Proc. Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC2013), pp.2036-2039, 2013.
14. Anna Chi Shan Kam, John Ka Keung Sung, Tan Lee, Terence Ka Cheong Wong, and Charles A. van Hasselt, "Clinical evaluation of a computerized self-administered hearing test" in Proc. 4th World Chinese Otorhinolaryngology Head & Neck Surgery Conference , organized by World Chinese Academy of Otorhinolaryngology Head & Neck Surgery, June 2013.
15. Tan Lee, Anthony Pak Hin Kong, Chi-Fong Chan, Haipeng Wang, "Analysis of auto-aligned and auto-segmented oral discourse by speakers with aphasia: a preliminary study on the acoustic parameter of duration" in Proc. Academy of Aphasia 2013 Annual Meeting, Lucerne, Switzerland, October 2013.
16. Manwa L. Ng, Nan Yan and Tan Lee, "Improving the sound quality of an electronic voice box," in Proc. 6th International Conference on Biomedical Engineering and Informatics (BMEI 2013), pp. 368-372, 2013.
Publications in 2012 2012
1. Feng Huang, Tan Lee, and W. Bastiaan Kleijn, "Transform-domain wiener filter for speech periodicity enhancement," in Proc. ICASSP 2012, pp. 4577-4580, Kyoto, Japan, March 25-30, 2012.
2. Feng Huang and Tan Lee, "Sparsity-based confidence measure for pitch estimation in noisy speech," in Proc. ICASSP 2012, pp. 4601-4604, Kyoto, Japan, March 25-30, 2012.
3. Haipeng Wang, Cheung-Chi Leung, Tan Lee, Bin Ma, Haizhou Li, "An acoustic segment modeling approach to query-by-example spoken term detection," in Proc. ICASSP 2012, pp. 5157-5160, Kyoto, Japan, March 25-30, 2012.
4. Yu Ting Yeung, Tan Lee, Cheung-Chi Leung, "Integrating multiple observations for model-based single-microphone Speech separation with conditional random fields," in Proc. ICASSP 2012, pp. 257-260, Kyoto, Japan, March 25-30, 2012.
5. Feng Huang and Tan Lee, "Robust pitch estimation using l1-regularized maximum likelihood estimation," in Proc. Interspeech 2012, Oregon, USA, Sept. 9-13, 2012.
6. Haipeng Wang and Tan Lee, "CUHK System for the spoken web search task at mediaeval 2012," in Proc. Working notes the MediaEval 2012 Workshop, Pisa, Italy, October 4-5, 2012, CEUR-WS.org, ISSN 1613-0073. pdf
7. Huijun Ding, Tan Lee, and Ing Yann Soon, "Two objective measures for speech distortion and noise reduction evaluation of enhanced speech signals," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP), pp. 117-121, Hong Kong, Dec. 5-8, 2012.
8. Ning Wang, P. C. Ching and Tan Lee, "Exploration of phase and vocal excitation modulation features for speaker recognition," in Proc. 7th Chinese Conference on Biometric Recognition (CCBR 2012), pp. 251-259, December 2012.
Publications in 2011 2011
1. Ning Wang, P. C. Ching, Nengheng Zheng and Tan Lee, "Robust speaker recognition using denoised vocal source and vocal tract features," IEEE/ACM Trans. on Audio, Speech and Language Processing, vol. 19, no. 1, pp. 196-205, January 2011.
2. Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma and Haizhou Li "Score fusion and calibration in multiple language detectors with large performance variation," in Proc. ICASSP 2011, pp. 4404-4407, Prague, Czech Republic, May 22-27, 2011.
3. Tan Lee and P. C. Ching, "Dealing with imperfections in human speech communication with advanced speech processing techniques," in Proc. International Symposium on Signals, Circuits and Systems 2011, Iasi, Romania, June 30-July 1, 2011.
4. Haipeng Wang, Tan Lee and Cheung-Chi Leung, "Unsupervised spoken term detection with acoustic segment model," in Proc. Oriental COCOSDA 2011, pp. 106-111, Hsinchu, Taiwan, October 26-28, 2011.
5. F. Huang, Tan Lee, and W. B. Kleijn, ""Transform-domain speech periodicity enhancement with adaptive coefficient weighting," in Proc. IEEE International Symposium on Intelligent Signal Process. and Communication Systems 2011, Tailand, December 7-9, 2011.
6. Nengheng Zheng, Tan Lee, Chun-Man Mak, "Model-based non-negative matrix factorization for single-channel speech separation," in Proc. IEEE International Conference on Signal Processing, Communications and Computing, pp. 385-388, Xi'an, China, 2011.
7. Nengheng Zheng, Yi Cai, Xia Li, Tan Lee, "Semi-blind speech and music separation based on non-negative matrix factorization and vector similarity," in Proc. National Conference on Man-Machine Speech Communication, Xi'an, China, 2011.
Publications in 2010 2010
1. Nengheng Zheng, Chao Qin, Tan Lee and P. C. Ching, "CU2C: A dual-condition Cantonese speech database for speaker recognition," in Proc. Computer Processing of Asian Spoken Languages, Shuichi Itahashi and Chiu-yu Tseng et al., eds., (Japan: Consideration Books, March 2010), pp.90-93.
2. Houwei Cao, Tan Lee and P. C. Ching, "Development of the Cantonese-English code-mixing speech corpora," in Proc. Computer Processing of Asian Spoken Languages, Shuichi Itahashi and Chiu-yu Tseng et al., eds., (Japan: Consideration Books, March 2010), pp. 204-207.
3. Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma and Haizhou Li "Prosodic attribute model for spoken language identification," in Proc. ICASSP 2010, Dallas, Texas, USA, pp. 5022-5025, April 14-19, 2010.
4. Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma and Haizhou Li, "An entropy-based approach for comparing prosodic properties in tonal and pitch accent languages," in Proc. Proc. Speech Prosody, Chicago, Illinois, USA, May 11-14, 2010.
5. Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma and Haizhou Li, "Detection target dependent score calibration for language recognition," in Proc. Speaker Odyssey, pp. 91-96, Brno, Czech Republic, June 28 - July 01, 2010.
6. Chun-Man Mak, Tan Lee, Suman Senapati, Yu-Ting Yeung and Wang-Kong Lam, "Similarity measures for Chinese pop music based on low-level audio digtal attributes," in Proc. the 11th International Society for Music Information Retrieval Conference (ISMIR 2010), pp. 513-518, Utrecht, Netherlands, Aug 9-13, 2010.
7. Feng Huang, Tan Lee, W. Bastiaan Kleijn "A method of speech periodicity enhancement based on transform-domain signal decomposition," in Proc. EUSIPCO 2010 , pp. 984-988, Aalborg, Denmark, August 23-27, 2010.
8. Yujia Li and Tan Lee, "Perception-based automatic approximation of F0 contours in Cantonese speech," in Proc. Interspeech 2010, pp. 1425-1428, Chiba, Japan, Sep. 2010.
9. Raymond W. M. Ng, Cheung-Chi Leung, Ville Hautamäki, Tan Lee, Bin Ma and Haizhou Li, "Towards long-range prosodic attribute modeling for language recognition," in Proc. Interspeech 2010, pp. 1792-1795,Chiba, Japan, Sep. 2010.
10. Houwei Cao, Tan Lee and P. C. Ching, "Cross-lingual speaker adaptation via Gaussian component mapping," in Proc. Interspeech 2010, pp. 869-872, Chiba, Japan, Sep. 2010.
11. Ning Wang, P. C. Ching, and Tan Lee, "Exploitation of phase information for speaker recognition," in Proc. Interspeech 2010, pp. 2126-2129, Chiba, Japan, Sep. 2010.
12. Feng Huang, Tan Lee, "Pitch estimation in noisy speech based on temporal accumulation of spectrum peaks," in Proc. Interspeech 2010, pp. 641-644, Chiba, Japan, Sep. 2010.
13. Yujia LI and Tan Lee, "Perception and analysis of linearly approximated F0 contours in Cantonese speech," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP) 2010, pp. 435-439, Tainan & Sun Moon Lack, Taiwan, nov. 2010.
14. Ning Wang, P. C. Ching, and Tan Lee, "Robust speaker verification using phase information of speech," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP) 2010, pp. 483-487, Tainan & Sun Moon Lack, Taiwan, nov. 2010.
15. Houwei Cao, P. C. Ching, Tan Lee, and Yu Ting Yeung, "Semantics-based language modeling for Cantonese-English code-mixing speech recognition," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP) 2010, pp. 246-250, Tainan & Sun Moon Lack, Taiwan, nov. 2010.
16. Chun-Man Mak, Tan Lee, and S.W. Lee, "Spectral trajectory estimation using non-negative matrix factorization for model-based monaural speech separation," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP) 2010, pp. 23-28, Tainan & Sun Moon Lack, Taiwan, nov. 2010.
17. Nengheng Zheng, Xia Li, Thierry Blu, and Tan Lee, "SURE-MSE speech enhancement for robust speech recognition," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP) 2010, pp. 271-274, Tainan & Sun Moon Lack, Taiwan, nov. 2010.
Publications in 2009 2009
1. Kevin C. P. Yuen, Meng Yuan, K. W. Pang, Tan Lee, Sigfrid D. Soli, Michael C. F. Tong, Charles A. van Hasselt, "Development of the computerized Cantonese Disyllabic Lexical Tone Identification Test in noise (CANDILET-N)," Cochlear Implants International, vol. 10 (Suppl 1), pp. 130-137, 2009.
2. Kevin C. P. Yuen, Lan Luan, Huan Li, Meng Yuan, Caogang Wei, Keli Cao, Tan Lee "Development of the computerized Mandarin pediatric lexical tone and disyllabic-word picture identification test in noise (MAPPID-N)," Cochlear Implants International, vol. 10 (Suppl 1), pp. 138-147, 2009.
3. Kevin C. P. Yuen, Meng Yuan, Tan Lee, Sigfrid D. Soli, Michael C. F. Tong, Charles A. van Hasselt, "Cantonese lexical tone recognition from frequency-specific temporal envelope and periodicity components in the same versus different noise band carriers," Cochlear Implants International, vol. 10 (Suppl 1), pp. 148-158, 2009.
4. Meng Yuan, Tan Lee, Kevin C. P. Yuen, Sigfrid Soli, Charles A. van Hasselt, and Michael C. F. Tong, "Cantonese tone recognition with enhanced temporal periodicity cues," Journal of Acoustical Society of America, vol. 126(1), pp. 327-337, 2009.
5. Houwei Cao, P. C. Ching and Tan Lee, "Effects of language mixing for automatic recognition of Cantonese-English code-mixing utterances," in Proc. 10th Annual Conference of the International Speech Communication Association (Interspeech 2009), pp. 3011-3014, Brighton, UK, September 6-10, 2009.
6. S. W. Lee, Frank K. Soong and Tan Lee, "Model-based speech separation: identifying transcription using orthogonality," in Proc. 10th Annual Conference of the International Speech Communication Association (Interspeech 2009), pp. 1343-1346, Brighton, UK, September 6-10, 2009.
7. Ning Wang, P. C. Ching and Tan Lee, "Exploration of vocal excitation modulation features for speaker recognition," in Proc. 10th Annual Conference of the International Speech Communication Association (Interspeech 2009), pp. 892-895, Brighton, UK, September 6-10, 2009.
8. Raymond W. M. Ng, Tan Lee, Cheung-Chi Leung, Bin Ma and Haizhou Li, "Analysis and selection of prosodic features for language identification," in Proc. International Conference on Asian Language Processing (IALP 2009), pp. 123-128, Singapore, December 7-9, 2009.
9. Raymond W. M. Ng, Tan Lee, Cheung-Chi Leung, Bin Ma and Haizhou Li, "Analysis and selection of prosodic features for Asian language recognition," International Journal of Asian Language Processing, vol. 19(4), pp. 139-152, 2009.
10. Joyce Y.C. Chan, Houwei Cao, P. C. Ching and Tan Lee, "Automatic recognition of Cantonese-English code-mixing speech," in Proc. International Journal of Computational Linguistics and Chinese Language Processing, vol.14, no.3, pp.281-304, September 2009.
Publications in 2008 2008
1. Wentao Gu and Tan Lee, "Effects of tone and emphatic focus on speech prosody - A comparison between standard Chinese and Cantonese," in Proc. 8th Phonetic Conference of China and the International Symposium on Phonetic Frontiers, Beijing, China, April 18-20, 2008.
2. Kevin C. P. Yuen, Lan Luan, Huan Li, Meng Yuan, Caogang Wei, Keli Cao, Tan Lee, "Computerized Mandarin pediatric lexical tone and disyllabic-word picture identification test in noise (MAPPID-N): development and standardization," in Proc. abstract presented at International Congress of Audiology (ICA2008), pp. 72, Hong Kong, June 8-12, 2008.
3. Yao Qian, Frank K Soong and Tan Lee, "Tone-enhanced generalized character posterior probability(GCPP) for Cantonese LVCSR," in Proc. Computer Speech and Language, vol. 22, no. 4 pp. 360-373, October, 2008.
4. Yu Ting Yeung, Houwei Cao, N. H. Zheng, Tan Lee and P. C. Ching, "Language modeling for speech recognition of spoken Cantonese," in Proc. Interspeech 2008, pp. 1570-1573, Brisbane, Australia, September 22-26 2008.
5. Yu Ting Yeung, Yao Qian, Tan Lee and Frank K. Soong, "Prosody for Mandarin speech recognition: a comparative study of read and spontaneous speech," in Proc. Interspeech 2008, pp. 1133-1136, Brisbane, Australia, September 22-26 2008.
6. Hoi-To Wai, S. W. Lee, Wang Kong Lam, and Tan Lee, "On pitch tracking and melody characterization for music signal analysis: A singing voice database," in Proc. Oriental COCOSDA2008, pp. 97-102, Kyoto, Japan, november 25-27, 2008.
7. Jiang Cao, Xiaojun Wu, Yu Ting Yeung, Tan Lee and Thomas Fang Zheng, "Automatic collecting of text data for Cantonese language modeling," in Proc. Oriental COCOSDA2008, pp. 130-134, Kyoto, Japan, november 25-27, 2008.
8. Wentao Gu, Tan Lee and P. C. Ching, "Prosodic variation in Cantonese-English code-mixed speech," in Proc. 2008 International Symposium on Chinese Spoken Language Processing, pp. 342-345, Kunming, China, December 16-19, 2008.
9. S. W. Lee, Frank K. Soong, P. C. Ching and Tan Lee, "Pitch tracking for model-based speech separation," in Proc. 2008 International Symposium on Chinese Spoken Language Processing, pp. 145-148, Kunming, China, December 16-19, 2008.
10. Y. J. Li and Tan Lee, "A perceptual study of approximated Cantonese tone contours," in Proc. 2008 International Symposium on Chinese Spoken Language Processing, pp. 49-52, Kunming, China, December 16-19, 2008.
11. Raymond W., M. Ng and Tan Lee, "Entropy-based analysis of the prosodic features of Chinese dialects," in Proc. 2008 International Symposium on Chinese Spoken Language Processing, pp. 65-68, Kunming, China, December 16-19, 2008.
12. Meng Yuan, Tan Lee and Sigfrid D. Soli, "Mandarin tone perception with temporal envelope and periodicity cues from different frequency regions," in Proc. 2008 International Symposium on Chinese Spoken Language Processing, pp. 338-341, Kunming, China, December 16-19, 2008.
13. Nengheng Zheng, Xia Li, Houwei Cao, Tan Lee and P. C. Ching, "Deriving MFCC parameters from the dynamic spectrum for robust speech recognition," in Proc. 2008 International Symposium on Chinese Spoken Language Processing, pp. 85-88, Kunming, China, December 16-19, 2008.
Publications in 2007 2007
1. Nengheng Zheng, Tan Lee and P. C. Ching, "Integration of complementary acoustic features for speaker recognition," IEEE Signal Processing Letters, vol. 14, no. 3, pp. 181-184, March 2007.
2. C. Yang, Frank K. Soong and Tan Lee, "Static and dynamic spectral features: Their noise robustness and optimal weights for ASR," IEEE/ACM Trans. on Audio, Speech and Language Processing, vol. 15, no. 3, pp. 1087-1097, March 2007.
3. Kevin C.P. Yuen, Meng Yuan, Tan Lee, Sigfrid Soli, Michael C.F. Tong, Charles A. van Hasselt, "Frequency-specific temporal envelope and periodicity components for lexical tone identification in Cantonese," Ear & Hearing, vol.28(2) Supplement, pp.107S - 113S, 2007
4. Yao Qian, Tan Lee and Frank K Soong, "Tone recognition in continuous Cantonese speech using supratone models," Journal of the Acoustical Society of America, vol. 121, pp. 2936-2945, May 2007.
5. W.N. Chan, Nengheng Zheng and Tan Lee, "Discrimination power of vocal source and vocal tract features for speaker recognition," IEEE/ACM Trans. on Audio, Speech and Language Processing, vol. 15, no. 6, pp. 1884-1892, August 2007.
6. Nengheng Zheng, Tan Lee, N. Wang and P. C. Ching, "Integrating of complementary features from vocal source and vocal tract for speaker identification," Computational Linguistics & Chinese Language Processing, vol. 12, no. 3, pp. 273-290, September 2007.
7. Jing Zhang and P. C. Ching, "Blind separation of moving speech sources using short-time LOD based ICA method," in Proc. ICASSP 2007, vol. III, pp. 957-960, Honolulu, Hawaii, USA, April 15-20, 2007.
8. Wentao Gu and Tan Lee, "Effects of focus on prosody of Cantonese speech - A comparison of surface feature analysis and model-based analysis," in Proc. International Workshop on Paralinguistic Speech - between Models and Data (ParaLing07), pp. 59-64, Saarbrücken, Germany, August 3, 2007.
9. Wentao Gu and Tan Lee, "Effects of tonal context and focus on Cantonese F0," in Proc. 16th International Congress of Phonetic Sciences, pp. 1033-1036, Saarbrücken, Germany, August 6-10, 2007.
10. Wentao Gu and Tan Lee, "Quantitative analysis of F0 contours of emotional speech of Mandarin," in Proc. 6th ISCA Speech Synthesis Workshop, pp. 228-233, Bonn, Germany, August 22-24, 2007.
11. Meng Yuan, Tan Lee, Kevin C. P. Yuen, Sigfrid D. Soli, Michael C. F. Tong and Charles A. van Hasselt, "Band-specific temporal periodicity enhancement for Cantonese tone perception with noise-excited vocoder," in Proc. 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, pp. 694-697, Lyon, France, August 23-26, 2007
12. S. W. Lee, Frank K. Soong and P. C. Ching, "Model-based speech separation with single-microphone input," in Proc. 10th European Conference on Speech Communication and Technology (Interspeech 2007), pp. 850-853, Antwerp, Belgium, August 27-31, 2007.
13. Wentao Gu, Rerrario Shui-Ching Ho, and Tan Lee, "Modeling tones in Hakka on the basis of the command-response model," in Proc. 10th European Conference on Speech Communication and Technology (Interspeech 2007), pp. 2633-2636, Antwerp, Belgium, August 27-31, 2007.
14. Hiroko Hirano, Keikichi Hirose, Goh Kawai, Wentao Gu, and nobuaki Minematsu, "F0 models show Chinese speakers of Japanese insert intonational boundaries and drop pitch," in Proc. 10th European Conference on Speech Communication and Technology (Interspeech 2007), pp. 1885-1888, Antwerp, Belgium, August 27-31, 2007.
15. Yujia Li and Tan Lee, "Perceptual equivalence of approximated Cantonese tone contours," in Proc. 10th European Conference on Speech Communication and Technology (Interspeech 2007), page 2677-2680, Antwerp, Belgium, August, 2007.
16. Houwei Cao, Tan Lee and P. C. Ching, "A study of pronunciation variation in Cantonese-English code-mixing speech," in Proc. Oriental COCOSDA2007, pp. 143-148, Hanoi, Vietnam, Dec. 4-6, 2007.
17. Ning Wang, P. C. Ching, N.H. Zheng and Tan Lee, "Robust speaker recognition using both vocal source and vocal tract features estimated from noisy input utterances," in Proc. 7th IEEE International Symposium on Signal Processing and Information Technology (ISSPIT 2007), pp. 772-777, Cairo, Egypt, Dec. 15-18, 2007.
18. Meng Yuan, Tan Lee, Kevin C. P. Yuen, Sigfrid D. Soli, Michael C. F. Tong, Charles A. van Hasselt, "F0-related periodicity enhancement in temporal envelope for Cantonese tone recognition," in Proc. Asia Pacific Symposium on Cochlear Implant and Related Sciences (APSCI2007), pp. 143-144, Sydney, Australia, Oct. 30 - nov. 2, 2007.
Publications in 2006 2006
1. Tan Lee and Yao Qian, "Tone modeling for speech recognition," Advances in Chinese Spoken Language Processing ed. by C.H.Lee, H. Li, L.S. Lee, R. Wang and Q. Huo, pp. 179-200, Singapore: Springer-Verlag, Dec, 2006
2. P. C. Ching, Tan Lee, W.K. Lo and Helen Meng, "Cantonese speech recognition and synthesis," Advances in Chinese Spoken Language Processing ed. by C.H.Lee, H. Li, L.S. Lee, R. Wang and Q. Huo, pp. 365-386, Singapore: Springer-Verlag, Dec, 2006
3. Y. Zhu and Tan Lee, "Using duration information in Cantonese connected-digit recognition," Computational Linguistics & Chinese Language Processing, vol. 11, no. 1, pp. 1 - 16, March 2006.
4. Tan Lee, P. Kam and Frank K. Soong, "Modeling Cantonese pronunciation variations for large-vocabulary continuous speech recognition," Computational Linguistics & Chinese Language Processing, vol. 11, no. 1, pp. 17 - 35, March 2006.
5. Meng Yuan, Tan Lee, P. C. Ching and Y. Zhu, "Speech recognition on DSP: Issues on computational efficiency and performance analysis," Microprocessors and Microsystems, vol. 30, Issue 3, pp. 155-164, May 2006.
6. Yujia Li, "Tone ratios combined with F0 register in Cantonese as speaker-dependent characteristic," in Proc. International Conference on Speech Prosody, vol. 1, pp. 169 - 172, Dresden, Germany, May 2-5, 2006
7. Helen MENG, P. C. Ching, Tan Lee, MAK Man Wai, MAK Brian, Moon Yiu Sang, Siu Man-hung, Tang Xiaoou, Hui Pak Sum Henry, Lee Pun Yuen Andrew, W.K. Lo, MA Bin and Sio Kok Tou, "The multi-biometric, multi-device and multilingual (M3) corpus," in Proc. Multimodal User Authentication (MMUA) Workshop 2006, 8 pgs. Toulouse, France, May 11, 2006
8. Yao Qian, Frank Soong and Tan Lee, "Tone-enhanced generalized character posterior probability(GCPP) For Cantonese LVCSR ," in Proc. ICASSP 2006, vol. I, pp. 133 - 136, Toulouse, France, May 14-19, 2006.
9. S.W. Lee, Frank K. Soong and P. C. Ching, "An iterative trajectory regeneration algorithm for separating mixed speech sources," in Proc. ICASSP 2006, vol. I, pp. 157 - 160, Philadelphia, Toulouse, France, May 14-19, 2006.
10. W.N. Chan, Tan Lee, N.H. Zheng and H. Ouyang, "Use of vocal source features in speaker segmentation," in Proc. ICASSP 2006, vol. I, pp. 657 - 660, Toulouse, France, May 14-19, 2006.
11. H. Ouyang, Tan Lee and W.N. Chan, "Feature extraction from talking mouths for video-based bi-modal speaker verification," in Proc. ICASSP 2006, vol. V, pp. 513 - 516, Toulouse, France, May 14-19, 2006.
12. Y.C. Chan, P. C. Ching, Tan Lee and Houwei Cao, "Automatic speech recognition of Cantonese-English code-mixing utterances," in Proc. 9th International Conference on Spoken Language Processing (Interspeech 2006 - ICSLP), pp. 113 - 116, Pennsylvania, USA, September 17-21, 2006.
13. Xin Lei, Manhung Siu, Mei-yuh Hwang, Mari Ostendorf and Tan Lee, "Improved tone modeling for Mandarin broadcast news speech recognition," in Proc. 9th International Conference on Spoken Language Processing (Interspeech 2006 - ICSLP), pp. 1237 - 1240, Pennsylvania, USA, September 17-21, 2006.
14. S. Zhang, P. C. Ching and Fan-rang Kong, "Automatic emotion recognition of speech signal in Mandarin," in Proc. 9th International Conference on Spoken Language Processing (Interspeech 2006 - ICSLP), pp.1810 - 1813, Pennsylvania, USA, September 17-21, 2006.
15. W. M. Ng, Tan Lee and W. Gu, "Towards automatic parameter extraction of command-response model for Cantonese," in Proc. 9th International Conference on Spoken Language Processing (Interspeech 2006 - ICSLP), pp.2358 - 2361, Pennsylvania, USA, September 17-21, 2006.
16. Houwei Cao, P. C. Ching, Tan Lee and Ning Wang, "An extended Cantonese-English code-mixing speech corpus: exCUMIX," in Proc. Oriental COCOSDA 2006, pp. 1-5, Penang, Malaysia, December 9-11, 2006.
17. Nengheng Zheng, Ning Wang, Tan Lee and P. C. Ching, "Speaker verification using complementary information from vocal source and vocal tract," in Proc. 2006 International Symposium on Chinese Spoken Language Processing, (ser. Lecture notes in Computer Science, Q. Huo, B. Ma, C. E. Siong, and H. Li, Eds., vol. 4274), vol. I, pp. 518-528, Springer, Singapore, December 2006.
18. Nengheng Zheng, P. C. Ching, Ning Wang and Tan Lee, "Integrating complementary features with a confidence measure for speaker identification," in Proc. 2006 International Symposium on Chinese Spoken Language Processing, (ser. Lecture notes in Computer Science, Q. Huo, B. Ma, C. E. Siong, and H. Li, Eds., vol. 4274), vol. I, pp. 549-557, Springer, Singapore, December 2006.
19. Sheng Zhang, P. C. Ching and Fanrang Kong, "Acoustic analysis of emotional speech in Mandarin Chinese," in Proc. 2006 International Symposium on Chinese Spoken Language Processing, vol. II, pp. 57-66, Singapore, December 2006.
20. Jing Zhang and P. C. Ching, "Short-time ICA for blind separation of noisy speech," in Proc. 2006 International Symposium on Chinese Spoken Language Processing, vol. II, pp. 258-267, Singapore, December 2006.
21. Meng Yuan, Kevin C. P. Yuen, Tan Lee, Sigfrid Soli, Michael C. F. Tong and Charles A. van Hasselt, "Frequency-specific expansion of temporal cues for lexical-tone identification in Cantonese," in Proc. International Hearing Aid Research Conference (IHCON), Lake Tahoe, California, August 16 - 20, 2006.
Publications in 2005 2005
1. C. Yang, Frank K. Soong and Tan Lee, "Static and dynamic spectral features: Their noise robustness and optimal weights for ASR," in Proc. ICASSP 2005, vol. I, pp.241 - 244, Philadelphia, PA, USA, March 2005.
2. Meng Yuan, Tan Lee and P. C. Ching, "Speech recognition on DSP: Issues on computational efficiency and performance analysis," in Proc. IEEE Conference on Communications, Circuits and Systems 2005, vol. II, pp. 852 - 856, HKUST, Hong Kong, May 2005.
3. S.W. Lee, Frank K. Soong and P. C. Ching, "Harmonic filtering for joint estimation of pitch and voiced source with single-microphone input," in Proc. 9th European Conference on Speech Communication and Technology, pp. 309 - 312, Lisboa, Portugal, September 2005.
4. Joyce Y.C. Chan, P. C. Ching and Tan Lee, "Development of Cantonese-English code-mixing speech corpus," in Proc. 9th European Conference on Speech Communication and Technology, pp. 1533 - 1536, Lisboa, Portugal, September 2005.
5. T.Y. Fung, Y.C. Chi, Eddie Sio, Icarus Lee, H. Meng and P. C. Ching, "Embedded Cantonese TTS for multi-device access to web content," in Proc. 9th European Conference on Speech Communication and Technology, pp. 2601 - 2604, Lisboa, Portugal, September 2005.
6. Hua Ouyang and Tan Lee, "A new lip feature representation method for video-based bimodal authentication," in Proc. 2005 NICTA-HCSNet Multimodal User Interaction Workshop, vol. 57, pp. 33 - 37, Sydney, Australia, 13-14 September 2005.
7. N.H. Zheng, Tan Lee and P. C.Ching, "Comparative analysis of discrimination power of the vocal source and vocal tract features for speaker verification," in Proc. 8th National Conference on Man Machine Speech Communication, pp. 210 - 213, Beijing, China, October 22-24, 2005.
8. C. Qin, Tan Lee and H. Meng, "On anti-model design for Cantonese verbal information verification," in Proc. 8th National Conference on Man Machine Speech Communication, pp. 375 - 378, Beijing, China, October 22-24, 2005.
9. YUEN Chi Pun, Meng Yuan, Tan LEE, SOLI Sigfrid, TONG Chi Fai Michael and VAN HASSELT Charles Andrew, "Frequency-specific temporal envelope and periodicity components for lexical tone identification in Cantonese," in Proc. 5th Asia Pacific Symposium on Cochlear Implant and Related Sciences, pp. 84, Hong Kong, 26th - 28th november 2005.
10. N.H. Zheng, C. Qin, Tan Lee and P. C.Ching, "CU2C: A dual-condition Cantonese speech database for speaker recognition applications," in Proc. 2005 International Conference on Speech Databases and Assessment (Oriental-COCOSDA 2005), pp. 67 - 72, Jakarta, Indonesia, December 6-8, 2005.
Publications in 2004 2004
1. W.K. Lo, Helen Meng and P. C. Ching, "Multi-scale spoken document retrieval for Cantonese broadcast news," International Journal on Speech Technology, vol. 7, iss. 2-3, pp. 203 - 219, April 2004.
2. Yujia Li, Tan Lee and Yao Qian, "Analysis and modeling of F0 contours for Cantonese text-to-speech," Journal of ACM Trans. on Asian Language Information Processing, vol. 3, iss. 3, pp. 169-180, September 2004.
3. Yujia Li, Tan Lee and Yao Qian, "F0 analysis and modeling for Cantonese text-to-speech," in Proc. International Conference on Speech Prosody, pp.467 - 470, Nara, Japan, March 2004.
4. Yao Qian, Tan Lee and Frank Soong "Use of tone information in continuous Cantonese speech recognition," in Proc. International Conference on Speech Prosody, pp.587 - 590, Nara, Japan, March 2004.
5. N.H. Zheng and P. C. Ching, "Using HAAR transform vocal source information for automatic speaker recognition," in Proc. ICASSP 2004, vol.I, pp.77 - 80, Montreal, Quebec, Canada, May 2004.
6. H. Meng, Y.C. Li, T.Y. Fung, K.F. Low, K.F. Chow, T.H. Lo, M.C. Ho and P. C. Ching "Bilingual Chinese/English voice browsing based on a voiceXML platform," in Proc. ICASSP 2004, vol.III, pp.769 - 772, Montreal, Quebec, Canada, May 2004.
7. S.W.Lee, P. C. Ching and Tan Lee, "noise-robust automatic speech recognition using mainlobe-resilient time-frequency quantile-based noise estimation," in Proc. IEEE International Symposium on Circuits and Systems, vol. III, pp.425 - 428, Vancouver, Canada, May 2004.
8. S.W. Lee and P. C.Ching, "In-phase feature induction: An effective compensation technique for robust speech recognition," in Proc. 8th International Conference on Spoken Language Processing, vol. I, pp.157 - 160, Jeji Island, Korea, October 2004.
9. Y. Zhu and Tan Lee, "Explicit duration modeling for Cantonese connected-digit recognition," in Proc. 8th International Conference on Spoken Language Processing, vol. I, pp.685 - 688, Jeji Island, Korea, October 2004.
10. Y. Qian, Tan Lee and Frank K. Soong, "Tone information as a confidence measure for improving Cantonese LVCSR," in Proc. 8th International Conference on Spoken Language Processing, vol. III,pp.1965 - 1968, Jeji Island, Korea, October 2004.
11. N.H. Zheng, P. C. Ching and Tan Lee, "Time frequency analysis of vocal source signal for speaker recognition," in Proc. 8th International Conference on Spoken Language Processing, vol. III, pp.2333 - 2336, Jeji Island, Korea, October 2004.
12. C. Yang, Frank K. Soong and Tan Lee, "On noise robustness of dynamic and static features for continuous Cantonese digit recognition," in Proc. 2004 International Symposium on Chinese Spoken Language Processing, pp.277 - 280, Hong Kong, December 2004.
13. Joyce Y.C. Chan, P. C. Ching, Tan Lee and H. Meng, "Detection of language boundary in code-switching utterances by bi-phone probabilities," in Proc. 2004 International Symposium on Chinese Spoken Language Processing, pp.293 - 296, Hong Kong, December 2004.
14. C. Qin and Tan Lee, "Cantonese verbal information verification system using GMM-based anti-model," in Proc. 2004 International Symposium on Chinese Spoken Language Processing, pp.297 - 300, Hong Kong, December 2004.
Publications in 2003 2003
1. W.K. Lo, Helen Meng and P. C. Ching, "Cross-language spoken document retrieval using HMM-based retrieval model with multi-scale fusion," IEEE/ACM Trans. on Asian Language Information Processing, vol.2, iss.1, pp.1 - 26, March 2003.
2. Tan Lee, Helen Meng, W.K. Lo and P. C. Ching, "The state of the art in human-computer speech-based interface technologies," HKIE Trans., vol.10, no. 4, pp. 50 - 61, December 2003.
3. H. Meng, T.H. Lo, C.K. Keung, M.C. Ho, W.K. Lo and P. C. Ching, "CU VOCAL web service: A text-to-speech synthesis web service for voice-enabled web-mediated applications," in Proc. the Twelfth International World Wide Web Conference, Budapest, Hungary, pp. 56 - 57, May 2003.
4. C.F. Chan, W.Han, K.W. Hon, Tan Lee, C.S. Choy, K.P. Pun and P. C. Ching, "An HMM-based speech Recognition IC," in Proc. IEEE International Symposium on Circuits and Systems, Bangkok, vol. II, pp.744 - 747, May 2003.
5. H. Meng, T.Y. Fung, Y.C. Li, M.C. Ho, T.H. Lo, C.K. Keung, W.K. Lo and P. C. Ching, "Recent enhancements in CU VOCAL for Chinese TTS-enabled applications," in Proc. 8th European Conference on Speech Communication and Technology, Geneva, Switzerland, pp. 1253 - 1256, September 2003.
6. Patgi Kam, Tan Lee and Frank Soong, "Modeling Cantonese pronunciation variation by acoustic model refinement," in Proc. 8th European Conference on Speech Communication and Technology, Geneva, Switzerland, pp.1477 - 1480, September 2003.
7. Yao Qian, Tan Lee and Yujia Li, "Overlapped di-tone modeling for tone recognition in continuous Cantonese speech," in Proc. 8th European Conference on Speech Communication and Technology, Geneva, Switzerland, pp.1845 - 1848, September 2003.
8. Wei Han, K.W. Hon, C.F. Chan, Tan Lee, C.S. Choy, K.P. Pun and P. C. Ching, "A real-time Chinese speech recognition IC with double mixtures," in Proc. 5th International Conference on ASIC, Beijing, China, pp.926 - 929, October 2003.
9. Y.C. Li, T.Y. Fung, Helen Meng, and P. C. Ching, "CU VOCAL: A Cantonese text-to-speech synthesizer," in Proc. 11th Annual Conference of the Hong Kong Institution of Science Park, Hong Kong SAR, november 2003.