Spoken language technology: automatic speech recognition, speaker recognition, language identification, keyword search
Speech and audio signal processing: speech enhancement, sound source separation, pitch estimation, music transcription
Rehabilitation technologies for communication disorders: hearing aids and cochlear implants, pathological speech analysis, speech and language assessment
Current Projects
Unsupervised speech modeling for low-resource languages
To be added.
Objective assessment of pathological voices
To be added.
Acoustical analysis of aphasia speech
To be added.
Computer-assisted assessment technology of speech, hearing and language disabilities
2024
Jingyu Li, Aemon Yat Fei Chiu and Tan Lee,"An Investigation of Reprogramming for Cross-Language Adaptation in Speaker Verification Systems,"in Proc. ISCSLP 2024, pp.390-394, Beijing, China, Nov. 7-10, 2024.
Yujia Xiao, Xi Wang, Xu Tan, Lei He, Xinfa Zhu, Sheng Zhao and Tan Lee,"Contrastive Context-Speech Pretraining for Expressive Text-to-Speech Synthesis,"in Proceedings of the 32nd ACM International Conference on Multimedia, pp. 2099-2107, Melbourne, Australia, Oct. 28 - Nov. 1, 2024.
S.-I. Ng, C.W.-Y. Ng, J. Wang and Tan Lee,"Automatic Detection of Speech Sound Disorder in Cantonese-Speaking Pre-School Children,"in IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 32, pp. 4355-4368, 2024.
Wei Liu, Jingyong Hou, Dong Yang, Muyong Cao and Tan Lee,"LUPET: Incorporating Hierarchical Information Path into Multilingual ASR",in Proc. Interspeech 2024, pp. 3979-3983, Kos, Greece, Sep. 1-5, 2024.
Wei Liu, Jingyong Hou, Dong Yang, Muyong Cao and Tan Lee,"A Parameter-efficient Language Extension Framework for Multilingual ASR",in Proc. Interspeech 2024, pp. 3929-3933, Kos, Greece, Sep. 1-5, 2024.
Dehua Tao, Tan Lee, Harold Chui and Sarah Luk,"Learning Representation of Therapist Empathy in Counseling Conversation Using Siamese Hierarchical Attention Network",in Proc. Interspeech 2024, pp. 1085-1089, Kos, Greece, Sep. 1-5, 2024.
J. Li and Tan Lee,"Efficient Black-Box Speaker Verification Model Adaptation With Reprogramming And Backend Learning",2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), pp. 12732-12736, Seoul, Korea, April 14-19, 2024.
D. Tao, Tan Lee, H. Chui and S. Luk,"Modeling Intrapersonal and Interpersonal Influences for Automatic Estimation of Therapist Empathy in Counseling Conversation",2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), pp. 12692-12696, Seoul, Korea, April 14-19, 2024.
W. Liu, Y. Qin, Z. Peng and Tan Lee,"Sparsely Shared Lora on Whisper for Child Speech Recognition",2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), pp. 11751-11755, Seoul, Korea, April 14-19, 2024.
Y. Tian, J. Li and Tan Lee,"Creating Personalized Synthetic Voices from Articulation Impaired Speech Using Augmented Reconstruction Loss",2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), pp. 11501-11505, Seoul, Korea, April 14-19, 2024.
2023
Y. Tian, W. Liu and Tan Lee,"Diffusion-Based Mel-Spectrogram Enhancement for Personalized Speech Synthesis with Found Data,"in 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Taipei, Taiwan, Dec. 16-20, 2023.
Si-Ioi Ng, Cymie Wing-Yee Ng and Tan Lee,"A Study on Using Duration and Formant Features in Automatic Detection of Speech Sound Disorder in Children,"in Proc.Interspeech 2023, Dublin, Ireland, pp. 4643-4647, Aug. 20-24, 2023.
Dehua Tao, Tan Lee, Harold Chui and Sarah Luk,"A Study on Prosodic Entrainment in Relation to Therapist Empathy in Counseling Conversation,"in Proc.Interspeech 2023, Dublin, Ireland, pp. 3662-3666, Aug. 20-24, 2023.
Wei Liu, Zhiyuan Peng and Tan Lee,"CoMFLP: Correlation Measure Based Fast Search on ASR Layer Pruning,"in Proc.Interspeech 2023, Dublin, Ireland, pp. 3282-3286, Aug. 20-24, 2023.
Jingyu Li, Wei Liu, Zhaoyang Zhang, Jiong Wang and Tan Lee,"Model Compression for DNN-based Speaker Verification Using Weight Quantization,"in Proc.Interspeech 2023, Dublin, Ireland, pp. 1988-1992, Aug. 20-24, 2023.
Yusheng Tian, Guangyan Zhang and Tan Lee,"Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models,"in Proc.Interspeech 2023, Dublin, Ireland, pp. 4893-4897, Aug. 20-24, 2023.
Yujia Xiao, Shaofei Zhang, Xi Wang, Xu Tan, Lei He, Sheng Zhao, Frank K. Soong and Tan Lee,"ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading,"in Proc.Interspeech 2023, Dublin, Ireland, pp. 4883-4887, Aug. 20-24, 2023.
Monira Islam and Tan Lee,"Functional Connectivity Analysis in Multi-channel EEG for Emotion Detection with Spatial-Temporal Features and 3D CNN,"45th Annual International Conference of the IEEE Engineering in Medicine Biology Society (EMBC), NSW, Sydney, Australia, 24-27 July, 2023.
Jonathan H.N. Lee, Eddie S.K. Chong, Harold Chui, Tan Lee, Sarah Luk, Dehua Tao and Nicolette W.T. Lee,"A curvilinear association between therapists’ use of discourse particles and therapist empathy in psychotherapy,"Journal of Counseling Psychology, 70(5), pp.562-570, July 2023.
Jingyu Li, Yusheng Tian and Tan Lee,"Convolution-Based Channel-Frequency Attention for Text-Independent Speaker Verification,"2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023), Rhodes Island, Greece, June 4-10, 2023.
Wei Liu, Kaiqi Fu, Xiaohai Tian, Shuju Shi, Wei Li, Zejun Ma and Tan Lee,"Leveraging phone-level linguistic-acoustic similarity for utterance-level pronunciation scoring,"2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023), Rhodes Island, Greece, June 4-10, 2023.
Wei Liu, Kaiqi Fu, Xiaohai Tian, Shuju Shi, Wei Li, Zejun Ma and Tan Lee,"An ASR-Free Fluency Scoring Approach with Self-Supervised Learning,"2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023), Rhodes Island, Greece, June 4-10, 2023.
Guangyan Zhang, Ying Qin, Wenjie Zhang, Jialun Wu, Mei Li, Yutao Gai, Feijun Jiang and Tan Lee,"iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis Based on Disentanglement Between Prosody and Timbre,"IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 31, pp.1693-1705, April 2023.
2022
Jonathan H.N. Lee, Harold Chui, Tan Lee, Sarah Luk, Dehua Tao and Nicolette W.T. Lee,"Formality in psychotherapy: How are therapists’ and clients’ use of discourse particles related to therapist empathy?"Frontiers in Psychiatry, Dec., 2022.
Dehua Tao, Harold Chui, Sarah Luk and Tan Lee,"CUEMPATHY: A Counseling Speech Dataset for Psychotherapy Research,"in Proc. ISCSLP 2022, pp.354-358, Singapore, Dec. 11-14, 2022.
Zhiyuan Peng, Xuanji He, Ke Ding, Tan Lee and Guanglu Wan,"Label-free Knowledge Distillation with Contrastive Loss for Light-weight Speaker Recognition,"in Proc. ISCSLP 2022, pp.324-328, Singapore, Dec. 11-14, 2022.
Daxin Tan, Liqun Deng, Nianzu Zheng, Y.T. Yeung, Xin Jiang, Xiao Chen and Tan Lee,"Correct Speech: A Fully Automated System for Speech Correction and Accent Reduction,"in Proc. ISCSLP 2022, pp.81-85, Singapore, Dec. 11-14, 2022.
Monira Islam and Tan Lee,"Wavelet based Emotion Detection from Multi-channel EEG using a Hybrid CNN-LSTM Model,"in Proc. TENCON 2022 - 2022 IEEE Region 10 Conference (TENCON), IEEE, Hong Kong, Nov. 01-04, 2022.
Dehua Tao, Tan Lee, Harold Chui and Sarah Luk,"Hierarchical Attention Network for Evaluating Therapist Empathy in Counseling Session,"in Proc.Interspeech 2022, pp.2008-2012, Incheon, Korea, Sept. 18-22, 2022.
Dehua Tao, Tan Lee, Harold Chui and Sarah Luk,"Characterizing Therapist's Speaking Style in Relation to Empathy in Psychotherapy,"in Proc.Interspeech 2022, pp.2003-2007, Incheon, Korea, Sept. 18-22, 2022.
Si-Ioi Ng, Cymie Wing-Yee Ng, Jiarui Wang and Tan Lee,"Automatic Detection of Speech Sound Disorder in Child Speech Using Posterior-based Speaker Representations,"in Proc.Interspeech 2022, pp.2853-2857, Incheon, Korea, Sept. 18-22, 2022.
Jingyu Li, Wei Liu and Tan Lee,"EDITnet: A Lightweight Network for Unsupervised Domain Adaptation in Speaker Verification,"in Proc.Interspeech 2022, pp.3694-3698, Incheon, Korea, Sept. 18-22, 2022.
Jonathan Him Nok Lee, Dehua Tao, Harold Chui, Tan Lee, Sarah Luk, Nicolette Wing Tung Lee and Koonkan Fung,"Durational Patterning at Discourse Boundaries in Relation to Therapist Empathy in Psychotherapy,"in Proc.Interspeech 2022, pp.5248-5252, Incheon, Korea, Sept. 18-22, 2022.
Yusheng Tian, Jingyu Li and Tan Lee,"Transport-Oriented Feature Aggregation for Speaker Embedding Learning,"in Proc.Interspeech 2022, pp.316-320, Incheon, Korea, Sept. 18-22, 2022.
Zhiyuan Peng, Xuanji He, Ke Ding, Tan Lee and Guanglu Wan,"Unifying Cosine and PLDA Back-ends for Speaker Verification,"in Proc.Interspeech 2022, pp.336-340, Incheon, Korea, Sept. 18-22, 2022.
Guangyan Zhang, Kaitao Song, Xu Tan, Daxin Tan, Yuzi Yan, Yanqing Liu, Gang Wang, Wei Zhou, Tao Qin, Tan Lee and Sheng Zhao,"Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech,"in Proc.Interspeech 2022, pp.456-460, Incheon, Korea, Sept. 18-22, 2022.
Daxin Tan, Guangyan Zhang and Tan Lee,"Environment Aware Text-to-Speech Synthesis,"in Proc.Interspeech 2022, pp.481-485, Incheon, Korea, Sept. 18-22, 2022.
Monira Islam, Tan Lee,"MEMD-HHT based Emotion Detection from EEG using 3D CNN,"44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC 2022), pp. 294-297, Scottish Event Campus, Glasgow, UK, July 11-15, 2022.
Monira Islam, Tan Lee,"Multivariate Empirical Mode Decomposition of EEG for Mental State Detection at Localized Brain Lobes,"44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC 2022), pp. 3759-3762, Scottish Event Campus, Glasgow, UK, July 11-15, 2022.
Rui-Si Ma, Si-Ioi Ng, Tan Lee, Yi-Jian Yang and Raymond Kim-Wai Sum,"Validation of a Speech Database for Assessing College Students' Physical Competence under The Concept of Physical Literacy,"International Journal of Environmental Research and Public Health 2022, vol. 19, no. 12, 7046, 2022.
Si-Ioi Ng, Rui-Si Ma, Tan Lee and Raymond Kim-Wai Sum,"Acoustical Analysis of Speech Under Physical Stress in Relation to Physical Activities and Physical Literacy,"Proc. Speech Prosody 2022, pp. 200-204, Lisbon, Portugal, May 23-26, 2022.
G.Y. Zhang, Y.C. Leng, D. Tan, Y. Qin, K.T. Song, X. Tan, S. Zhao and Tan Lee,"A Study on the Efficacy of Model Pre-Training In Developing Neural Text-to-Speech System,"2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022), pp. 6087-6091, Singapore, May 22-27, 2022.
Shuiyang Mao, P.C. Ching and Tan Lee,"Enhancing Segment-Based Speech Emotion Recognition by Iterative Self-Learning,"in IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 30, pp. 123-134, 2022.
2021
Hei-Yi Mak and Tan Lee,"Low-resource NMT: a case study on the written and spoken Languages in Hong Kong,"in Proceedings of the 5th International Conference on Natural Language Processing and Information Retrieval (NLPIR), Sanya, China, pp.81-87, Dec. 17-20, 2021.
Jingyu Li, Si-Ioi Ng and Tan Lee,"Improving Text-Independent Speaker Verification With Auxiliary Speakers Using Graph,"in 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Cartagena, pp. 198-205, Dec. 13-17, 2021.
W. Liu and Tan Lee,"Utterance-level neural confidence measure for end-to-end children speech recognition,"in 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Cartagena, pp. 449-456, Dec. 13-17, 2021.
D. Tan, L. Deng, Y.T. Yeung, X. Jiang, X. Chen and Tan Lee,"EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional Fusion,"in 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Cartagena, pp. 626-633, Dec. 13-17, 2021.
D. Tan and Tan Lee,"Fine-Grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement,"in Proc. Interspeech 2021, Brno, Czechia, pp. 4683-4687, Aug 30-Sept. 1, 2021.
Z. Peng, X. Li and Tan Lee,"Pairing Weak with Strong: Twin Models for Defending Against Adversarial Attack on Speaker Verification,"in Proc. Interspeech 2021, Brno, Czechia, pp. 4284-4288, Aug 30-Sept. 1, 2021.
G. Zhang, Y. Qin, D. Tan and Tan Lee,"Applying the Information Bottleneck Principle to Prosodic Representation Learning,"in Proc. Interspeech 2021, Brno, Czechia, pp. 3156-3160, Aug 30-Sept. 1, 2021.
S.-I. Ng, C.W.-Y Li, and Tan Lee,"Detection of Consonant Errors in Disordered Speech Based on Consonant-Vowel Segment Embedding,"in Proc. Interspeech 2021, Brno, Czechia, pp. 2931-2935, Aug 30-Sept. 1, 2021.
Xurong Xie, Xunying Liu, Tan Lee and Lan Wang,"Bayesian learning for deep neural network adaptation,"in IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 29, pp. 2096-2110, May 2021.
G.Y. Zhang, S.R. Qiu, Ying Qin and Tan Lee,"Estimating Mutual Information in Prosody Representation for Emotional Prosody Transfer in Speech Synthesis,"in Proc. ISCSLP 2021, Hong Kong, Jan 24-26, 2021.
Ying Qin, Yao Qian, A. Loukina, P. Lange, A. Misra, K. Evanini and Tan Lee,"Automatic Detection of Word-Level Reading Errors in Non-native English,"in Proc. ISCSLP 2021, Hong Kong, Jan 24-26, 2021.
2020
Si-Ioi Ng and Tan Lee,"Automatic Detection of Phonological Errors in Child Speech Using Siamese Recurrent Autoencoder,"in Proc. Interspeech 2020, Shanghai, China, pp. 4476-4480, Oct 25-29, 2020.
Guangyan Zhang, Ying Qin and Tan Lee,"Learning Syllable-Level Discrete Prosodic Representation for Expressive Speech Generation,"in Proc. Interspeech 2020, Shanghai, China, pp. 3426-3430, Oct 25-29, 2020.
Shuiyang Mao, P.C. Ching, C.-C. Jay Kuo and Tan Lee,"Advancing Multiple Instance Learning with Attention Modeling for Categorical Speech Emotion Recognition,"in Proc. Interspeech 2020, Shanghai, China, pp. 2357-2361, Oct 25-29, 2020.
Shuiyang Mao, P.C. Ching and Tan Lee,"EigenEmo: Spectral Utterance Representation Using Dynamic Mode Decomposition for Speech Emotion Classification,"in Proc. Interspeech 2020, Shanghai, China, pp. 2352-2356, Oct 25-29, 2020.
Jingyu Li and Tan Lee,"Text-Independent Speaker Verification with Dual Attention Network,"in Proc. Interspeech 2020, Shanghai, China, pp. 956-960, Oct 25-29, 2020.
Shuiyang Mao, P.C. Ching and Tan Lee,"Emotion Profile Refinery for Speech Emotion Classification,"in Proc. Interspeech 2020, Shanghai, China, pp. 531-535, Oct 25-29, 2020.
Si-Ioi Ng, Cymie Wing-Yee Ng, Jiarui Wang, Tan Lee, Kathy Y.S. Lee and Michael C.F. Tong,"CUCHILD: A Large-Scale Cantonese Corpus of Child Speech for Phonology and Articulation Assessment,"in Proc. Interspeech 2020, Shanghai, China, pp. 424-428, Oct 25-29, 2020.
Ying Qin, Y. Wu, Tan Lee and A.P.H. Kong,"An End-to-End Approach to Automatic Speech Assessment for Cantonese-speaking People with Aphasia,"Journal of Signal Processing Systems, vol.92, No.8, pp. 819-830, Aug. 2020.
Y.Z. Wu and Tan Lee,"Time-Frequency Feature Decomposition Based On Sound Duration For Acoustic Scene Classification,"2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), pp. 716-720, Virtual Barcelona, Spain, May 4-8, 2020.
Matthew K.-H. Ma, Tan Lee, Manson C.-M. Fong and William S.Y. Wang,"Resting-State EEG-Based Biometrics With Signals Features Extracted By Multivariate Empirical Mode Decomposition,"2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), pp. 991-995, Virtual Barcelona, Spain, May 4-8, 2020.
Z.Y. Peng, S.Y. Feng and Tan Lee,"Mixture Factorized Auto-Encoder For Unsupervised Hierarchical Deep Factorization Of Speech Signal,"2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), pp. 6769-6773, Virtual Barcelona, Spain, May 4-8, 2020.
Y. Qin, Tan Lee and A.P.H. Kong,"Automatic Assessment of Speech Impairment in Cantonese-speaking People with Aphasia,"IEEE Journal of Selected Topics in Signal Processing (JSTSP), vol. 14, no. 2, pp. 331-345, Feb. 2020.
2019
Shuiyang Mao, P.C. Ching, Tan Lee,
"Deep Learning of Segment-Level Feature Representation with Multiple Instance Learning for Utterance-Level Speech Emotion Recognition," in Proc. Interspeech 2019, Graz, Austria, pp. 1686-1690, Sept 15-19, 2019.
Jiarui Wang, Ying Qin, Zhiyuan Peng, Tan Lee,
"Child Speech Disorder Detection with Siamese Recurrent Network using Speech Attribute Features," in Proc. Interspeech 2019, Graz, Austria, pp. 3885-3889, Sept 15-19, 2019.
Xurong Xie, Xunying Liu, Tan Lee, Lan Wang,
"Fast DNN Acoustic Model Speaker Adaptation by Learning Hidden Unit Contribution Features," in Proc. Interspeech 2019, Graz, Austria, pp. 759-763, Sept 15-19, 2019.
Ying Qin, Tan Lee, Anthony Pak Hin Kong,
"Automatic Assessment of Language Impairment Based on Raw ASR Output," in Proc. Interspeech 2019, Graz, Austria, pp. 3078-3082, Sept 15-19, 2019.
Siyuan Feng, Tan Lee, Zhiyuan Peng,
"Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling," in Proc. Interspeech 2019, Graz, Austria, pp. 1093-1097, Sept 15-19, 2019.
Siyuan Feng, Tan Lee,
"Improving Unsupervised Subword Modeling via Disentangled Speech Representation Learning and Transformation," in Proc. Interspeech 2019, Graz, Austria, pp. 281-285, Sept 15-19, 2019.
Siyuan Feng, Tan Lee,
"Exploiting Cross-Lingual Speaker and Phonetic Diversity for Unsupervised Subword Modeling," IEEE/ACM Trans. Audio, Speech, Lang. Process., vol. 27, no. 12, pp. 2000-2011, 2019.
Yuanyuan Liu, Tan Lee, Thomas Law, Kathy Yuet-Sheung Lee,
"Acoustical Assessment of Voice Disorder With Continuous Speech Using ASR Posterior Features," IEEE/ACM Trans. Audio, Speech, Lang. Process., vol. 27, no. 6, pp. 1047-1059, 2019.
Shuiyang Mao, Dehua Tao, Guangyan Zhang, P.C. Ching, Tan Lee,
"Revisiting Hidden Markov Models for Speech Emotion Recognition," in Proc. ICASSP 2019, Brighton, UK, pp. 6715-6719, May 12-17, 2019.
Xurong Xie, Xunying Liu, Tan Lee, Lan Wang,
"BLHUC: Bayesian Learning of Hidden Unit Contributions for Deep Neural Network Speaker Adaptation," in Proc. ICASSP 2019, Brighton, UK, pp. 5711-5715, May 12-17, 2019.
Yuzhong Wu, Tan Lee,
"Enhancing Sound Texture in CNN-Based Acoustic Scene Classification," in Proc. ICASSP 2019, Brighton, UK, pp. 815-819, May 12-17, 2019.
Ying Qin, Tan Lee, Anthony Pak Hin Kong,
"Combining Phone Posteriorgrams from Strong and Weak Recognizers for Automatic Speech Assessment of People with Aphasia," in Proc. ICASSP 2019, Brighton, UK, pp. 6420-6424, May 12-17, 2019.
Zhiyuan Peng, Siyuan Feng, Tan Lee,
"Adversarial Multi-Task Deep Features and Unsupervised Back-End Adaptation for Language Recognition," in Proc. ICASSP 2019, Brighton, UK, pp. 5961-5965, May 12-17, 2019.
2018
Shuiyang Mao and P.C. Ching,
"An Effective Discriminative Learning Approach for Emotion-Specific Features Using Deep Neural Networks," in Proc. ICONIP 2018, Siem Reap, Cambodia, pp. 50-61, Dec 13-16, 2018.
Si Ioi Ng, Dehua Tao, Jiarui Wang, Yi Jiang, Wing Yee Ng, Tan Lee,
"An Automated Assessment Tool for Child Speech Disorders," in Proc. ISCSLP 2018, Taipei, Taiwan, pp. 493-494, Nov 26-29, 2018.
Yuanyuan Liu, Tan Lee, P. C. Ching, Thomas K. T. Law and Kathy Y. S. Lee,
"Prediction of Voice Disorder Severity: Contributions from Sustained Vowels and Continuous Speech," in Proc. ISCSLP 2018, Taipei, Taiwan, pp. 290-294, Nov 26-29, 2018.
Yuanyuan Liu, Ying Qin, Siyuan Feng, Tan Lee and P.C. Ching,
"Disordered Speech Assessment Using Kullback-Leibler Divergence Features with Multi-Task Acoustic Modeling," in Proc. ISCSLP 2018, Taipei, Taiwan, pp. 61-65, Nov 26-29, 2018.
Jiarui Wang, Si Ioi Ng, Dehua Tao, Wing Yee Ng and Tan Lee,
"A Study on Acoustic Modeling for Child Speech Based on Multi-Task Learning," in Proc. ISCSLP 2018, Taipei, Taiwan, pp. 389-393, Nov 26-29, 2018.
Xurong Xie, Xunying Liu, Tan Lee and Lan Wang,
"Investigation of Stacked Deep Neural Networks and Mixture Density Networks for Acoustic-to-Articulatory Inversion," in Proc. ISCSLP 2018, Taipei, Taiwan, pp. 36-40, Nov 26-29, 2018.
Ying Qin, Tan Lee, Yuzhong Wu and Anthony Pak Hin Kong,
"An End-to-End Approach to Automatic Speech Assessment for People with Aphasia," in Proc. ISCSLP 2018, Taipei, Taiwan, pp. 66-70, Nov 26-29, 2018.
Man-Ling Sung, Siyuan Feng and Tan Lee,
"Unsupervised Pattern Discovery from Thematic Speech Archives based on Multilingual Bottleneck Features," in Proc. APSIPA ASC 2018, Honolulu, USA, pp. 1448-1455, Nov 12-15, 2018.
Hansjörg Mixdorff, Albert Rilliard, Tan Lee, Matthew K. H. Ma and Angelika Hönemann,
"Cross-cultural (a)symmetries in audio-visual attitude perception," in Proc. Interspeech 2018, Hyderabad, India, pp. 426-430, Sept 2-6, 2018.
Ying Qin, Tan Lee, Siyuan Feng and Anthony Pak Hin Kong,
"Automatic speech assessment for people with aphasia using TDNN-BLSTM with multi-task learning," in Proc. Interspeech 2018, Hyderabad, India, pp. 3418-3422, Sept 2-6, 2018.
Siyuan Feng and Tan Lee,
"Improving cross-lingual knowledge transferability using multilingual TDNN-BLSTM with language-dependent pre-final layer," in Proc. Interspeech 2018, Hyderabad, India, pp. 2439-2443, Sept 2-6, 2018.
Siyuan Feng and Tan Lee,
"Exploiting speaker and phonetic diversity of mismatched language resources for unsupervised subword modeling," in Proc. Interspeech 2018, Hyderabad, India, pp. 2673-2677, Sept 2-6, 2018.
Hong Zhang, Mark Liberman and Tan Lee,
"Information structure and prosodic prominence: how does sentence final particle affect Cantonese intonation?," in Proc. Speech Prosody 2018, Poznań, Poland, pp. 903-907, June 13-16, 2018.
Tan Lee, Matthew K. H. Ma, Albert Rilliard, Hansjörg Mixdorff and Angelika Hönemann,
"Free labeling of audio-visual attitudinal expressions in Cantonese," in Proc. Speech Prosody 2018, Poznań, Poland, pp. 483-487, June 13-16, 2018.
Lei Xie, Tan Lee and Man-Wai Mak,
"Guest editorial: Advances in deep learning for speech processing," Journal of Signal Processing Systems, pp. 1-3, 2018.
Ying Qin, Tan Lee and Anthony Pak Hin Kong,
"Automatic speech assessment for aphasic patients based on syllable-level embedding and supra-segmental duration features," in Proc. ICASSP 2018, Calgary, Canada, pp. 5994-5998, April 15-20, 2018.
Yuzhong Wu and Tan Lee,
"Reducing model complexity for DNN based large-scale audio classification," in Proc. ICASSP 2018, Calgary, Canada, pp. 331-335, April 15-20, 2018.
2017
Anna Chi Shan Kam, John Ka Keung Sung, Tan Lee, Terence Ka Cheong Wong and Andrew van Hasselt,
"Improving mobile phone speech recognition by personalized amplification: Application in people with normal hearing and mild-to-moderate hearing loss," Ear and Hearing, vol. 38, no. 2, 2017.
Hansjörg Mixdorff, Angelika Hönemann, Albert Rilliard, Tan Lee and Matthew K. H. Ma,
"Audio-visual expressions of attitude: How many different attitudes can perceivers decode?," Speech Communication, vol. 95, pp. 114-126, December 2017.
Hansjörg Mixdorff, Angelika Hönemann, Albert Rilliard, Tan Lee and Matthew Ma,
"Cross-Language perception of audio-visual attitudinal expressions," in Proc. AVSP 2017, Stockholm, Sweden, August 25-26, 2017.
Siyuan Feng and Tan Lee,
"On the linguistic relevance of speech units learned by unsupervised acoustic modeling," in Proc. Interspeech 2017, Stockholm, Sweden, pp. 2068-2072, August 20-24, 2017.
Xurong Xie, Xunying Liu, Tan Lee and Lan Wang,
"RNN-LDA clustering for feature based DNN adaptation," in Proc. Interspeech 2017, Stockholm, Sweden, pp. 2396-2400, August 20-24, 2017.
Yuanyuan Liu, Tan Lee, P. C. Ching, Thomas K. T. Law and Kathy Y. S. Lee,
"Acoustic assessment of disordered voice with continuous speech based on utterance-level ASR posterior features," in Proc. Interspeech 2017, Stockholm, Sweden, pp. 2680-2684, August 20-24, 2017.
Lufei Gao, Li Su, Yi-Hsuan Yang, Tan Lee,
"Polyphonic piano note transcription with non-negative matrix factorization of differential spectrogram," in Proc. ICASSP 2017, New Orleans, USA, pp. 291-295, March 4-9, 2017.
Raymond W. M. Ng, Alvin C.M. Kwan, Tan Lee, Thomas Hain,
"SHEFCE: A Cantonese-English bilingual speech corpus for pronunciation assessment," in Proc. ICASSP 2017, New Orleans, USA, pp. 5825-5829, March 4-9, 2017.
2016
Anna Chi Shan Kam, John Ka Keung Sung, Tan Lee, Terence Ka Cheong Wong, Andrew van Hasselt,
"Improving mobile phone speech recognition by personalized amplification: application in people with normal hearing and mild-to-moderate hearing loss," Ear and Hearing, online version available from November 2, 2016.
Ying Qin, Tan Lee, Anthony Pak Hin Kong, Sam Po Law,
"Towards automatic assessment of aphasia speech using automatic speech recognition techniques," in Proc. International Symposium on Chinese Spoken Language Processing, Tianjin, China, October 17-20, 2016.
Siyuan Feng, Tan Lee, Haipeng Wang,
"Exploiting language-mismatched phoneme recognizers for unsupervised acoustic modeling," in Proc. International Symposium on Chinese Spoken Language Processing, Tianjin, China, October 17-20, 2016.
Tan Lee, Yuanyuan Liu, Yu Ting Yeung, Thomas K. T. Law, Kathy Y. S. Lee,
"Predicting severity of voice disorder from DNN-HMM acoustic posteriors," in Proc. Interspeech 2016, San Francisco, USA, September 8-12, 2016.
Jen-Tzung Chien, Pei-Wen Huang, Tan Lee,
"Hybrid accelerated optimization for speech recognition," in Proc. Interspeech 2016, San Francisco, USA, September 8-12, 2016.
Tan Lee, Yuanyuan Liu, Pei-Wen Huang, Jen-Tzung Chien, Wang Kong Lam, Yu Ting Yeung, Thomas K. T. Law, Kathy Y. S. Lee, Anthony Pak Hin Kong and Sam Po Law,
"Automatic speech recognition for acoustical analysis and assessment of Cantonese pathological voice and speech," in Proc. ICASSP 2016, Shanghai, China, March 20-25, 2016.
2015
Yu Ting Yeung, Tan Lee, Cheung-Chi Leung,
"Supervised single-microphone multi-talker speech separation with conditional random fields," IEEE/ACM Trans. on Audio, Speech, and Language Processing, vol. 23, no. 12, pp. 2334-2342, December 2015.
Tan Lee, Wang Kong Lam, Anthony Pak Hin Kong, Sam Po Law,
"Analysis of intonation patterns in Cantonese aphasia speech," in Proc. International Conference Oriental COCOSDA (O-COCOSDA/CASLRE), pp. 86-89, Shanghai, China, October 28-30, 2015.
Lufei Gao and Tan Lee,
"Multi-pitch estimation based on sparse representation with pre-screened dictionary," in Proc. IEEE 17th International Workshop on Multimedia Signal Processing (MMSP), Xiamen, China, October 19-21, 2015.
Tan Lee, Anthony Pak Hin Kong, Wang-Kong Lam,
"Measuring prosodic deficits in oral discourse by speakers with fluent aphasia," Frontiers in Psychology, Conference Abstract of Academy of Aphasia 53rd Annual Meeting, September 2015.
Shing Yu, Tan Lee, Manwa L. Ng,
"Surface electromyographic activity of extrinsic laryngeal muscles in Cantonese tone production," Journal of Signal Processing Systems, first online: 11 July 2015.
Chun Hoy Wong, Tan Lee, Yu Ting Yeung, P. C. Ching,
"Modeling temporal dependency for robust estimation of LP model parameters in speech enhancement," in Proc. Interspeech 2015, Dresden, Germany, September 6-10, 2015.
Huijun Ding, Tan Lee, Ing Yann Soon, Chai Kiat Yeo, Peng Dai and Guo Dan,
"Objective measures for quality assessment of noise-suppressed speech," Speech Communication, vol. 71, pp. 62-63, July 2015.
Feng Huang, Tan Lee, W. Bastiaan Kleijn and Ying-Yee Kong,
"A method of speech periodicity enhancement using transform-domain signal decomposition," Speech Communication, vol. 67, pp. 102-112, March 2015.
Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li,
"Acoustic segment modeling with spectral clustering methods," IEEE/ACM Trans. on Audio, Speech and Language Processing, vol. 23, no. 2, pp. 264-277, February 2015.
2014
Chi Shan Kam, John Ka Keung Sung, Tan Lee, Terence Ka Cheong Wong and C. A. van Hasselt,
"Improving mobile phone perception by implementing automated customized enhanced technology - application in people with and without hearing loss," in Proc. Hong Kong Speech and Hearing Symposium, October 2014.
Nan Yan, Manwa L. Ng, and Tan Lee,
"Improving the sound quality of an electronic voice box," in Proc. Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), Siem Reap, Cambodia, December 9-12, 2014.
Wang-Kong Lam and Tan Lee,
"Correcting chord classification errors based on tonal organization information of classical," in Proc. IEEE International Symposium on Multimedia (ISM 2014), Taichung, Taiwan, December 10-12, 2014.
Wang-Kong Lam and Tan Lee,
"Automatic key partition based on tonal organization information of classical music," in Proc. 15th International Society for Music Information Retrieval Conference (ISMIR 2014), Taipei, Taiwan, October 27-31, 2014.
Yu Ting Yeung, Tan Lee, and Cheung-Chi Leung,
"Large-margin conditional random fields for single-microphone speech separation," in Proc. Interspeech 2014, pp. 983-987, Singapore, September 14-18, 2014.
Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma and Haizhou Li,
"A graph-based Gaussian component clustering approach to unsupervised acoustic modeling," in Proc. Interspeech 2014, pp. 875-879, Singapore, September 14-18, 2014.
Feng Huang and Tan Lee,
"Multipitch tracking based on linear programming relaxation and sparsity-based pitch candidate estimation," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP), pp. 331-335, Singapore, September 12-14, 2014.
Shing Yu, Tan Lee, and Manwa L. Ng,
"Surface electromyographic activity of non-laryngeal neck muscles in Cantonese tone production," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP), pp. 304-307, Singapore, September 12-14, 2014.
Tan Lee, Shing Yu, Meng Yuan, Terence Ka Cheong Wong, and Ying-Yee Kong,
"The effect of enhancing temporal periodicity cues on Cantonese tone recognition by cochlear implantees," International Journal of Audiology (online version), vol. 53, no. 8, pp. 546-557, August 2014.
Anna Chi Shan Kam, Kwok Shun Leung, John Ka Keung Sung, Tan Lee, and Charles A. van Hasselt,
"Evaluation of a self-administered tinnitus measurement system," in Proc. 8th International TRI Tinnitus Conference, March 2014.
2013
Feng Huang and Tan Lee,
"Pitch estimation in noisy speech using accumulated peak spectrum and sparse
estimation technique," IEEE/ACM Trans. on Audio Speech and Language Processing,
vol.21, no.1, pp.99-109, Jan. 2013.
Haipeng Wang, Cheung-Chi Leung, Tan Lee, Bin Ma and Haizhou Li,
"Shifted-delta MLP features for spoken language recognition," IEEE Signal Processing Letters,
vol. 20, pp. 15-18, Jan 2013.
Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, and Haizhou Li,
"Using parallel tokenizers with DTW matrix combination for low-resource spoken
term detection," in Proc.ICASSP 2013, Vancouver, Canada,
pp. 8545-8549, May 26-31, 2013.
Yu Ting Yeung, Tan Lee, Cheung-Chi Leung,
"Using dynamic conditional random field on single-microphone speech separation," in Proc.
ICASSP 2013, Vancouver, Canada, pp. 146-150, May 26-31, 2013.
Feng Huang, Yu Ting Yeung, Tan Lee,
"Evaluation of pitch estimation algorithms on separated speech,"
in Proc.
ICASSP 2013, Vancouver, Canada, pp. 6807-6811, May 26-31, 2013.
Yu Ting Yeung, Tan Lee,
"Structured mean field method for single-microphone speech separation with
factorial hidden markov model," in Proc.IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP 2013), pp. 122-126, Beijing, China, July 6-10, 2013.
Wang-Kong Lam, Tan Lee,
"Chord classification of multi-instrumental music using exemplar-based sparse
representation," in Proc. IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP 2013), pp. 113-117, Beijing, China, July 6-10, 2013.
Raymond W. M. Ng, Tan Lee, Cheung-Chi Leung, Bin Ma, and Haizhou Li, "Spoken language recognition with prosodic features,"
IEEE/ACM
Trans. on Audio, Speech and Language Processing, vol.21, no.9, pp.1841-1853, September 2013.
Chi-Fong Chan, Shing Yu, Tan Lee, Manwa L. Ng, John Ka Keung Sung,
"Investigation of pitch-related activities in surface electromyography (SEMG) of
non-laryngeal neck muscles," in Proc. 6th WACBE World Congress on Bioengineering, pp. 431-439, Beijing, China, Aug 5-8, 2013.
Huijun Ding, Tan Lee, Guo Dan,
"Correlation analysis on objective evaluation and
perceptual judgments for noise-suppressed speech signals with Chinese language,"
in Proc. 6th WACBE World Congress on Bioengineering, pp. 517-522, Beijing, China, Aug 5-8, 2013.
Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, and Haizhou Li,
"Unsupervised mining of acoustic subword units with segment-level Gaussian
posteriorgrams," in Proc. Interspeech, Lyon, France, 25-29 August, 2013.
Meng Yuan, Y. Sun, H. Feng, and Tan Lee,
"A speech enhancement method for cochlear implant listeners," in
Proc. Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC2013), pp.2036-2039, 2013.
Anna Chi Shan Kam, John Ka Keung Sung, Tan Lee, Terence Ka Cheong Wong, and Charles A. van Hasselt, "Clinical evaluation of a computerized self-administered hearing test,"
in Proc. 4th World Chinese Otorhinolaryngology Head & Neck Surgery Conference,
organized by World Chinese Academy of Otorhinolaryngology Head & Neck Surgery, June 2013.
Tan Lee, Anthony Pak Hin Kong, Chi-Fong Chan, Haipeng Wang, "Analysis of auto-aligned and auto-segmented oral discourse by speakers with aphasia: a preliminary study on the acoustic parameter of duration," in Proc. Academy of Aphasia 2013 Annual Meeting, Lucerne, Switzerland, October 2013.
Manwa L. Ng, Nan Yan and Tan Lee,
"Improving the sound quality of an electronic voice box," in Proc. 6th International Conference on Biomedical Engineering and Informatics (BMEI 2013), pp. 368-372, 2013.
2012
Feng Huang, Tan Lee, and W. Bastiaan Kleijn,
"Transform-domain Wiener filter for speech periodicity enhancement," in Proc.
ICASSP 2012, pp. 4577-4580, Kyoto, Japan, March 25-30, 2012.
Feng Huang and Tan Lee,
"Sparsity-based confidence measure for pitch estimation in noisy speech," in Proc.
ICASSP 2012, pp. 4601-4604, Kyoto, Japan, March 25-30, 2012.
Haipeng Wang, Cheung-Chi Leung, Tan Lee, Bin Ma, Haizhou Li,
"An acoustic segment modeling approach to query-by-example spoken term
detection," in Proc. ICASSP 2012, pp. 5157-5160, Kyoto, Japan, March 25-30, 2012.
Yu Ting Yeung, Tan Lee, Cheung-Chi Leung,
"Integrating multiple observations for model-based single-microphone speech
separation with conditional random fields," in Proc. ICASSP 2012,
pp. 257-260, Kyoto, Japan, March 25-30, 2012.
Feng Huang and Tan Lee,
"Robust pitch estimation using l1-regularized maximum likelihood estimation," in
Proc. Interspeech 2012, Oregon, USA, Sept. 9-13, 2012.
Haipeng Wang and Tan Lee,
"CUHK System for the spoken web search task at MediaEval 2012," in Proc. Working
Notes of the MediaEval 2012 Workshop, Pisa, Italy, October 4-5, 2012, CEUR-WS.org, ISSN 1613-0073.
Huijun Ding, Tan Lee, and Ing Yann Soon,
"Two objective measures for speech distortion and noise reduction evaluation of
enhanced speech signals," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP), pp. 117-121,
Hong Kong, Dec. 5-8, 2012.
Ning Wang, P. C. Ching and Tan Lee,
"Exploration of phase and vocal excitation modulation features for speaker
recognition," in Proc. 7th Chinese Conference on Biometric Recognition (CCBR 2012), pp. 251-259, December 2012.
2011
Ning Wang, P. C. Ching, Nengheng Zheng and Tan Lee,
"Robust speaker recognition using denoised vocal source and vocal tract
features," IEEE/ACM Trans. on Audio, Speech and Language Processing,
vol. 19, no. 1, pp. 196-205, January 2011.
Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma and Haizhou Li,
"Score fusion and calibration in multiple language detectors with large
performance variation," in Proc. ICASSP 2011, pp. 4404-4407, Prague, Czech Republic, May 22-27, 2011.
Tan Lee and P. C. Ching,
"Dealing with imperfections in human speech communication with advanced
speech processing techniques," in Proc. International Symposium on Signals, Circuits and Systems 2011, Iasi, Romania, June 30-July 1, 2011.
Haipeng Wang, Tan Lee and Cheung-Chi Leung,
"Unsupervised spoken term detection with acoustic segment model," in Proc. Oriental COCOSDA 2011, pp. 106-111, Hsinchu, Taiwan, October 26-28, 2011.
F. Huang, Tan Lee, and W. B. Kleijn,
"Transform-domain speech periodicity enhancement with adaptive coefficient
weighting," in Proc. IEEE
International Symposium on Intelligent Signal Processing and Communication Systems 2011,
Thailand, December 7-9, 2011.
Nengheng Zheng, Tan Lee, Chun-Man Mak, "Model-based
non-negative matrix factorization for single-channel speech separation," in
Proc. IEEE International Conference on Signal Processing, Communications and Computing, pp. 385-388, Xi'an, China, 2011.
Nengheng Zheng, Yi Cai, Xia Li, Tan Lee,
"Semi-blind speech and music separation based on non-negative matrix
factorization and vector similarity," in Proc. National Conference on Man-Machine Speech Communication, Xi'an, China, 2011.
2010
Nengheng Zheng, Chao Qin, Tan Lee and P. C. Ching,
"CU2C: A dual-condition Cantonese speech database for speaker recognition,"
in Proc. Computer Processing of Asian Spoken Languages, Shuichi Itahashi and Chiu-yu Tseng et al., eds., (Japan: Consideration Books, March 2010), pp. 90-93.
Houwei Cao, Tan Lee and P. C. Ching,
"Development of the Cantonese-English code-mixing speech corpora," in Proc. Computer Processing of Asian Spoken Languages, Shuichi Itahashi and Chiu-yu Tseng et al., eds., (Japan: Consideration Books, March 2010), pp. 204-207.
Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma and Haizhou Li,
"Prosodic attribute model for spoken language identification," in Proc. ICASSP 2010, Dallas, Texas, USA, pp. 5022-5025, April 14-19, 2010.
Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma and Haizhou Li,
"An entropy-based approach for comparing prosodic properties in tonal
and pitch accent languages," in Proc. Speech Prosody, Chicago, Illinois, USA, May 11-14, 2010.
Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma and Haizhou Li,
"Detection target dependent score calibration for language
recognition," in Proc. Speaker Odyssey, pp. 91-96, Brno, Czech Republic, June 28 - July 01, 2010.
Chun-Man Mak, Tan Lee, Suman Senapati, Yu-Ting Yeung and Wang-Kong Lam,
"Similarity measures for Chinese pop music based on low-level audio
digital attributes," in Proc. 11th International Society for Music Information Retrieval Conference (ISMIR 2010), pp. 513-518, Utrecht, Netherlands, Aug 9-13, 2010.
Feng Huang, Tan Lee, W. Bastiaan Kleijn,
"A method of speech periodicity enhancement based on transform-domain
signal decomposition," in Proc. EUSIPCO 2010, pp. 984-988, Aalborg, Denmark, August 23-27, 2010.
Yujia Li and Tan Lee,
"Perception-based automatic approximation of F0 contours in Cantonese speech,"
in Proc. Interspeech 2010, pp. 1425-1428, Chiba, Japan, Sep. 2010.
Raymond W. M. Ng, Cheung-Chi Leung, Ville Hautamäki, Tan Lee, Bin Ma and Haizhou Li,
"Towards long-range prosodic attribute modeling for language recognition," in Proc. Interspeech 2010, pp. 1792-1795, Chiba, Japan, Sep. 2010.
Houwei Cao, Tan Lee and P. C. Ching,
"Cross-lingual speaker adaptation via Gaussian component mapping," in Proc. Interspeech 2010, pp. 869-872, Chiba, Japan, Sep. 2010.
Ning Wang, P. C. Ching, and Tan Lee,
"Exploitation of phase information for speaker recognition," in Proc. Interspeech 2010, pp. 2126-2129, Chiba, Japan, Sep. 2010.
Feng Huang, Tan Lee,
"Pitch estimation in noisy speech based on temporal accumulation of spectrum
peaks," in Proc. Interspeech 2010, pp. 641-644, Chiba, Japan, Sep. 2010.
Yujia Li and Tan Lee,
"Perception and analysis of linearly approximated F0 contours in Cantonese
speech," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP) 2010, pp. 435-439, Tainan & Sun Moon Lake, Taiwan,
Nov. 2010.
Ning Wang, P. C. Ching, and Tan Lee,
"Robust speaker verification using phase information of speech," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP) 2010, pp. 483-487, Tainan & Sun Moon Lake, Taiwan,
Nov. 2010.
Houwei Cao, P. C. Ching, Tan Lee, and Yu Ting Yeung,
"Semantics-based language modeling for Cantonese-English code-mixing speech
recognition," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP) 2010, pp. 246-250, Tainan & Sun Moon Lake, Taiwan,
Nov. 2010.
Chun-Man Mak, Tan Lee, and S.W. Lee,
"Spectral trajectory estimation using non-negative matrix factorization for
model-based monaural speech separation," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP) 2010, pp. 23-28, Tainan & Sun Moon Lake, Taiwan,
Nov. 2010.
Nengheng Zheng, Xia Li, Thierry Blu, and Tan Lee,
"SURE-MSE speech enhancement for robust speech recognition," in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP) 2010, pp. 271-274, Tainan & Sun Moon Lake, Taiwan,
Nov. 2010.
2009
Kevin C. P. Yuen,
Meng Yuan, K. W. Pang, Tan Lee, Sigfrid D. Soli, Michael C. F. Tong, Charles A. van
Hasselt, "Development of the computerized Cantonese Disyllabic Lexical
Tone Identification Test in noise (CANDILET-N)," Cochlear Implants International, vol. 10 (Suppl 1), pp. 130-137, 2009.
Kevin C. P. Yuen, Lan Luan, Huan Li,
Meng Yuan, Caogang Wei, Keli Cao, Tan Lee, "Development of the computerized Mandarin pediatric lexical tone and
disyllabic-word picture identification test in noise (MAPPID-N)," Cochlear Implants International, vol. 10 (Suppl 1), pp. 138-147, 2009.
Kevin C. P. Yuen,
Meng Yuan, Tan Lee, Sigfrid D. Soli, Michael C. F. Tong, Charles A. van Hasselt,
"Cantonese lexical tone recognition from frequency-specific temporal
envelope and periodicity components in the same versus different noise band
carriers," Cochlear Implants International, vol. 10 (Suppl 1), pp. 148-158, 2009.
Meng Yuan, Tan Lee,
Kevin C. P. Yuen, Sigfrid Soli, Charles A. van Hasselt, and Michael C. F. Tong,
"Cantonese tone recognition with enhanced temporal periodicity
cues," Journal of the Acoustical Society
of America, vol. 126(1), pp. 327-337, 2009.
Houwei Cao, P. C. Ching
and Tan Lee, "Effects of
language mixing for automatic recognition of Cantonese-English code-mixing
utterances," in Proc. 10th Annual Conference of the International
Speech Communication Association (Interspeech 2009), pp. 3011-3014,
Brighton, UK, September 6-10, 2009.
S. W. Lee, Frank
K. Soong and Tan Lee,
"Model-based speech separation: identifying transcription using
orthogonality,"
in Proc. 10th Annual Conference of the International Speech
Communication Association (Interspeech 2009), pp. 1343-1346, Brighton, UK,
September 6-10, 2009.
Ning Wang, P. C. Ching
and Tan Lee,
"Exploration of vocal excitation modulation features for speaker
recognition,"
in Proc. 10th Annual Conference of the International Speech
Communication Association (Interspeech 2009), pp. 892-895, Brighton, UK,
September 6-10, 2009.
Raymond W. M. Ng,
Tan Lee, Cheung-Chi Leung,
Bin Ma and Haizhou Li, "Analysis and selection of prosodic features for language
identification," in Proc. International Conference on Asian Language Processing (IALP
2009), pp. 123-128, Singapore, December 7-9, 2009.
Raymond W. M. Ng,
Tan Lee, Cheung-Chi Leung,
Bin Ma and Haizhou Li, "Analysis and selection of prosodic features for Asian
language recognition," International Journal of Asian Language Processing,
vol. 19(4), pp. 139-152, 2009.
Joyce Y.C. Chan, Houwei Cao, P. C. Ching and Tan Lee,
"Automatic recognition of Cantonese-English code-mixing speech," International Journal of Computational Linguistics and Chinese Language Processing,
vol.14, no.3, pp.281-304, September 2009.
2008
Wentao Gu and Tan Lee,
"Effects of tone and emphatic focus on speech
prosody - A comparison between standard Chinese and
Cantonese," in Proc. 8th Phonetic Conference of China and
the International Symposium on Phonetic Frontiers, Beijing, China, April
18-20, 2008.
Kevin C. P. Yuen, Lan Luan, Huan Li,
Meng Yuan, Caogang Wei, Keli Cao,
Tan Lee,
"Computerized Mandarin pediatric lexical tone and disyllabic-word picture
identification test in noise (MAPPID-N): development and standardization,"
abstract presented at the International Congress of Audiology (ICA2008), p. 72,
Hong Kong, June 8-12, 2008.
Yao Qian,
Frank K Soong and Tan Lee,
"Tone-enhanced generalized character posterior probability (GCPP) for
Cantonese LVCSR," Computer Speech and Language,
vol. 22, no. 4,
pp. 360-373, October 2008.
Yu Ting Yeung, Houwei Cao, N. H. Zheng,
Tan Lee and
P. C. Ching, "Language modeling
for speech recognition of spoken Cantonese," in Proc. Interspeech
2008, pp. 1570-1573, Brisbane, Australia, September 22-26, 2008.
Yu Ting Yeung, Yao Qian,
Tan Lee and
Frank K. Soong, "Prosody for Mandarin speech
recognition: a comparative study of read and spontaneous speech,"
in Proc. Interspeech 2008, pp. 1133-1136, Brisbane, Australia, September 22-26, 2008.
Hoi-To Wai, S. W. Lee, Wang Kong Lam, and
Tan Lee, "On pitch
tracking and melody characterization for music signal analysis: A singing voice
database," in Proc. Oriental COCOSDA 2008, pp. 97-102, Kyoto, Japan,
November 25-27, 2008.
Jiang Cao, Xiaojun Wu, Yu Ting Yeung,
Tan Lee and Thomas Fang
Zheng, "Automatic collecting of text data for Cantonese language modeling,"
in Proc. Oriental COCOSDA 2008, pp.
130-134, Kyoto, Japan, November 25-27, 2008.
Wentao Gu, Tan Lee and
P. C. Ching, "Prosodic
variation in Cantonese-English code-mixed speech,"
in Proc. 2008 International
Symposium on Chinese Spoken Language Processing, pp. 342-345, Kunming,
China, December 16-19, 2008.
S. W. Lee, Frank K. Soong,
P. C.
Ching and Tan Lee, "Pitch
tracking for model-based speech separation," in
Proc. 2008 International
Symposium on Chinese Spoken Language Processing, pp. 145-148, Kunming,
China, December 16-19, 2008.
Y. J. Li and Tan
Lee, "A perceptual study of approximated Cantonese tone contours," in Proc.
2008 International Symposium on Chinese Spoken Language Processing, pp.
49-52, Kunming, China, December 16-19, 2008.
Raymond W. M. Ng and Tan Lee,
"Entropy-based analysis of the prosodic features of Chinese
dialects," in Proc. 2008 International Symposium on Chinese Spoken
Language Processing, pp. 65-68, Kunming, China, December 16-19, 2008.
Meng Yuan,
Tan Lee and
Sigfrid D. Soli, "Mandarin tone perception with temporal envelope and
periodicity cues from different frequency regions," in Proc. 2008 International
Symposium on Chinese Spoken Language Processing, pp. 338-341, Kunming,
China, December 16-19, 2008.
Nengheng Zheng, Xia Li, Houwei Cao,
Tan Lee and
P. C. Ching,
"Deriving MFCC parameters from the dynamic spectrum for robust speech
recognition," in Proc. 2008 International Symposium on Chinese Spoken Language Processing, pp.
85-88, Kunming, China, December 16-19, 2008.
2007
Nengheng Zheng,
Tan Lee and
P. C. Ching,
"Integration of complementary acoustic features for
speaker recognition," IEEE Signal Processing Letters,
vol. 14, no. 3, pp. 181-184, March 2007.
C. Yang, Frank K. Soong and
Tan Lee, "Static
and dynamic spectral features: Their noise robustness and optimal weights
for ASR," IEEE/ACM Trans. on Audio, Speech and Language Processing,
vol. 15, no. 3, pp. 1087-1097, March 2007.
Kevin C.P. Yuen, Meng Yuan, Tan Lee, Sigfrid Soli, Michael C.F. Tong, Charles A. van Hasselt,
"Frequency-specific temporal envelope and periodicity components for
lexical tone identification in Cantonese," Ear & Hearing,
vol. 28(2) Supplement, pp. 107S-113S, 2007.
Yao Qian, Tan Lee and Frank K Soong,
"Tone recognition in continuous Cantonese speech
using supratone models," Journal of the Acoustical Society of America,
vol. 121, pp. 2936-2945, May 2007.
W.N. Chan, Nengheng Zheng and
Tan Lee,
"Discrimination power of vocal source and vocal tract features for speaker
recognition," IEEE/ACM Trans. on Audio, Speech and Language Processing,
vol. 15, no. 6, pp. 1884-1892, August 2007.
Nengheng Zheng,
Tan Lee, N. Wang and
P. C. Ching,
"Integrating complementary features from vocal source and vocal tract
for speaker identification," Computational Linguistics & Chinese Language Processing,
vol. 12, no. 3, pp. 273-290, September 2007.
Jing Zhang and P. C. Ching,
"Blind separation of moving speech sources using short-time LOD based
ICA method," in Proc. ICASSP 2007, vol. III, pp. 957-960, Honolulu, Hawaii, USA, April 15-20, 2007.
Wentao Gu and Tan Lee,
"Effects of focus on prosody of Cantonese speech - A comparison of
surface feature analysis and model-based analysis," in Proc. International Workshop on Paralinguistic Speech - between Models and Data
(ParaLing07), pp. 59-64, Saarbrücken, Germany, August 3, 2007.
Wentao Gu and Tan Lee,
"Effects of tonal context and focus on Cantonese F0," in Proc. 16th International Congress of Phonetic Sciences, pp. 1033-1036,
Saarbrücken, Germany, August 6-10, 2007.
Wentao Gu and Tan Lee,
"Quantitative analysis of F0 contours of emotional speech of Mandarin," in
Proc. 6th ISCA Speech Synthesis Workshop, pp. 228-233, Bonn, Germany, August 22-24, 2007.
Meng Yuan,
Tan Lee, Kevin C. P. Yuen, Sigfrid D. Soli, Michael
C. F. Tong and Charles A. van Hasselt, "Band-specific temporal
periodicity enhancement for Cantonese tone perception with
noise-excited vocoder," in Proc. 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, pp. 694-697, Lyon, France, August 23-26, 2007.
S. W. Lee, Frank K. Soong and
P. C. Ching,
"Model-based speech separation with single-microphone input," in Proc. 10th European Conference on Speech Communication and Technology (Interspeech 2007), pp. 850-853, Antwerp, Belgium, August 27-31, 2007.
Wentao Gu, Rerrario Shui-Ching Ho, and
Tan Lee,
"Modeling tones in Hakka on the basis of the command-response model,"
in Proc. 10th European Conference on Speech Communication and Technology (Interspeech 2007), pp. 2633-2636, Antwerp, Belgium, August 27-31, 2007.
Hiroko Hirano, Keikichi Hirose, Goh Kawai, Wentao
Gu, and Nobuaki Minematsu, "F0 models show Chinese speakers of Japanese insert intonational
boundaries and drop pitch," in Proc. 10th European Conference on Speech
Communication and Technology (Interspeech 2007), pp. 1885-1888, Antwerp,
Belgium, August 27-31, 2007.
Yujia Li and
Tan Lee,
"Perceptual equivalence of approximated Cantonese
tone contours," in Proc. 10th European Conference on Speech Communication and Technology (Interspeech 2007), pp. 2677-2680, Antwerp, Belgium, August, 2007.
Houwei Cao, Tan Lee and
P. C. Ching,
"A study of pronunciation variation in Cantonese-English code-mixing
speech," in Proc. Oriental COCOSDA2007, pp. 143-148, Hanoi, Vietnam, Dec. 4-6, 2007.
Ning Wang, P. C. Ching,
N.H. Zheng and
Tan Lee,
"Robust speaker recognition using both vocal source
and vocal tract features estimated from noisy input
utterances," in Proc. 7th IEEE International Symposium on Signal Processing and Information Technology (ISSPIT 2007), pp. 772-777, Cairo, Egypt, Dec. 15-18, 2007.
Meng Yuan,
Tan Lee, Kevin C. P. Yuen, Sigfrid D. Soli, Michael C. F. Tong, Charles A. van Hasselt,
"F0-related periodicity enhancement in temporal envelope for
Cantonese tone recognition," in Proc. Asia Pacific Symposium on Cochlear Implant and Related Sciences
(APSCI2007), pp. 143-144, Sydney, Australia, Oct. 30 - Nov. 2, 2007.
2006
Tan Lee and Yao Qian,
"Tone modeling for speech recognition," Advances in Chinese Spoken Language Processing ed. by C.H.Lee, H. Li, L.S. Lee, R. Wang and Q. Huo, pp. 179-200, Singapore: Springer-Verlag, Dec, 2006
P. C. Ching, Tan Lee, W.K. Lo and Helen Meng,
"Cantonese speech recognition and synthesis," Advances in Chinese Spoken Language Processing ed. by C.H.Lee, H. Li, L.S. Lee, R. Wang and Q. Huo, pp. 365-386, Singapore: Springer-Verlag, Dec, 2006
Y. Zhu and Tan Lee,
"Using duration information in Cantonese connected-digit recognition,"
Computational Linguistics & Chinese Language Processing,
vol. 11, no. 1, pp. 1 - 16, March 2006.
Tan Lee, P. Kam and Frank K. Soong,
"Modeling Cantonese pronunciation variations for large-vocabulary
continuous speech recognition,"
Computational Linguistics & Chinese Language Processing, vol. 11,
no. 1, pp. 17 - 35, March 2006.
Meng Yuan, Tan Lee, P. C. Ching and Y. Zhu,
"Speech recognition on DSP: Issues on computational efficiency and
performance analysis," Microprocessors and Microsystems,
vol. 30, Issue 3, pp. 155-164, May 2006.
Yujia Li, "Tone ratios
combined with F0 register in Cantonese as speaker-dependent
characteristic," in Proc. International Conference on Speech Prosody,
vol. 1, pp. 169 - 172, Dresden, Germany, May 2-5, 2006
Helen MENG, P. C. Ching, Tan Lee, MAK Man Wai, MAK Brian, Moon Yiu Sang, Siu Man-hung, Tang Xiaoou, Hui Pak Sum Henry, Lee Pun Yuen Andrew, W.K. Lo, MA Bin and Sio Kok Tou,
"The multi-biometric, multi-device and multilingual (M3) corpus," in Proc. Multimodal User Authentication (MMUA) Workshop 2006, 8 pages, Toulouse, France, May 11, 2006.
Yao Qian, Frank Soong and Tan Lee,
"Tone-enhanced generalized character posterior probability (GCPP) for
Cantonese LVCSR," in Proc. ICASSP 2006, vol. I, pp. 133 - 136, Toulouse, France, May 14-19, 2006.
S.W. Lee, Frank K. Soong and P. C. Ching,
"An iterative trajectory regeneration algorithm for separating mixed
speech sources," in Proc. ICASSP 2006, vol. I, pp. 157 - 160, Toulouse, France, May 14-19, 2006.
W.N. Chan, Tan Lee, N.H. Zheng and H. Ouyang,
"Use of vocal source features in speaker segmentation," in Proc.
ICASSP 2006, vol. I, pp. 657 - 660, Toulouse, France, May 14-19, 2006.
H. Ouyang, Tan Lee and W.N. Chan,
"Feature extraction from talking mouths for video-based bi-modal speaker
verification," in Proc. ICASSP 2006, vol. V, pp. 513 - 516, Toulouse, France, May 14-19, 2006.
Y.C. Chan, P. C. Ching, Tan Lee and Houwei Cao,
"Automatic speech recognition of Cantonese-English code-mixing
utterances," in Proc. 9th International Conference on
Spoken Language Processing (Interspeech 2006 - ICSLP), pp. 113 - 116, Pennsylvania, USA, September 17-21, 2006.
Xin Lei, Manhung Siu, Mei-yuh Hwang, Mari Ostendorf and Tan Lee,
"Improved tone modeling for Mandarin broadcast news speech recognition,"
in Proc. 9th International Conference on Spoken Language Processing (Interspeech 2006 - ICSLP), pp. 1237 - 1240, Pennsylvania, USA, September 17-21, 2006.
S. Zhang, P. C. Ching and Fan-rang Kong,
"Automatic emotion recognition of speech signal in Mandarin," in Proc. 9th International Conference on Spoken Language Processing (Interspeech 2006 - ICSLP), pp.1810 - 1813, Pennsylvania, USA, September 17-21, 2006.
W. M. Ng, Tan Lee and W. Gu,
"Towards automatic parameter extraction of command-response model for
Cantonese," in Proc. 9th International Conference on Spoken Language Processing (Interspeech 2006 - ICSLP), pp.2358 - 2361, Pennsylvania, USA, September 17-21, 2006.
Houwei Cao, P. C. Ching, Tan Lee and Ning Wang,
"An extended Cantonese-English code-mixing speech corpus: exCUMIX," in
Proc. Oriental COCOSDA 2006, pp. 1-5, Penang, Malaysia, December 9-11, 2006.
Nengheng Zheng, Ning Wang, Tan Lee and P. C. Ching,
"Speaker verification using complementary information from vocal
source and vocal tract," in Proc. 2006 International Symposium on Chinese Spoken Language Processing, (ser. Lecture
notes in Computer Science, Q. Huo, B. Ma, C. E. Siong, and H. Li, Eds.,
vol. 4274), vol. I, pp. 518-528, Springer, Singapore, December 2006.
Nengheng Zheng, P. C. Ching, Ning Wang and Tan Lee,
"Integrating complementary features with a confidence measure for
speaker identification," in Proc. 2006 International Symposium on Chinese Spoken Language Processing, (ser. Lecture
notes in Computer Science, Q. Huo, B. Ma, C. E. Siong, and H. Li, Eds.,
vol. 4274), vol. I, pp. 549-557, Springer, Singapore, December 2006.
Sheng Zhang, P. C. Ching and Fanrang Kong,
"Acoustic analysis of emotional speech in Mandarin Chinese," in Proc. 2006 International Symposium on Chinese Spoken Language Processing,
vol. II, pp. 57-66, Singapore, December 2006.
Jing Zhang and P. C. Ching,
"Short-time ICA for blind separation of noisy speech," in Proc. 2006 International Symposium on Chinese Spoken Language Processing,
vol. II, pp. 258-267, Singapore, December 2006.
Meng Yuan, Kevin C. P. Yuen, Tan Lee, Sigfrid Soli, Michael C. F. Tong and Charles A. van Hasselt,
"Frequency-specific expansion of temporal cues for lexical-tone
identification in Cantonese," in Proc. International Hearing Aid Research Conference (IHCON), Lake Tahoe,
California, August 16 - 20, 2006.
2005
C. Yang, Frank K. Soong and Tan Lee,
"Static and dynamic spectral features: Their noise robustness and optimal
weights for ASR," in Proc. ICASSP 2005, vol. I, pp.241 - 244, Philadelphia, PA, USA, March 2005.
Meng Yuan, Tan Lee and P. C. Ching,
"Speech recognition on DSP: Issues on computational efficiency and performance
analysis," in Proc. IEEE Conference on Communications, Circuits and Systems 2005,
vol. II, pp. 852 - 856, HKUST, Hong Kong, May 2005.
S.W. Lee, Frank K. Soong and P. C. Ching,
"Harmonic filtering for joint estimation of pitch and voiced source with
single-microphone input," in Proc.9th European Conference on Speech Communication and Technology, pp. 309 - 312, Lisboa, Portugal, September 2005.
Joyce Y.C. Chan, P. C. Ching and Tan Lee,
"Development of Cantonese-English code-mixing speech corpus," in Proc. 9th European Conference on Speech Communication and Technology, pp. 1533 - 1536, Lisboa, Portugal, September 2005.
T.Y. Fung, Y.C. Chi, Eddie Sio, Icarus Lee, H. Meng and P. C. Ching,
"Embedded Cantonese TTS for multi-device access to web content," in Proc. 9th European Conference on Speech Communication and Technology, pp. 2601 - 2604, Lisboa, Portugal, September 2005.
Hua Ouyang and Tan Lee,
"A new lip feature representation method for video-based bimodal
authentication," in Proc. 2005 NICTA-HCSNet Multimodal User Interaction Workshop,
vol. 57, pp. 33 - 37, Sydney, Australia, 13-14 September 2005.
N.H. Zheng, Tan Lee and P. C. Ching,
"Comparative analysis of discrimination power of the vocal source and vocal
tract features for speaker verification," in Proc. 8th National Conference on Man Machine Speech Communication, pp. 210 - 213, Beijing, China, October 22-24, 2005.
C. Qin, Tan Lee and H. Meng,
"On anti-model design for Cantonese verbal information verification," in Proc. 8th National Conference on Man Machine Speech Communication, pp. 375 - 378, Beijing, China, October 22-24, 2005.
YUEN Chi Pun, Meng Yuan, Tan LEE, SOLI Sigfrid, TONG Chi Fai Michael and VAN HASSELT Charles Andrew,
"Frequency-specific temporal envelope and periodicity components for lexical
tone identification in Cantonese," in Proc. 5th Asia Pacific Symposium on Cochlear Implant and Related Sciences, pp. 84,
Hong Kong, 26th - 28th November 2005.
N.H. Zheng, C. Qin, Tan Lee and P. C. Ching,
"CU2C: A dual-condition Cantonese speech database for speaker recognition
applications," in Proc. 2005 International Conference on Speech Databases and Assessment (Oriental-COCOSDA 2005), pp. 67 - 72, Jakarta, Indonesia, December 6-8, 2005.
2004
W.K. Lo, Helen Meng and P. C. Ching,
"Multi-scale spoken document retrieval for Cantonese
broadcast news,"
International Journal on Speech Technology, vol. 7, iss. 2-3, pp. 203 - 219, April 2004.
Yujia Li, Tan Lee and Yao Qian,
"Analysis and modeling of F0 contours for Cantonese text-to-speech," ACM
Trans. on Asian Language Information Processing, vol. 3, iss. 3, pp. 169-180, September 2004.
Yujia Li, Tan Lee and Yao Qian,
"F0 analysis and modeling for Cantonese text-to-speech," in Proc. International Conference on Speech Prosody, pp.467 - 470, Nara, Japan, March 2004.
Yao Qian, Tan Lee and Frank Soong,
"Use of tone information in continuous Cantonese speech recognition," in Proc. International Conference on Speech Prosody, pp.587 - 590, Nara, Japan, March 2004.
N.H. Zheng and P. C. Ching,
"Using HAAR transform vocal source information for automatic speaker
recognition," in Proc. ICASSP 2004, vol.I, pp.77 - 80, Montreal, Quebec, Canada, May 2004.
H. Meng, Y.C. Li, T.Y. Fung, K.F. Low, K.F. Chow, T.H. Lo, M.C. Ho and P. C. Ching,
"Bilingual Chinese/English voice browsing based on a VoiceXML
platform," in Proc. ICASSP 2004, vol.III, pp.769 - 772, Montreal, Quebec, Canada, May 2004.
S.W. Lee, P. C. Ching and Tan Lee,
"Noise-robust automatic speech recognition using mainlobe-resilient
time-frequency quantile-based noise estimation," in Proc. IEEE International Symposium on Circuits and Systems,
vol. III, pp.425 - 428, Vancouver, Canada, May 2004.
S.W. Lee and P. C. Ching,
"In-phase feature induction: An effective compensation technique for
robust speech recognition," in Proc. 8th International Conference on Spoken Language Processing,
vol. I, pp.157 - 160, Jeju Island, Korea, October 2004.
Y. Zhu and Tan Lee,
"Explicit duration modeling for Cantonese connected-digit
recognition," in Proc. 8th International Conference on Spoken Language Processing,
vol. I, pp.685 - 688, Jeju Island, Korea, October 2004.
Y. Qian, Tan Lee and Frank K. Soong,
"Tone information as a confidence measure for improving Cantonese
LVCSR," in Proc. 8th International Conference on Spoken Language Processing,
vol. III, pp.1965 - 1968, Jeju Island, Korea, October 2004.
N.H. Zheng, P. C. Ching and Tan Lee,
"Time frequency analysis of vocal source signal for speaker
recognition," in Proc. 8th International Conference on Spoken Language Processing,
vol. III, pp.2333 - 2336, Jeju Island, Korea, October 2004.
C. Yang, Frank K. Soong and Tan Lee,
"On noise robustness of dynamic and static features for continuous
Cantonese digit recognition," in Proc. 2004 International Symposium on Chinese Spoken Language Processing, pp.277 - 280,
Hong Kong, December 2004.
Joyce Y.C. Chan, P. C. Ching, Tan Lee and H. Meng,
"Detection of language boundary in code-switching utterances by
bi-phone probabilities," in Proc. 2004 International Symposium on Chinese Spoken Language Processing, pp.293 - 296,
Hong Kong, December 2004.
C. Qin and Tan Lee,
"Cantonese verbal information verification system using GMM-based
anti-model," in Proc. 2004 International Symposium on Chinese Spoken Language Processing, pp.297 - 300,
Hong Kong, December 2004.
2003
W.K. Lo, Helen Meng and P. C. Ching,
"Cross-language spoken document retrieval using
HMM-based retrieval model with multi-scale fusion,"
ACM Trans. on Asian Language Information Processing,
vol.2, iss.1, pp.1 - 26, March 2003.
Tan Lee, Helen Meng, W.K. Lo and P. C. Ching,
"The state of the art in human-computer speech-based interface
technologies," HKIE Trans., vol.10, no. 4, pp. 50 - 61, December 2003.
H. Meng, T.H. Lo, C.K. Keung, M.C. Ho, W.K. Lo and P. C. Ching,
"CU VOCAL web service: A text-to-speech synthesis web service for
voice-enabled web-mediated applications," in Proc. the Twelfth International World Wide Web Conference, Budapest, Hungary,
pp. 56 - 57, May 2003.
C.F. Chan, W. Han, K.W. Hon, Tan Lee, C.S. Choy, K.P. Pun and P. C. Ching,
"An HMM-based speech recognition IC," in Proc. IEEE International Symposium on Circuits and Systems, Bangkok,
vol. II, pp.744 - 747, May 2003.
H. Meng, T.Y. Fung, Y.C. Li, M.C. Ho, T.H. Lo, C.K. Keung, W.K. Lo and P. C. Ching,
"Recent enhancements in CU VOCAL for Chinese TTS-enabled applications,"
in Proc. 8th European Conference on Speech Communication and Technology, Geneva, Switzerland, pp.
1253 - 1256, September 2003.
Patgi Kam, Tan Lee and Frank Soong,
"Modeling Cantonese pronunciation variation by acoustic model
refinement,"
in Proc. 8th European Conference on Speech Communication and Technology, Geneva, Switzerland,
pp.1477 - 1480, September 2003.
Yao Qian, Tan Lee and Yujia Li,
"Overlapped di-tone modeling for tone recognition in continuous
Cantonese speech,"
in Proc. 8th European Conference on Speech Communication and Technology, Geneva, Switzerland,
pp.1845 - 1848, September 2003.
Wei Han, K.W. Hon, C.F. Chan, Tan Lee, C.S. Choy, K.P. Pun and P. C. Ching,
"A real-time Chinese speech recognition IC with double mixtures,"
in Proc. 5th International Conference on ASIC, Beijing, China,
pp.926 - 929, October 2003.
Y.C. Li, T.Y. Fung, Helen Meng, and P. C. Ching,
"CU VOCAL: A Cantonese text-to-speech synthesizer," in
Proc. 11th Annual Conference of the Hong Kong Institution of Science Park,
Hong Kong SAR, November 2003.