Home CV Research Publication

 

 

[PDF version]

Ming Li

3710 McClintock Ave, Room RTH 320, Los Angeles, CA, 90089 l (213) 446-0346 l  mingli at usc dot edu

--------------------------------------------------------------------------------------------------------------------------------------------------------

EDUCATION

 

University of Southern California                                                                                Los Angeles, USA

Ph.D. student in Electrical Engineering Department                                                           2008-present

l        Signal Analysis and Interpretation Laboratory

l        Advisor: Shrikanth Narayanan

l        Provost Fellowship

Institute of Acoustics, Chinese Academy of Sciences                                                   Beijing, China

Master of Engineering in Signal and Information Processing                                                 2005-2008

l        ThinkIT Speech Laboratory

l        Advisor: Yonghong Yan

l        Research Assistant

Nanjing University                                                                                                    Nanjing, China

Bachelor of Science in Communication Engineering                                                            2001-2005               

l        Graduate with the highest honor

l        GPA: 91/100

l        Rank: top1 within 50 students in communication engineering major

 

INDUSTRY

Qualcomm Corporate R&D                                                                                                              San Diego, USA

Summer Intern                                                                                                      2011 summer

 

EXPERIENCE

 

Signal Analysis and Interpretation Laboratory                                                            Los Angeles, USA

Provost Fellowship, Research Assistant                                                                          August 2008 to present

l        Robust language identification

l        Speaker verification using sparse representation

l        Multimodal audio-visual biometric algorithms with liveness detection

l        Speaker states recognition using latent factor analysis based Eigenchannel factor vector modeling

l        Speaker age and gender recognition system using acoustic and prosodic information fusion

l        Multimodal physical activity recognition system in the KNOWME wireless sensor network

l        ECG biometric algorithm using the temporal and cepstral information fusion

l        Interspeech 2011 speaker state challenge (Intoxication detection) winner as a core team member

l        Co-author of best paper award in the 5th IEEE International Conference on Distributed Computing in Sensor Systems

l        Body Computing Slam Contest 2009 winner as a core team member

ThinkIT Speech Laboratory                                                                                        Beijing, China

Research Assistant                                                                                                     August 2005 to July 2008

l        Developed the monitor system of audio signals in the satellite transmission, based on audio watermarking techniques

l        Developed the shortwave broadcast quality monitoring system, based on audio watermarking techniques

l        Developed the shortwave/satellite broadcast quality monitoring equipment, which is based on TI DSP platforms.

l        Developed the embedded real-time voice changing equipment, which is based on TI DSP platform.

l        Developed the accompaniment suppression system for karaoke media.

l        Main researcher of the language identification system using high quality phoneme recognizer, score vector modeling and support vector machine

l        Main researcher of the speaker verification system using GMM supervector, SVM and NAP/LFA session variability compensation method

l        Participated in the NIST Language Recognition Evaluation (LRE) 2007

l        Participated in the NIST Speaker Recognition Evaluation (SRE) 2008

l        Researched in the field of Computational Auditory Scene Analysis (Co-channel speech separation)

Nanjing University                                                                                                        Nanjing, China

Student                                                                                                                     August 2001 to July 2005

l        Researcher as an undergraduate student in Texas Instruments-Nanjing University joint founded DSP lab.

l        Visit UCB, MIT, UTD and RUTGERS for an American tour of Chinese traditional music performance

l        Nanjing University Renmin Scholarship, 2001-2004

l        Nanjing University Excellent Student, 2001-2004

l        Robert Mundell Scholarship, 2002-2003

l        First Prize of “520 innovation design contest” in Nanjing University, 2004

l        Second Prize of Intel Cup National Undergraduate Electronic Design Contest, 2004

 

PUBLICATIONS

Journal papers:

l        Ming Li, Kyu J. Hanb and Shrikanth Narayanan, “Automatic Speaker Age and Gender Recognition using acoustic and prosodic level information fusion”, to appear Computer speech and language 2012. [PDF]

l        Ming Li, Viktor Rozgic, Gautam Thatte, Sangwon Lee, Adar Emken, Murali Annavaram, Urbashi Mitra, Donna Spruijt-Metz and Shrikanth Narayanan, "Multimodal Physical Activity Recognition by Fusing Temporal and Cepstral Information," IEEE Transactions on Neural Systems & Rehabilitation Engineering, vol 18, issue 4, August, 2010. [PDF]

l        U. Mitra, A. Emken, S. Lee, M. Li, V. Rozgic, G. Thatte, H. Vathsangam, D. Zois, M. Annavaram, S. Narayanan, D. Spruijt-Metz, and G. Sukhatme, "KNOWME:  a Case Study in Wireless Body Area Sensor Network Design", submitted to IEEE Communications Magazine (revised)

l        Gautam Thatte, Ming Li, Sangwon Lee, Adar Emken, Murali Annavaram, Shri Narayanan, Donna Spruijt-Metz, Urbashi Mitra, “Optimal Time-Resource Allocation for Energy-Efficient Physical Activity Detection”, IEEE Transaction on Signal Processing, vol 59, issue 4, April, 2011. [PDF]

l        Gautam Thatte, Ming Li, Sangwon Lee, Adar Emken, Shri Narayanan, Urbashi Mitra, Donna Spruijt-Metz and Murali Annavaram, “KNOWME: An Energy-Efficient and Multimodal Body Area Sensing System for Physical Activity Monitoring,” to appear on ACM Transactions in Embedded Computing Systems. [PDF]

l        Adar Emken, Ming Li, Gautam Thatte, Sangwon Lee, Murali Annavaram, Urbashi Mitra, Shrikanth Narayanan, Donna Spruijt-Metz, “Recognition of Physical Activities in Overweight Hispanic Youth using KNOWME Networks”, Journal of Physical Activity and Health, May 2011. [PDF]

l        Hongbin Suo, Ming Li, Ping Lu, Yonghong Yan, “Automatic language identification with discriminative language characterization based on SVM”, IEICE transaction on Information and Systems, Special Section on Robust Speech processing in Realistic Environment, 2008. [PDF]

l        Hongbin Suo, Ming Li, Ping Lu, Yonghong Yan, "Using SVM as back-end classifier for language identification", EURASIP Journal on Audio, Speech, and Music Processing, Special Issue on Intelligent Audio, Speech, and Music Processing Applications, 2008. [PDF]

Conference papers:

l        Ming Li, Charley Lu, Anne Wang, Shrikanth Narayanan, "Speaker Verification using Lasso based Sparse Total Variability Supervector and Probabilistic Linear Discriminant Analysis”, to appear in NIST Speaker Recognition Workshop, Atlanta, 2011. [PDF][slides]

l        Ming Li, Angeliki Metallinou, Daniel Bone, Shrikanth Narayanan, "Speaker states recognition using latent factor analysis based Eigenchannel factor vector modeling", ICASSP 2012. [PDF]

l        Ming Li, Xiang Zhang, Yonghong Yan and Shrikanth Narayanan, "Speaker Verification using Sparse Representations on Total Variability I-Vectors,”, proceeding of International Conference on Spoken Language Processing, INTERSPEECH, Florence, Italy, 2011. [PDF]

l         Ming Li, Shrikanth Narayanan, “Robust talking face video verification using joint factor analysis and sparse representation on GMM mean shifted supervectors”, IEEE International Conference on Audio, Speech and Signal Processing (ICASSP), Prague, Czech Republic, 2011. [PDF]

l        Ming Li, Shrikanth Narayanan, “ECG Biometrics by Fusing Temporal and Cepstral Information”, 20th conference of the International Association for Pattern Recognition, ICPR 2010, Turkey. [PDF][Poster]

l        Ming Li, Chi-Sang Jung and Kyu Jeong Han, “Combining Five Acoustic Level methods for Automatic Speaker Age and Gender Recognition”, proceeding of International Conference on Spoken Language Processing, INTERSPEECH 2010. [PDF][Poster]

l        Ming Li, Adar Emken, Shri Narayanan, Gautam Thatte, Sangwon Lee, Harshvardhan Vathsangam, Gaurav Sukhatme, Urbashi Mitra, Murali Annavaram and Donna Spruijt-Metz, "Using the KNOWME Networks Mobile Biomonitoring System to Characterize Physical Activity in Overweight Hispanic Youth," ACSM Health and Fitness Summit, Austin, TX (April 2010).

l        Ming Li, Chuan Cao, Di Wang, Ping Lu, Qiang Fu, and Yonghong Yan, “Cochannel speech separation using multi-pitch estimation and model based voiced sequential grouping”, proceeding of International Conference on Spoken Language Processing, INTERSPEECH 2008. [PDF]

l        Ming Li, Hongbin Suo, Xiao Wu, Ping Lu, Yonghong Yan, “Spoken Language Identification Using Score Vector Modeling and Support Vector Machine”, proceeding of International Conference on Spoken Language Processing, INTERSPEECH 2007. [PDF] [SLIDES]

l        Ming Li, Yun Lei, Xiang Zhang, Jian Liu, Yonghong Yan, “authentication and quality monitoring based audio watermark for analog AM shortwave broadcasting”, proceeding of IEEE International Conference on Intelligent Information Hiding and Multimedia Signal ProcessingIIH-MSP 2007. [PDF]

l        Ming Li, Yun Lei, Jian Liu, Yonghong Yan, "A Novel Audio Watermarking in Wavelet Domain", proceeding of IEEE International Conference on Intelligent Information Hiding and Multimedia Signal ProcessingIIH-MSP 2006. [PDF] [SLIDES]

l        Samuel Kim, Ming Li, Sangwon Lee, Urbashi Mitra, Adar Emken, Donna Spruijt-Metz, Murali Annavaram, Shrikanth Narayanan, "Modeling high-level descriptions of real-life physical activities using latent topic modeling of multimodal sensor signals", the 33st Annual International Conference of the IEE Engineering in Medicine and Biology Society (EMBC'11), boston, 2011.

l        Daniel Bone, Matthew P. Black, Ming Li, Angeliki Metallinou, Sungbok Lee and Shrikanth Narayanan, "Intoxicated Speech Detection by Fusion of Speaker Normalized Hierarchical Features and GMM Supervectors", proceeding of International Conference on Spoken Language Processing, INTERSPEECH, Florence, Italy, 2011.

l        Gautam Thatte, Viktor Rozgic, Ming Li, Sabyasachi Ghosh, Urbashi Mitra, Shri Narayanan, Murali Annavaram, Donna Spruijt-Metz, "Optimal Time-Resource Allocation for Activity-Detection via Multimodal Sensing," in Proceedings of the Fourth International Conference on Body Area Networks (BodyNets'09), Los Angeles, CA (April 2009). [PDF]

l        Gautam Thatte, Viktor Rozgic, Ming Li, Sabyasachi Ghosh, Urbashi Mitra, Shri Narayanan, Murali Annavaram and Donna Spruijt-Metz, "Optimal Allocation of Time-Resources for Multihypothesis Activity-Level Detection," in Proceedings of the 5th IEEE International Conference on Distributed Computing in Sensor Systems (DCOSS'09), Marina Del Rey, CA (June 2009). (Best paper award!) [PDF]

l        Sangwon Lee, Murali Annavaram, Gautam Thatte, Vikor Rozgic, Ming Li, Urbashi Mitra, Shri Narayanan and Donna Spruijt-Metz, "Sensing for Obesity: KNOWME Implementation and Lessons for an Architect," in Proceedings of the Workshop on Biomedicine in Computing: Systems, Architectures, and Circuits (BiC2009), Austin, TX (June 2009). [PDF]

l        Gautam Thatte, Ming Li, Adar Emken, Urbashi Mitra, Shri Narayanan, Murali Annavaram and Donna Spruijt-Metz, "Energy-Efficient Multihypothesis Activity-Detection for Health-Monitoring Applications," the 31st Annual International Conference of the IEE Engineering in Medicine and Biology Society (EMBC'09), Minneapolis, MN (September 2009). [PDF]

l        Donna Spruijt-Metz, Ming Li, Gautam Thatte, Gaurav Sakhatme, Murali Annavaram, Sabyasachi Ghosh, Viktor Rozgic, Urbashi Mitra, Nenad Medvidovic, Britni Belcher and Shrikanth Narayanan, "Differentiating physical activity modalities in youth using heartbeat waveform shape and differences between adjacent waveforms," in Proceedings of the 7th International Conference on Diet and Activity Methods (ICDAM 7), Washington DC (June 2009).

l        Gautam Thatte, Ming Li, Adar Emken, Urbashi Mitra, Shri Narayanan, Murali Annavaram and Donna Spruijt-Metz, "Energy-Efficient Activity-Detection via Multihypothesis Testing for Pediatric Obesity," the 7th Annual CENS Research Review, Los Angeles, CA (October 2009).

l        D. Spruijt-Metz, S. Narayanan, U. Mitra, G. Sukhatme, M. Li, G. Thatte, A. Emken, S. Lee, H. Vathsangam and M. Annavaram, "KNOWME Networks:  Mobile Device Biomonitoring to Prevent and Treat Obesity in Underserved Minority Youth," mHealth Summit, Washington, DC (October 2009).

l        Hongbin Suo, Ming Li, Xiang Xiao, Xiang Zhang, Xiang Wang, Ping Lu, Yonghong Yan, “IOA ThinkIT Speech Laboratory System Description for NIST LRE07”, NIST language identification evaluation workshop, 2007.

l        Hongbin Suo, Ming Li, Tantan Liu, Ping Lu, Yonghong Yan,The Design of Backend Classifiers in PPRLM System for Language Identification”, proceeding of International Conference on Natural Computation, ICNC 2007. [PDF]

l        Hongbin Suo, Ming Li, Ping Lu, Yonghong Yan, “Language identification based on parallel PRLM system”, proceeding of Chinese national conference on network security, 2007 (in Chinese).

 

ADDITIONAL INFORMATION

 

Skills

l        C/C++, Matlab, Perl, TI DSP CCS

Fields of Interest

Human state recognition including:

1.Speech signal processing: Speaker verification, spoken language identification, speaker age and gender identification

2.Multimodal biometrics: Audio-visual joint biometrics, emerging behavior biometrics (ECG biometrics), multimodal fusion

3.Body sensing, processing and modeling methods in metabolic health monitoring: Multimodal physical activity recognition, multimodal emotion recognition, energy efficient sensing and modeling, compressive sensing [KNOWME network]

My previous research interests:

Audio watermarking: Robust frequency domain audio watermarking, content adaptive audio watermarking in wavelet domain

Computational acoustics scene analysis: co-channel speech separation

Hobbies

l        Play ErHu, a kind of Chinese traditional music instrument which is also called as “Chinese Violin”

l        Basketball

l        Swimming

 

                                

 

     

Home | CV | Research | Publication

This site was last updated 02/01/12

Copyright Notice: Permission to copy without fee all or part of this material is granted provided that the copies are not made or distributed for direct commercial advantage. To copy otherwise, or to republish, requires a fee and/or specific permission of the ACM/IEEE or other original publisher.

The University of Southern California does not screen or control the content on this website and thus does not guarantee the accuracy, integrity, or quality of such content. All content on this website is provided by and is the sole responsibility of the person from which such content originated, and such content does not necessarily reflect the opinions of the University administration or the Board of Trustees