Xiaofei LI, Ph. D.

Audio Signal and Information Processing Lab

CONTACT

Email: lixiaofei@westlake.edu.cn

Website: https://audio.westlake.edu.cn/

“Focus on scientific research, contribute more to the development of Westlake University.”

Biography

I have been an Assistant Professor at Westlake University since March 2020. Prior to this, I worked in the PERCEPTION team at INRIA Grenoble Rhône-Alpes, France, as a post-doctoral researcher from Feb. 2014 to Jan. 2016 and as a starting research scientist from Feb. 2016 to Dec. 2019, hosted by Dr. Radu Horaud. I completed my PhD in Electronics at Peking University from 2007 to 2013, supervised by Prof. Hong Liu. I received a Bachelor's degree in Electronic Information from Beijing Institute of Machinery in 2007.

Research

My expertise is in acoustic, audio, and speech signal processing, with research topics including channel identification and equalization, noise estimation, sound source localization, speech enhancement, speech separation, and robust speech recognition. I have made two major contributions to the field: the application of the convolutive transfer function to sound source localization and speech dereverberation, and narrow-band deep filtering, which applies deep neural networks to signal filtering, more specifically speech denoising.

Representative Publications

1. Xiaofei Li*, Laurent Girin, Sharon Gannot and Radu Horaud. Multichannel Online Dereverberation based on Spectral Magnitude Inverse Filtering. IEEE/ACM Transactions on Audio, Speech and Language Processing, 27 (9), pp. 1365 - 1377, 2019.

2. Xiaofei Li*#, Yutong Ban#, Laurent Girin, Xavier Alameda-Pineda and Radu Horaud. Online Localization and Tracking of Multiple Moving Speakers in Reverberant Environment. IEEE Journal of Selected Topics in Signal Processing, 13 (1), pp. 88 - 103, 2019. (#equal contribution)

3. Xiaofei Li*, Laurent Girin, Sharon Gannot and Radu Horaud. Multichannel Speech Separation and Enhancement Using the Convolutive Transfer Function. IEEE/ACM Transactions on Audio, Speech and Language Processing, 27 (3), pp. 645 - 659, 2019.

4. Xiaofei Li*, Simon Leglaive, Laurent Girin, and Radu Horaud. Audio-noise Power Spectral Density Estimation Using Long Short-term Memory. IEEE Signal Processing Letters, 26 (6), pp. 918 - 922, 2019.

5. Xiaofei Li*, Sharon Gannot, Laurent Girin and Radu Horaud. Multichannel Identification and Nonnegative Equalization for Dereverberation and Noise Reduction based on Convolutive Transfer Function. IEEE/ACM Transactions on Audio, Speech and Language Processing, 26 (10), pp. 1755 - 1768, 2018.

6. Xiaofei Li*, Laurent Girin, Radu Horaud and Sharon Gannot. Multiple-Speaker Localization Based on Direct-Path Features and Likelihood Maximization with Spatial Sparsity Regularization. IEEE/ACM Transactions on Audio, Speech and Language Processing, 25 (10), pp. 1997 - 2012, 2017.

7. Xiaofei Li*, Laurent Girin, Radu Horaud and Sharon Gannot. Estimation of the Direct-Path Relative Transfer Function for Supervised Sound-Source Localization. IEEE/ACM Transactions on Audio, Speech and Language Processing, 24 (11), pp. 2171 - 2186, 2016.

8. Xiaofei Li and Hong Liu*. Sound Source Localization for HRI Using FOC-based Time Difference Feature and Spatial Grid Matching. IEEE Transactions on Cybernetics, 43 (4), pp. 1199 - 1212, 2013.

9. Bing Yang, Hong Liu*, Cheng Pang and Xiaofei Li. Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering. IEEE/ACM Transactions on Audio, Speech and Language Processing, 27 (8), pp. 1241 - 1255, 2019.

10. Israel D. Gebru*, Silèye Ba, Xiaofei Li and Radu Horaud. Audio-Visual Speaker Diarization Based on Spatiotemporal Bayesian Fusion. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40 (5), pp. 1086 - 1099, 2018.

11. Cheng Pang, Hong Liu*, Jie Zhang and Xiaofei Li. Binaural Sound Localization Based on Reverberation Weighting and Generalized Parametric Mapping. IEEE/ACM Transactions on Audio, Speech and Language Processing, 25 (8), pp. 1618 - 1632, 2017.

12. Pingping Wu, Hong Liu*, Xiaofei Li, Ting Fan, and Xuewu Zhang. A Novel Lip Descriptor for Audio-Visual Keyword Spotting Based on Adaptive Decision Fusion. IEEE Transactions on Multimedia, 18 (3), pp. 326 - 338, 2016.

13. Xiaofei Li* and Radu Horaud. Multichannel Speech Enhancement based on Time-frequency Masking using Subband Long Short-Term Memory. WASPAA, 2019.

14. Xiaofei Li*, Sharon Gannot, Laurent Girin and Radu Horaud. Multisource MINT Using the Convolutive Transfer Function. ICASSP, 2018.

15. Xiaofei Li*, Laurent Girin and Radu Horaud. An EM Algorithm for Audio Source Separation Based on the Convolutive Transfer Function. WASPAA, 2017.

16. Xiaofei Li*, Laurent Girin and Radu Horaud. Audio Source Separation based on Convolutive Transfer Function and Frequency-Domain Lasso Optimization. ICASSP, 2017.

17. Xiaofei Li*, Laurent Girin, Radu Horaud and Sharon Gannot. Estimation of Relative Transfer Function in the Presence of Stationary Noise Based on Segmental Power Spectral Density Matrix Subtraction. ICASSP, 2015.