JISE

[1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15]

Journal of Information Science and Engineering, Vol. 34 No. 6, pp. 1469-1491

Locally Linear Embedding Based Post-Filtering for Speech Enhancement

HSIN-TE HWANG^1,3, YI-CHIAO WU¹, SYU-SIANG WANG², CHIN-CHENG HSU¹,
YU TSAO², HSIN-MIN WANG¹, YIH-RU WANG³ AND SIN-HORNG CHEN³
¹Institute of Information Science
²Research Center for Information Technology Innovation
Academia Sinica
Taipei, 115 Taiwan
³Department of Electrical and Computer Engineering
National Chiao Tung University
Hsinchu, 300 Taiwan
E-mail: {hwanght; tedwu; jeremycchsu; whm}@iis.sinica.edu.tw; sypdbhee@gmail.com;
yu.tsao@citi.sinica.edu.tw; {yrwang; schen}@mail.nctu.edu.tw

We present a novel speech enhancement method based on locally linear embedding (LLE). The proposed method works as a post-filter to further suppress the residual noises in the enhanced speech signals obtained by a speech enhancement system to attain improved speech quality and intelligibility. We design two types of LLE-based post-filters: the direct LLE-based post-filter (called the DL post-filter) and the LLE-based difference compensation post-filter (called the LDC post-filter). The key technique of the proposed post-filters is to apply the LLE-based feature prediction method, which integrates the LLE algorithm, a classical manifold learning method, with the exemplar-based feature prediction method, to predict either the spectral features of the clean speech from those of the enhanced speech (for DL) or the spectral difference of {clean speech; noisy speech} from that of {enhanced speech; noisy speech} (for LDC). As a result, for DL, the predicted clean speech signals can be directly reconstructed from the predicted clean spectral features. On the other hand, for LDC, the predicted clean spectral features are obtained by compensating the spectral features of the noisy speech with the predicted clean-noisy spectral difference, and then the predicted clean speech signals can be reconstructed accordingly. Experimental results demonstrate the effectiveness of the proposed post-filters for two representative speech enhancement methods, namely the deep denoising autoencoder (DDAE) and the minimum mean-square-error (MMSE) spectral estimation methods.

Keywords: speech enhancement, locally linear embedding, post-filter/postfilter, exemplar-based, manifold learning

Retrieve PDF document (JISE_201806_07.pdf)