Combining Deep Speaker Specific Representations with GMM-SVM for Speaker Verification

Ryan Price; Sangeeta Biswas; Koichi Shinoda

論文・著書情報

タイトル

和文:
英文:	Combining Deep Speaker Specific Representations with GMM-SVM for Speaker Verification

著者

和文:	Price RyanWilliam, Sangeeta Biswas, 篠田浩一.
英文:	Ryan Price, Sangeeta Biswas, Koichi Shinoda.

言語

English

掲載誌/書名

和文:	INTERSPEECH2013
英文:	INTERSPEECH2013

巻, 号, ページ

pp. 2788-2792

出版年月

2013年8月25日

出版者

和文:
英文:

会議名称

和文:
英文:	INTERSPEECH2013

開催地

和文:	リヨン
英文:	Lyon

公式リンク

http://www.interspeech2013.org/

アブストラクト

This study combines a Gaussian mixture model support vector machine (GMM-SVM) system with a nonlinear feature transformation, discriminatively trained to extract speaker specific features from MFCCs. Separation of the speaker information component and non-speaker related information in the speech signal is accomplished using a regularized siamese deep network (RSDN). RSDN learns a hidden representation that well characterizes speaker information by training a subset of the hidden units using pairs of speech segments. MFCC features are input to a trained RSDN and a subset of hidden layer outputs are used as new input features in a GMM-SVM system. We demonstrate the potential of this approach for text-independent speaker verification by applying it to a subset of the NIST SRE 2006 1conv4w-1conv4w task. The hybrid RSDN GMM-SVM system achieves about 5% relative improvement over the baseline GMM-SVM system. Index Terms: speaker verification, neural networks, feature extraction, GMM-SVM

Home

各種検索

サポート

T2R2について

関連リンク

論文・著書情報