Home >

news ヘルプ

論文・著書情報


タイトル
和文: 
英文:Combining Deep Speaker Specific Representations with GMM-SVM for Speaker Verification 
著者
和文: Price RyanWilliam, Sangeeta Biswas, 篠田 浩一.  
英文: Ryan Price, Sangeeta Biswas, Koichi Shinoda.  
言語 English 
掲載誌/書名
和文:INTERSPEECH2013 
英文:INTERSPEECH2013 
巻, 号, ページ         pp. 2788-2792
出版年月 2013年8月25日 
出版者
和文: 
英文: 
会議名称
和文: 
英文:INTERSPEECH2013 
開催地
和文:リヨン 
英文:Lyon 
公式リンク http://www.interspeech2013.org/
 
アブストラクト This study combines a Gaussian mixture model support vector machine (GMM-SVM) system with a nonlinear feature transformation, discriminatively trained to extract speaker specific features from MFCCs. Separation of the speaker information component and non-speaker related information in the speech signal is accomplished using a regularized siamese deep network (RSDN). RSDN learns a hidden representation that well characterizes speaker information by training a subset of the hidden units using pairs of speech segments. MFCC features are input to a trained RSDN and a subset of hidden layer outputs are used as new input features in a GMM-SVM system. We demonstrate the potential of this approach for text-independent speaker verification by applying it to a subset of the NIST SRE 2006 1conv4w-1conv4w task. The hybrid RSDN GMM-SVM system achieves about 5% relative improvement over the baseline GMM-SVM system. Index Terms: speaker verification, neural networks, feature extraction, GMM-SVM

©2007 Institute of Science Tokyo All rights reserved.