[1] Kinnunen T, Li H Z. An overview if text-independent speaker recognition: From features to supervectors[J]. Speech Comm, 2010(52):12-40.[2] Yang X L, Tan B H, Ding J H, et al. Comparative study on voice activity detection algorithm[C]//International Conference on Electrical and Control Engineering. 2010: 599-602.[3] G.723.1, Annex A:Silence compression scheme[S]. ITU-T, Nov 1996.[4] Qian Y M, Liu J. Cross-entropy OSF-based voice activity detection algorithm[J]. J Tsinghua Uni, 2009, 49(10): 87-90(in Chinese). 钱彦旻, 刘加. 基于交叉熵顺序统计滤波的语音端点检测算法[J]. 清华大学学报, 2009, 49(10):87-90.[5] Ramírez J, Segura J C, Benítez C. An effective subband OSF-based VAD with noise reduction for robust speech recognition[J]. IEEE Transaction on Speech and Audio Processing, 2005, 13(6): 1119-1129.[6] Ramírez J, Segura J C, Benítez C. A new Kullback-Leibler VAD for s peech recognition in noise[J]. IEEE Signal Processing Letters, 2004, 11(2): 266-269.[7] Juang B H, Rabiner L R. Hidden markov models for speech recognition[J]. Technometrics, 1991, 33(3):251-272.[8] BenZeghiba M F, Gauvain J L, Lamel L. Context-dependent phone models and models adaptation for phonotactic language recognition[C]//Proceedings of Interspeech 2008.Brisbane, 2008:313-316.[9] Jesus Antonio Villalba Lopez. Segmentation Experiments for NIST SRE[R]. Brno University of Technology, 2009.[10] Reynolds D A, Quatieri T F, Dunn R B. Speaker verification using adapted Gaussian mixture models[J]. Digital Signal Processing, 2000, 10: 19-41.[11] NIST. 2008 NIST Speaker Recognition Evaluation[EB/OL]. (2008-10-04)[2013-05-10]. http://www.itl.nist.gov/iad/mig/tests/spk/2008/index.html.[12] NIST. 2010 NIST Speaker Recognition Evaluation[EB/OL]. (2010-04-21)[2013-05-10]. http://www.itl.nist.gov/iad/mig/tests/spk/2010/index.html.[13] 丁爱明.作为说话人识别特征参量的MFCC的提取过程[J].电子工程师, 2006, 32(1):51-53.[14] Hermansky H.Perceptual linear predictive (PLP) analysis of speech[J]. Journal Acoustical Society of America, 1990, 87(4): 1 738-1 752.[15] Hermansky H, Hanson B A. Perceptually based linear predictive analysis of speech[C]//Proc of the IEEE International Conference on Acoustics, Speech and Signal Processing. Tampa USA, 1985, X: 509-512. |