References

[1]Rabiner L, Juang B H. Fundamentals of Speech Recognition[M]. New Jersey: Prentice Hall PTR, 1993.

[2]易克初,田斌,付强.语音信号处理[M].北京:国防工业出版社,2000.

[3]Huang X D, Acero A, Hon H W. Spoken Language Processing: A Guide to Theory, Algorithm, and System Development[M]. New Jersey: Prentice Hall PTR, 2001.

[4]杨行峻,迟惠生.语音信号数字处理[M].北京:电子工业出版社,1995.

[5]刘加.汉语大词汇量连续语音识别系统研究进展[J].电子学报,2000,28(1):85-91.

[6]张全.语言声学的进展[J].应用声学,2002,21(1):35-39.

[7]吕士楠,张连毅,林凡.TTS技术的发展和展望[C]//第六届全国人机语音通讯学术会议论文集.深圳,2001.

[8]Fine S, Navratil J, Gopinath R A. A Hybrid GMM/SVM Approach to Speaker Identification[C]//Proceedings of 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2001, 1:417-420.

[9]凌震华.基于统计声学建模的语音合成技术研究[D].合肥:中国科学技术大学,2008.

[10]范睿,鲍长春,李锐.基于ACELP的嵌入式语音编码算法[J].通信学报,2007,28(10):48-54.

[11]Hinton G E, Salakhutdinov R R. Reducing the Dimensionality of Data with Neural Networks[J]. Science, 2006, 313(5786):504-507.

[12]Seide F, Li G, Yu D. Conversational Speech Transcription Using Context-Dependent Deep Neural Networks[C]//Interspeech, 2011:437-440.

[13]邓力,俞栋.深度学习方法及应用[M].谢磊,译.北京:机械工业出版社,2016.

[14]殷翔.语音合成中的神经网络声学建模方法研究[D].合肥:中国科学技术大学,2016.