In this paper we provide a formal description of a speech recognizer whose design is based on elaborate articulatory timing that is asynchronous across multiple articulatory-feature dimensions. Three recently improved critical components of the recognizer are described in detail. We report evaluation results from a standard TIMIT phonetic recognition task, confined to an N-best rescoring scenario, comparing the new feature-based recognizer with a recognizer that uses conventional context-dependent triphone units. The results demonstrate that the rescored N-best list from the feature-based recognizer is of overall superior quality to that from the triphone-based recognizer. Greater performance improvements are observed as the number of top candidate sentences (N) increases.
|Pages (from - to)||385-388|
|Journal||ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings|
|Publication status||Published - Jan 1 1995|
|Event||Proceedings of the 1995 20th International Conference on Acoustics, Speech, and Signal Processing. Part 1 (of 5) - Detroit, MI, USA|
Duration: May 9 1995 → May 12 1995