Performing Fine Phonetic Distinctions: Templates versus Features

doi:10.4324/9781315802350-15

ABSTRACT

Despite intensive research in computer speech recognition during the past 10 years, there is still a very large gap between human and machine recognition of speech. Human speech perception is robust and flexible: A person can recognize a novel sentence produced by an unfamiliar talker in a background of other conversations. By comparison, computer speech-recognition systems typically require training to each new speaker and perform well only when word choice is limited to a small number of acoustically distinct items.