Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
563462 | Computer Speech & Language | 2007 | 21 Pages |
Abstract
We describe a dynamic Bayesian network for articulatory feature recognition. The model is intended to be a component of a speech recognizer that avoids the problems of conventional “beads-on-a-string” phoneme-based models. We demonstrate that the model gives superior recognition of articulatory features from the speech signal compared with a state-of-the-art neural network system. We also introduce a training algorithm that offers two major advances: it does not require time-aligned feature labels and it allows the model to learn a set of asynchronous feature changes in a data-driven manner.
Related Topics
Physical Sciences and Engineering
Computer Science
Signal Processing
Authors
Joe Frankel, Mirjam Wester, Simon King,