Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
4977872 | Speech Communication | 2016 | 10 Pages |
Abstract
Building on findings from converging linguistic evidence on the gestural model of Articulatory Phonology as well as the neural basis of speech perception, we hypothesize that phonological posteriors convey properties of linguistic classes at multiple time scales, and this information is embedded in their support (index) of active coefficients. To verify this hypothesis, we obtain a binary representation of phonological posteriors at the segmental level which is referred to as first-order sparsity structure; the high-order structures are obtained by the concatenation of first-order binary vectors. It is then confirmed that the classification of supra-segmental linguistic events, the problem known as linguistic parsing, can be achieved with high accuracy using a simple binary pattern matching of first-order or high-order structures.
Related Topics
Physical Sciences and Engineering
Computer Science
Signal Processing
Authors
Milos Cernak, Afsaneh Asaei, Hervé Bourlard,