Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
536056 | Pattern Recognition Letters | 2010 | 8 Pages |
Abstract
Audio classification typically involves feeding a fixed set of low-level features to a machine learning method, then performing feature aggregation before or after learning. Instead, we jointly learn a selection and hierarchical temporal aggregation of features, achieving significant performance gains.
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Vision and Pattern Recognition
Authors
Paul Ruvolo, Ian Fasel, Javier R. Movellan,