کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
527072 869282 2013 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Canonical locality preserving Latent Variable Model for discriminative pose inference
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
Canonical locality preserving Latent Variable Model for discriminative pose inference
چکیده انگلیسی

Discriminative approaches for human pose estimation model the functional mapping, or conditional distribution, between image features and 3D poses. Learning such multi-modal models in high dimensional spaces, however, is challenging with limited training data; often resulting in over-fitting and poor generalization. To address these issues Latent Variable Models (LVMs) have been introduced. Shared LVMs learn a low dimensional representation of common causes that give rise to both the image features and the 3D pose. Discovering the shared manifold structure can, in itself, however, be challenging. In addition, shared LVM models are often non-parametric, requiring the model representation to be a function of the training set size. We present a parametric framework that addresses these shortcomings. In particular, we jointly learn latent spaces for both image features and 3D poses by maximizing the non-linear dependencies in the projected latent space, while preserving local structure in the original space; we then learn a multi-modal conditional density between these two low-dimensional spaces in the form of Gaussian Mixture Regression. With this model we can address the issue of over-fitting and generalization, since the data is denser in the learned latent space, as well as avoid the need for learning a shared manifold for the data. We quantitatively compare the performance of the proposed method to several state-of-the-art alternatives, and show that our method gives a competitive performance.

Figure optionsDownload high-quality image (111 K)Download as PowerPoint slideHighlights
► Parametric framework learns multi-modal models in high dimensional spaces.
► Finding latent variables keeping local structure while maximizing the correlation.
► Dealing with multi-modality in the form of GMR.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Image and Vision Computing - Volume 31, Issue 3, March 2013, Pages 223–230
نویسندگان
, , , ,