دانلود رایگان مقاله: نمایندگی آموزش و نمایندگی تصویر دیجیتال با ظاهر عمیق و برنامه نویسی فضایی

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
525690	869012	2015	9 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Learning representative and discriminative image representation by deep appearance and spatial coding

ترجمه فارسی عنوان

نمایندگی آموزش و نمایندگی تصویر دیجیتال با ظاهر عمیق و برنامه نویسی فضایی

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

طبقه بندی عکس، یادگیری عمیق، اسپارتی سازه

Structured sparsity - اسپارتی سازه Image classification - طبقه بندی تصویر Deep learning - یادگیری عمیق

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو

پیش نمایش مقاله

نمایندگی آموزش و نمایندگی تصویر دیجیتال با ظاهر عمیق و برنامه نویسی فضایی

چکیده انگلیسی

• Propose a deep appearance and spatial coding model to learn image representation.
• Adopt stacked ISA network combined with sparse RBM to perform appearance coding.
• Adopt over-complete spatial max-pooling to incorporate various spatial information.
• Introduce a structured sparse Auto-encoder to carry out spatial coding.
• The experimental performances on five challenging benchmark datasets outperform baselines and related work.

How to build a suitable image representation remains a critical problem in computer vision. Traditional Bag-of-Feature (BoF) based models build image representation by the pipeline of local feature extraction, feature coding and spatial pooling. However, three major shortcomings hinder the performance, i.e., the limitation of hand-designed features, the discrimination loss in local appearance coding and the lack of spatial information. To overcome the above limitations, in this paper, we propose a generalized BoF-based framework, which is hierarchically learned by exploring recently developed deep learning methods. First, with raw images as input, we densely extract local patches and learn local features by stacked Independent Subspace Analysis network. The learned features are then transformed to appearance codes by sparse Restricted Boltzmann Machines. Second, we perform spatial max-pooling on a set of over-complete spatial regions, which is generated by covering various spatial distributions, to incorporate more flexible spatial information. Third, a structured sparse Auto-encoder is proposed to explore the region representations into the image-level signature. To learn the proposed hierarchy, we layerwise pre-train the network in unsupervised manner, followed by supervised fine-tuning with image labels. Extensive experiments on different benchmarks, i.e., UIUC-Sports, Caltech-101, Caltech-256, Scene-15 and MIT Indoor-67, demonstrate the effectiveness of our proposed model.

Figure optionsDownload high-quality image (304 K)Download as PowerPoint slide

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Vision and Image Understanding - Volume 136, July 2015, Pages 23–31

نویسندگان

Bingyuan Liu, Jing Liu, Hanqing Lu,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : نمایندگی آموزش و نمایندگی تصویر دیجیتال با ظاهر عمیق و برنامه نویسی فضایی

دسترسی سریع

ارتباط

English Website