دانلود رایگان مقاله: از محتوا به لینک: تعبیه تصویر اجتماعی با مدل عمیق چندجمله ای

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
10151016	1666104	2018	14 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

From content to links: Social image embedding with deep multimodal model

ترجمه فارسی عنوان

از محتوا به لینک: تعبیه تصویر اجتماعی با مدل عمیق چندجمله ای

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

تعبیه تصویر اجتماعی، تعبیه شبکه، مدل توجه، شبکه سیآیای-سه گانه،

Network embedding - تعبیه شبکه Attention model - مدل توجه

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش مقاله

از محتوا به لینک: تعبیه تصویر اجتماعی با مدل عمیق چندجمله ای

چکیده انگلیسی

With the popularity of social network, social media data embedding has attracted extensive research interest and boomed many applications, such as image classification and cross-modal retrieval. In this paper, we examine the scenario of social images containing multimodal content (e.g., visual content and textual tags) and connecting with each other (e.g., two images submitted to the same group). In such a case, both the multimodal content and link information provide useful clues for representation learning. Therefore, simply learning the embedding from network structure or data content results in sub-optimal social image representation. In this paper, we propose a Deep Multimodal Attention Networks (DMAN) to combine multimodal content and link information for social image embedding. Specifically, to effectively incorporate the multimodal content, a visual-textual attention model is proposed to encode the fine-granularity correlation between multimodal content, i.e., the alignment between image regions and textual words. To incorporate the network structure for embedding learning, a novel Siamese-Triplet neural network is proposed to model the first-order proximity and the second-order proximity among images. Then the two modules are integrated into a joint deep model for social image embedding. Once the representation has been learned, a wide variety of data mining problems can be solved by using the task-specific algorithms designed for handling vector representations. Extensive experiments are conducted to demonstrate the effectiveness of our approach on multi-label classification and cross-modal search.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Knowledge-Based Systems - Volume 160, 15 November 2018, Pages 251-264

نویسندگان

Feiran Huang, Xiaoming Zhang, Zhoujun Li, Zhonghua Zhao, Yueying He,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : از محتوا به لینک: تعبیه تصویر اجتماعی با مدل عمیق چندجمله ای

دسترسی سریع

ارتباط

English Website