کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
386729 660890 2010 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A novel dual wing harmonium model aided by 2-D wavelet transform subbands for document data mining
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
A novel dual wing harmonium model aided by 2-D wavelet transform subbands for document data mining
چکیده انگلیسی

A novel dual wing harmonium model that integrates multiple features including term frequency features and 2-D wavelet transform features into a low dimensional semantic space is proposed for the applications of document classification and retrieval. Terms are extracted from the graph representation of document by employing weighted feature extraction method. 2-D wavelet transform is used to compress the graph due to its sparseness while preserving the basic document structure. After transform, low-pass subbands are stacked to represent the term associations in a document. We then develop a new dual wing harmonium model projecting these multiple features into low dimensional latent topics with different probability distributions assumption. Contrastive divergence algorithm is used for efficient learning and inference. We perform extensive experimental verification in document classification and retrieval, and comparative results suggest that the proposed method delivers better performance than other methods.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 37, Issue 6, June 2010, Pages 4403–4412
نویسندگان
, , ,