Text classification using graph mining-based feature extraction

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
403080	677048	2010	7 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Feature extraction - استخراج ویژگی Text classification - طبقه بندی متن Graph mining - معدن گراف Graph representation - نمایش گراف

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش صفحه اول مقاله

Text classification using graph mining-based feature extraction

چکیده انگلیسی

A graph-based approach to document classification is described in this paper. The graph representation offers the advantage that it allows for a much more expressive document encoding than the more standard bag of words/phrases approach, and consequently gives an improved classification accuracy. Document sets are represented as graph sets to which a weighted graph mining algorithm is applied to extract frequent subgraphs, which are then further processed to produce feature vectors (one per document) for classification. Weighted subgraph mining is used to ensure classification effectiveness and computational efficiency; only the most significant subgraphs are extracted. The approach is validated and evaluated using several popular classification algorithms together with a real world textual data set. The results demonstrate that the approach can outperform existing text classification algorithms on some dataset. When the size of dataset increased, further processing on extracted frequent features is essential.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Knowledge-Based Systems - Volume 23, Issue 4, May 2010, Pages 302–308

نویسندگان

Chuntao Jiang, Frans Coenen, Robert Sanderson, Michele Zito,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Text classification using graph mining-based feature extraction

دسترسی سریع

ارتباط

English Website