Article code: 10481954
Journal code: 933248
Publication year: 2013
English article: 9 pages, PDF
Full-text version: free download
English title of the ISI article
Keyword extraction by entropy difference between the intrinsic and extrinsic mode
Persian translation of the title
Keyword extraction by the entropy difference between the intrinsic and extrinsic modes
Keywords
Keyword extraction, entropy difference, intrinsic mode, extrinsic mode
Related topics
Engineering and Basic Sciences > Mathematics > Mathematical Physics
English abstract
This paper proposes a new metric to evaluate and rank the relevance of words in a text. The method uses the difference in Shannon entropy between the intrinsic and extrinsic modes. It rests on the observation that relevant words reflect the author's writing intention, i.e., their occurrences are modulated by the author's purpose, while irrelevant words are distributed randomly in the text. Using The Origin of Species by Charles Darwin as a representative text sample, we demonstrate the performance of our detector and compare it with previous proposals. Since no reference text corpus is needed, our approach is especially suitable for single documents for which no a priori information is available.
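The abstract describes the idea only in words, so the following is a minimal Python sketch of one possible reading, not the paper's actual formulas. It assumes that the gaps between successive occurrences of a word are split at their mean into a short-gap ("intrinsic") group and a long-gap ("extrinsic") group, and that the score is the difference of the Shannon entropies of the gap lengths in the two groups. The function names, the splitting rule, and the sign of the difference are all assumptions made for illustration.

```python
import math
from collections import Counter


def shannon_entropy(values):
    """Shannon entropy (in bits) of the empirical distribution of `values`."""
    if not values:
        return 0.0
    counts = Counter(values)
    total = len(values)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def entropy_difference(tokens, word):
    """Illustrative score: entropy of long gaps minus entropy of short gaps.

    Assumption (not taken from the paper): gaps between successive
    occurrences are split at their mean into two groups.
    """
    positions = [i for i, t in enumerate(tokens) if t == word]
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    if len(gaps) < 2:
        return 0.0  # too few occurrences to compare two groups
    mean_gap = sum(gaps) / len(gaps)
    intrinsic = [g for g in gaps if g <= mean_gap]  # short gaps (clusters)
    extrinsic = [g for g in gaps if g > mean_gap]   # long gaps (between clusters)
    return shannon_entropy(extrinsic) - shannon_entropy(intrinsic)


def rank_words(tokens):
    """Return (word, score) pairs sorted by decreasing score."""
    scores = {w: entropy_difference(tokens, w) for w in set(tokens)}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
```

Calling `rank_words(text.lower().split())` on a single document returns candidate keywords in descending order of this illustrative score; no reference corpus is involved, which matches the single-document setting the abstract emphasizes.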
Publisher
Database: Elsevier - ScienceDirect
Journal: Physica A: Statistical Mechanics and its Applications - Volume 392, Issue 19, 1 October 2013, Pages 4523-4531