Article code | Journal code | Publication year | English article | Full-text version
402146 | 676862 | 2016 | 13 pages, PDF | Free download
English title of the ISI article
Estimating term domain relevance through term frequency, disjoint corpora frequency - tf-dcf
Related topics
Engineering and Basic Sciences, Computer Engineering, Artificial Intelligence
English abstract

This paper proposes a new relevance index for terms extracted from domain corpora. We call it term frequency, disjoint corpora frequency (tf-dcf); it is based on the absolute frequency of each term in the domain corpus, tempered by the term's frequency in other (contrasting) corpora. We discuss how the proposed index differs, both conceptually and in its mathematical computation, from similar approaches that also take contrasting corpora into account. To illustrate the efficiency of the index, the paper evaluates tf-dcf against those approaches. Finally, further experiments analyze how tf-dcf behaves depending on the characteristics of the contrasting corpora.
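
The abstract does not give the formula, so the following is only a minimal sketch of the idea it describes: a term's raw frequency in the domain corpus is divided by a penalty that grows with the term's frequency in each contrasting corpus. The specific damping used here, a product of `1 + log2(1 + frequency)` over the contrasting corpora, is an assumption made for illustration, as are the function name `tf_dcf` and the toy corpora.

```python
import math
from collections import Counter


def tf_dcf(term, domain_counts, contrasting_counts):
    """Illustrative score: domain frequency tempered by contrasting corpora.

    Assumption: the penalty is the product, over all contrasting corpora,
    of 1 + log2(1 + frequency of the term in that corpus).
    """
    tf = domain_counts.get(term, 0)
    penalty = 1.0
    for counts in contrasting_counts:
        penalty *= 1.0 + math.log2(1.0 + counts.get(term, 0))
    return tf / penalty


# Toy corpora (hypothetical data, already tokenized by whitespace).
domain = Counter("kernel scheduler kernel process memory kernel process".split())
contrast_a = Counter("process payment invoice process customer".split())
contrast_b = Counter("patient memory clinic".split())

scores = {t: tf_dcf(t, domain, [contrast_a, contrast_b]) for t in domain}

# Rank terms from most to least domain-relevant under this sketch.
for term, score in sorted(scores.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{term}: {score:.3f}")
```

Under this sketch, a term that never occurs in a contrasting corpus keeps its raw frequency, while a term that is frequent in the domain corpus but also appears elsewhere (here `process`) is pushed down the ranking.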

Publisher
Database: Elsevier - ScienceDirect
Journal: Knowledge-Based Systems - Volume 97, 1 April 2016, Pages 237–249