کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6926044 1448889 2018 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Term discrimination for text search tasks derived from negative binomial distribution
ترجمه فارسی عنوان
تبعیض دوره ای برای وظایف جستجوی متن حاصل از توزیع دوتایی منفی است
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
چکیده انگلیسی
Accurate term discrimination in information retrieval is essential for identifying important terms in specific documents. In addition to the widely known inverse document frequency (IDF) method, alternative approaches such as the residual inverse document frequency (RIDF) scheme have been introduced for term discrimination. However, existing methods' performance is not unconditionally convincing. We propose a new collection frequency weighting scheme derived from the negative binomial distribution model of term occurrences. Factorial experiments were performed to examine potential interaction effect between collection frequency weight methods and term frequency weight methods according to the mean average precision and normalized discounted cumulative gain performance assessors. The results indicate that our proposed term discrimination method offers a significant gain in accuracy as compared to the IDF and RIDF scheme. This finding is reinforced by the fact that the results show no interaction effects among factors.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Processing & Management - Volume 54, Issue 3, May 2018, Pages 370-379
نویسندگان
, , ,