کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
515611 867049 2012 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
The tipping point: F-score as a function of the number of retrieved items
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
پیش نمایش صفحه اول مقاله
The tipping point: F-score as a function of the number of retrieved items
چکیده انگلیسی

One of the best known measures of information retrieval (IR) performance is the F-score, the harmonic mean of precision and recall. In this article we show that the curve of the F-score as a function of the number of retrieved items is always of the same shape: a fast concave increase to a maximum, followed by a slow decrease. In other words, there exists a single maximum, referred to as the tipping point, where the retrieval situation is ‘ideal’ in terms of the F-score. The tipping point thus indicates the optimal number of items to be retrieved, with more or less items resulting in a lower F-score. This empirical result is found in IR and link prediction experiments and can be partially explained theoretically, expanding on earlier results by Egghe. We discuss the implications and argue that, when comparing F-scores, one should compare the F-score curves’ tipping points.


► The F-score curve has a distinct shape: a concave increase followed by a convex decrease.
► There is a single maximum (the tipping point) where the retrieval situation is ‘best’ in terms of the F-score.
► We show empirical F-score curves for IR and link prediction experiments.
► The shape of the curve can be explained using a continuous model.
► When comparing F-scores, one should compare the F-score curves’ tipping points.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Processing & Management - Volume 48, Issue 6, November 2012, Pages 1171–1180
نویسندگان
, , ,