The h-index: A case of the tail wagging the dog?

Article ID	Journal	Published Year	Pages	File Type
10358557	Journal of Informetrics	2013	10 Pages	PDF

Abstract

From the way that it was initially defined (Hirsch, 2005), the h-index naturally encourages focus on the most highly cited publications of an author and this in turn has led to (predominantly) a rank-based approach to its investigation. However, Hirsch (2005) and Burrell (2007a) both adopted a frequency-based approach leading to general conjectures regarding the relationship between the h-index and the author's publication and citation rates as well as his/her career length. Here we apply the distributional results of Burrell, 2007a, Burrell, 2013b to three published data sets to show that a good estimate of the h-index can often be obtained knowing only the number of publications and the number of citations. (Exceptions can occur when an author has one or more “outliers” in the upper tail of the citation distribution.) In other words, maybe the main body of the distribution determines the h-index, not the wild wagging of the tail. Furthermore, the simple geometric distribution turns out to be the key.

Keywords

Citation distribution Geometric distribution Non-linear equations