Article ID Journal Published Year Pages File Type
10358557 Journal of Informetrics 2013 10 Pages PDF
Abstract
From the way that it was initially defined (Hirsch, 2005), the h-index naturally encourages focus on the most highly cited publications of an author and this in turn has led to (predominantly) a rank-based approach to its investigation. However, Hirsch (2005) and Burrell (2007a) both adopted a frequency-based approach leading to general conjectures regarding the relationship between the h-index and the author's publication and citation rates as well as his/her career length. Here we apply the distributional results of Burrell, 2007a, Burrell, 2013b to three published data sets to show that a good estimate of the h-index can often be obtained knowing only the number of publications and the number of citations. (Exceptions can occur when an author has one or more “outliers” in the upper tail of the citation distribution.) In other words, maybe the main body of the distribution determines the h-index, not the wild wagging of the tail. Furthermore, the simple geometric distribution turns out to be the key.
Related Topics
Physical Sciences and Engineering Computer Science Computer Science Applications
Authors
,