Article ID Journal Published Year Pages File Type
515729 Information Processing & Management 2008 14 Pages PDF
Abstract

The continuous version of the Lotka distribution, more generally referred to outside of informetrics as the Pareto distribution, has long enjoyed a central position in the theoretical development of informetrics despite several reported drawbacks in modelling empirical data distributions, most particularly that the inverse power form seems mainly to be evident only in the upper tails. We give a number of published examples graphically illustrating this shortcoming. In seeking to overcome this, we here draw attention to an intuitively reasonable generalization of the Pareto distribution, namely the Pareto type II distribution, of which we consider two versions. We describe its basic properties and some statistical features together with concentration aspects and argue that, at least in qualitative terms, it is better able to describe many observed informetric phenomena over the full range of the distribution. Suggestions for further investigations, including truncated and time-dependent versions, are also given.

Related Topics
Physical Sciences and Engineering Computer Science Computer Science Applications
Authors
,