Article ID Journal Published Year Pages File Type
1153003 Statistics & Probability Letters 2010 6 Pages PDF
Abstract

Suppose a trial has rr possible outcomes, with the ii-th outcome having probability pipi, ∑i=1rpi=1 and p=(p1,⋯,pr). The outcome of nn independent trials can be described by the frequencies k=(k1,…,kr), ∑i=1rki=n, where outcome ii was selected with frequency ki∈Nki∈N. A mode (there can be multiple modes) is an outcome with maximal likelihood over all possible outcomes.Despite the ubiquity of multinomial distributions in statistical applications, the best algorithm to date for finding modes has unknown computational complexity and highly variable performance in practice. It is also vulnerable to precision problems due to accumulated roundoff error.We propose a new algorithm, GreedyModeFind, for calculating the mode(s) of this distribution, given nn and p. GreedyModeFind is simple, efficient and numerically robust, requiring O(rlogr)O(rlogr) time and O(r)O(r) space to find one mode. A concise representation of the full set of joint modes can be found for an additional time cost of O(rlogr)O(rlogr). In practice this algorithm drastically improves on the performance of earlier algorithms. We provide a freely available C++ implementation of GreedyModeFind.

Related Topics
Physical Sciences and Engineering Mathematics Statistics and Probability
Authors
, ,