Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
534856 | Pattern Recognition Letters | 2011 | 9 Pages |
We address the balanced clustering problem where cluster sizes are regularized with submodular functions. The objective function for balanced clustering is a submodular fractional function, i.e., the ratio of two submodular functions, and thus includes the well-known ratio cuts as special cases. In this paper, we present a novel algorithm for minimizing this objective function (submodular fractional programming) using recent submodular optimization techniques. The main idea is to utilize an algorithm to minimize the difference of two submodular functions, combined with the discrete Newton method. Thus, it can be applied to the objective function involving any submodular functions in both the numerator and the denominator, which enables us to design flexible clustering setups. We also give theoretical analysis on the algorithm, and evaluate the performance through comparative experiments with conventional algorithms by artificial and real datasets.
Research highlights► Clustering where cluster sizes are balanced with submodular functions is addressed. ► A discrete Newton method is applied to submodular fractional programming. ► Any submodular functions can be involved in both the numerator and denominator. ► The algorithm is applied to document co-clustering datasets.