A fast algorithm for identifying friends-of-friends halos

Article ID	Journal	Published Year	Pages	File Type
4963644	Astronomy and Computing	2017	8 Pages	PDF

Abstract

We describe a simple and fast algorithm for identifying friends-of-friends features and prove its correctness. The algorithm avoids unnecessary expensive neighbor queries, uses minimal memory overhead, and rejects slowdown in high over-density regions. We define our algorithm formally based on pair enumeration, a problem that has been heavily studied in fast 2-point correlation codes and our reference implementation employs a dual KD-tree correlation function code. We construct features in a hierarchical tree structure, and use a splay operation to reduce the average cost of identifying the root of a feature from O[logL] to O[1] (L is the size of a feature) without additional memory costs. This reduces the overall time complexity of merging trees from O[LlogL] to O[L], reducing the number of operations per splay by orders of magnitude. We next introduce a pruning operation that skips merge operations between two fully self-connected KD-tree nodes. This improves the robustness of the algorithm, reducing the number of merge operations in high density peaks from O[Î´2] to O[Î´]. We show that for cosmological data set the algorithm eliminates more than half of merge operations for typically used linking lengths bâ¼0.2 (relative to mean separation). Furthermore, our algorithm is extremely simple and easy to implement on top of an existing pair enumeration code, reusing the optimization effort that has been invested in fast correlation function codes.

Keywords

Algorithm Simulation Feature identification Halo Cosmology