Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
6906019 | Astronomy and Computing | 2018 | 14 Pages |
Abstract
Cosmological N-body simulations play a vital role in studying models for the evolution of the Universe. To compare to observations and make a scientific inference, statistic analysis on large simulation datasets, e.g., finding halos, obtaining multi-point correlation functions, is crucial. However, traditional in-memory methods for these tasks do not scale to the datasets that are forbiddingly large in modern simulations. Our prior paper (Liu et al., 2015) proposes memory-efficient streaming algorithms that can find the largest halos in a simulation with up to 109 particles on a small server or desktop. However, this approach fails when directly scaling to larger datasets. This paper presents a robust streaming tool that leverages state-of-the-art techniques on GPU boosting, sampling, and parallel I/O, to significantly improve performance and scalability. Our rigorous analysis of the sketch parameters improves the previous results from finding the centers of the 103 largest halos (Liu et al., 2015) to â¼104â105, and reveals the trade-offs between memory, running time and number of halos. Our experiments show that our tool can scale to datasets with up to â¼1012 particles while using less than an hour of running time on a single GPU Nvidia GTX 1080.
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Science Applications
Authors
N. Ivkin, Z. Liu, L.F. Yang, S.S. Kumar, G. Lemson, M. Neyrinck, A.S. Szalay, V. Braverman, T. Budavari,