Towards understanding HPC users and systems: A NERSC case study

Article code	Journal code	Publication year	English article	Full-text version
4951516	1441474	2018	16-page PDF	Free download

Keywords

HPC; Supercomputer; Scheduling; Workload analysis; k-Means; Heterogeneity

Related topics

Engineering and Basic Sciences, Computer Engineering, Computational Theory and Mathematics

Abstract

- A method for analyzing HPC system workloads is proposed, including detailed per-year and time-evolution analyses.
- A method to measure heterogeneity in job geometry is proposed.
- The state of the workload of three reference HPC systems is presented.
- Job geometry heterogeneity in a queue is shown to affect wait time predictability.

The high performance computing (HPC) scheduling landscape currently faces new challenges due to changes in the workload. Previously, HPC centers were dominated by tightly coupled MPI jobs. HPC workloads now increasingly include high-throughput, data-intensive, and stream-processing applications. As a consequence, workloads are becoming more diverse at both the application and job levels, posing new challenges to classical HPC schedulers. There is a need to understand current HPC workloads and their evolution to facilitate informed future scheduling research and enable efficient scheduling in future HPC systems.

In this paper, we present a methodology to characterize workloads and assess their heterogeneity, both within a particular time period and as it evolves over time. We apply this methodology to the workloads of three systems (Hopper, Edison, and Carver) at the National Energy Research Scientific Computing Center (NERSC). We present the resulting characterization of jobs, queues, heterogeneity, and performance, which includes detailed information on a year of workload (2014) and its evolution through the systems' lifetime (2010-2014).

Publisher

Database: Elsevier - ScienceDirect
Journal: Journal of Parallel and Distributed Computing - Volume 111, January 2018, Pages 206-221

Authors

Gonzalo P. Rodrigo, P.-O. Östberg, Erik Elmroth, Katie Antypas, Richard Gerber, Lavanya Ramakrishnan
