Article code: 4968309
Journal code: 1449572
Publication year: 2016
English article: 22 pages, PDF
Full-text version: free download
English title of the ISI article
Continuous whole-system monitoring toward rapid understanding of production HPC applications and systems
Related topics
Engineering and Basic Sciences, Computer Engineering, Computer Science Software
English abstract
In this paper, we present both system and application profiling results based on data obtained through synchronized system-wide monitoring on a production HPC cluster at Sandia National Laboratories (SNL). We demonstrate analytic and visualization techniques that we are using to characterize application and system resource usage under production conditions for better understanding of application resource needs. Our goals are to improve application performance (through understanding application-to-resource mapping and system throughput) and to ensure that future system capabilities match their intended workloads.
Publisher
Database: Elsevier - ScienceDirect
Journal: Parallel Computing - Volume 58, October 2016, Pages 90-106