کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
524660 868815 2012 32 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Trace profiling: Scalable event tracing on high-end parallel systems
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
پیش نمایش صفحه اول مقاله
Trace profiling: Scalable event tracing on high-end parallel systems
چکیده انگلیسی

Accurate performance analysis of high end systems requires event-based traces to correctly identify the root cause of a number of the complex performance problems that arise on these highly parallel systems. These high-end architectures contain tens to hundreds of thousands of processors, pushing application scalability challenges to new heights. Unfortunately, the collection of event-based data presents scalability challenges itself: the large volume of collected data increases tool overhead, and results in data files that are difficult to store and analyze. Our solution to these problems is a new measurement technique called trace profiling that collects the information needed to diagnose performance problems that traditionally require traces, but at a greatly reduced data volume. The trace profiling technique reduces the amount of data stored by capitalizing on the repeated behavior of programs, and on the similarity of the behavior and performance of parallel processes in an application run. Trace profiling is a hybrid between profiling and tracing, collecting summary information about the event patterns in an application run. Because the data has already been classified into behavior categories, we can present reduced, partially analyzed performance data to the user, highlighting the performance behaviors that comprised most of the execution time.


► Trace profiling collects event trace data at a greatly reduced data volume.
► Trace profiling reduces traces by identifying repeated behavior patterns.
► Trace profiling can reduce program perturbation by reducing write overheads.
► Scalable visualization is achieved through pattern detection-based trace reduction.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Parallel Computing - Volume 38, Issues 4–5, April–May 2012, Pages 194–225
نویسندگان
, ,