کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6945046 1450454 2018 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Dynamic reliability management based on resource-based EM modeling for multi-core microprocessors
ترجمه فارسی عنوان
مدیریت قابلیت اطمینان دینامیکی بر اساس مدل سازی مبتنی بر منابع برای میکروپروسسورهای چند هسته ای
کلمات کلیدی
الکترو مهاجرت، مدیریت قابلیت اطمینان، منابع مبتنی بر، مدل سازی قابلیت اطمینان،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر سخت افزارها و معماری
چکیده انگلیسی
This article presents a new approach for system-level reliability management for multi/many core microprocessors. In the new approach, the electro-migration (EM) induced time to failure (TTF) at the system level is modeled as a resource, which is abstracted, from a recently proposed physics-based EM model, at the chip level. In this model, a single core can spend the TTF resources at different rates specified by the temperature and the related power consumption. As a result, the new resource-based EM model allows more flexible EM-reliability management for multi/many-core systems. As an application of the new model, we propose a novel task migration method to explicitly balance consumption of EM resources for all the cores. The new method aims at equalizing the probability of failure of each core, which will maximize the lifetime of the whole multi/many core system. To more efficiently regulate the lifetime of the multi/many core system, dynamic voltage and frequency scaling (by using different performance states (p-states), which can represent different operating voltages and frequencies) is further employed to compensate for the excessively consumed lifetime of all the cores when the chip is loaded with heavy tasks for a certain period of time. In this way, the TTF of all the cores could be compensated to meet the lifetime requirement, giving the multi/many core system more flexibility to handle heavy task assignment on demand. Experimental results on a 36-core processor platform show that the proposed task migration scheme could balance the lifetime consumption of all the cores, and maintain even-distributed TTF slacks across different cores, while the existing temperature-based task migration schemes lead to diverged TTF of cores. The results also show that the TTF consumption could be easily compensated for by switching to a low p-state, which could not be achieved if TTF consumption diverged across different cores.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Microelectronics Journal - Volume 74, April 2018, Pages 106-115
نویسندگان
, , ,