Article ID Journal Published Year Pages File Type
542577 Integration, the VLSI Journal 2016 11 Pages PDF
Abstract

•Periodic online detection of permanent faults in a many-core processor.•Testing is based on execution of software-based test routines called test snippets.•Processing core may miss execution of some test snippets stochastically.•Duration of time that a test snippet is available for execution has been optimized.•an extended form of Petri Net has been used to model and analyze the system.

A promising solution to reliability challenges in nano-scale fabrication technologies is self-test and reconfiguration. In this direction, we propose an autonomous test mechanism for online detection of permanent faults in many-core processors. Several hardware test components are incorporated in the many-core architecture. Some of these components distribute software-based self-test routines among the processing cores and make each test routine accessible for a limited amount of time. A processing core that has an idle slot executes the test routine, otherwise it skips it without loss of test continuity. Several components of the proposed test architecture monitor behavior of the processing cores during execution of test routines, detect faulty cores, and make their omission from the system possible. We propose the use of an extended form of Petri NET modeling method to model and analyze the proposed test mechanism and tune our test architecture to preserve quality of test, and at the same time, manage the overall test time. Our experimental results show that test time and hardware overhead of the proposed test mechanism are low and its performance overhead is zero. Furthermore, the proposed test architecture can efficiently scale to a many-core with a large number of processing cores.

Related Topics
Physical Sciences and Engineering Computer Science Hardware and Architecture
Authors
, ,