کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
4956238 | 1444446 | 2017 | 13 صفحه PDF | دانلود رایگان |
- By supporting new techniques such as branch prediction, variable configuration methods for setup overhead mitigation, and revisions to the physical compiler, CCUs are now able to demonstrate performance and power consumption improvements in comparison to its previous work.
- This work uses Gem5 and OpenMP framework, with Berkeley Dwarf application testing.
- Full RTL prototypes have now been developed. Accordingly, this work demonstrates an average 1.29x decrease in area overhead, and 55% of the power consumption required of a conventional OoO core.
Multicore processors are customary within current generation computing systems. The overall concept of general purpose processing however remains a challenge as architects must provide increased performance for each advancing generation without solely relying on transistor scaling and additional cache levels. Although architects have steered towards heterogeneity to increase the performance and efficiency for a variety of workloads, the fundamental issue of how a single core's architecture may be improved and applied to the multiprocessor domain remains. This work builds upon the concept of Configurable Computing Units (CCU) - a nuanced approach to processor architectures and microarchitectures, employing reconfigurable datapaths and task-based execution. This work improves upon the efficiency of CCUs by applying various new design techniques including branch prediction, variable configuration, an OpenMP programming model, and Berkeley Dwarf testing. Experimental results using Gem5 demonstrate that a single CCU core can achieve dual-core performance, with a 1.29x decrease in area overhead and 55% of the power consumption required by a conventional CPU.
Journal: Journal of Systems Architecture - Volume 75, April 2017, Pages 107-119