Dynamic
Thermal Management in CMPs
High Performance Computing Laboratory
Chip
Multiprocessors (CMPs) have been prevailing in the modern microprocessor
market. As the significant heat is converted by the ever-increasing power
density and current leakage, the raised operating temperature in a chip
have already threatened the system reliability and led the thermal
control to be one of the most important issues needed to be addressed
immediately in the chip design. Due to the cost and complexity of
designing thermal packaging, many Dynamic Thermal Management (DTM)
schemes have been wildly adopted in the modern processors as a technique
to control CPU power dissipation. However, it is known that the overall
temperature of a CMPs is highly correlated with
temperature of each core in the CMPs environments; hence, the thermal
model for uniprocessor environments cannot be directly applied in CMPs
due to the potential heterogeneity. To our best knowledge, none of prior
DTM schemes considers the thermal correlation effect among neighboring
cores, neither the dynamic workload behaviors which present different
thermal behaviors. We believe that it is necessary to develop an
efficient online workload estimation scheme for DTM to be applicable to
the real world applications which have variable workload behaviors and
different thermal contributions to the increased chip temperature.
Predictive
Dynamic Thermal Management
Recently, processor power density has been increasing at an alarming rate
resulting in high on-chip temperature. Higher temperature increases
current leakage and causes poor re- liability. In this work, we propose a
Predictive Dynamic Thermal Management (PDTM) based on Application-based
Thermal Model (ABTM) and Core-based Thermal Model (CBTM) in the multicore
systems. ABTM predicts future temperature based on the application speci?c thermal be- havior, while CBTM estimates core temperature pattern
by steady state temperature and workload. The accuracy of our prediction
model is 1.6% error in average compared to the model in HybDTM, which has at most 5% error. Based on
predicted temperature from ABTM and CBTM, the pro- posed PDTM can
maintain the system temperature below a desired level by moving the
running application from the possible overheated core to the future
coolest core (migra- tion)
and reducing the processor resources (priority schedul-
ing) within multicore systems. PDTM enables the
explo- ration of the tradeoff between
throughput and fairness in temperature-constrained multicore systems.
We implement PDTM
on Intel's Quad-Core system with a specific device driver to access
Digital Thermal Sensor (DTS). Compared against Linux standard scheduler,
PDTM can decrease av- erage
temperature about 10%, and peak temperature by 5
degrees with negligible impact of performance under 1%, while running
single SPEC2006 benchmark. Moreover, our PDTM outperforms HRTM [10] in
reducing average temperature by about 7% and peak temperature by about 3
degrees with perfor- mance
overhead by 0.15% when running single benchmark.
Comparisons between
without DTM and PDTM
|
|
|
Without DTM
|
PDTM
|
Hybrid
Dynamic Thermal Management Multimedia applications
become one of the most popular applications in mobile devices such as
wireless phones, PDAs, and laptops. However, typical mobile systems are
not equipped with cooling components, which eventually causes critical
thermal deficiencies. Although many low-power and low-temperature
multimedia playback techniques have been proposed, they failed to provide
QoS (Quality of Service) while controlling
temperature due to the lack of proper understanding of multimedia
applications. We propose Hybrid Dynamic Thermal Management (HDTM) which
exploits thermal characteristics of both multimedia applica-
tions and systems. Specifically, we model
application characteristics as the probability distribution of the number
of cycles required to decode a frame. We also improve existing system
thermal models by considering the effect of workload. This scheme finds an
optimal clock frequency in order to prevent overheating with minimal
performance degradation at runtime.
The
proposed scheme is implemented on Linux in a Pentium- M processor which
provides variable clock frequencies. In or- der to evaluate the performance
of the proposed scheme, we exploit three major codecs, namely MPEG-4,
H.264/AVC and H.264/AVC streaming. Our results show that HDTM lowers the
overall temperature by 15 degrees and the peak temperature by 20 degrees,
while maintaining frame drop ratio under 0.2% compared to previous
thermal management schemes such as feedback control DTM, Frame-based DTM
and GOP-based DTM.
Instructions and Frequency
|
|
|
The
number of instructions
|
The
estimated frequency
|
Correlation-Aware
Thermal Management The overall temperature of a CMPs is highly correlated with temperature of
each core in the CMPs environments; hence, the thermal model for
uniprocessor environments cannot be directly applied in CMPs due to the
potential heterogeneity. To our best knowledge, none of prior DTM schemes
considers the thermal correlation effect among neighboring cores, neither
the dynamic workload behaviors which present different thermal behaviors.
We believe that it is necessary to develop an efficient online workload
estimation scheme for DTM to be applicable to the real world applications
which have variable workload behaviors and different thermal
contributions to the increased chip temperature. In this work, we propose
a light runtime workload estimation using the cumulative distribution
function to observe the processes¡¯ dynamic workload behaviors, and
present a proper thermal model for CMPs systems to analyze the thermal
correlation effect by profiling the thermal impacts from neighboring
cores under the specific workload. Hence, according to the estimated
representative workload and modeled thermal correlation effect, we
estimate each core¡¯s future temperature more accurately with only 2.4%
error in average. Next, Proactive Correlation-Aware Thermal Management (ProCATM) is introduced to avoid thermal emergencies
and provide thermal fairness with negligible performance overhead.
we
implement and evaluate ProCATM in an Intel Quad
Core Q6600 and an Intel Core i7 965 processor systems running grouped
multimedia application and several benchmarks for server environments.
According to the experimental results, ProCATM
reduces the peak temperature by up to 9.09% and 7.94% in our 4-cores
system and 8-cores system with only 2.28% and 0.54% performance overhead
respectively compared to the Linux standard scheduler.
Correlation-Aware Thermal Management
|
|
|
System Overview
|
|
- "Temperature-Aware Scheduler Based on
Thermal Behavior Grouping in Multicore Systems," in Design,
Automation & Test in Europe (DATE 2009), Nice, France, April,
2009.
|