mirror of https://github.com/torvalds/linux.git
Documentation: thermal: Document thermal throttling on Intel platforms
Add documentation for Intel thermal throttling reporting events. Signed-off-by: Srinivas Pandruvada <srinivas.pandruvada@linux.intel.com> Tested-by: Randy Dunlap <rdunlap@infradead.org> [ rjw: Subject adjustment, file name change, minor edits ] Link: https://patch.msgid.link/20251113212104.221632-1-srinivas.pandruvada@linux.intel.com Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
This commit is contained in:
parent
172880f7c9
commit
3402bc010d
|
|
@ -6,3 +6,4 @@ Thermal Subsystem
|
||||||
:maxdepth: 1
|
:maxdepth: 1
|
||||||
|
|
||||||
intel_powerclamp
|
intel_powerclamp
|
||||||
|
intel_thermal_throttle
|
||||||
|
|
|
||||||
|
|
@ -0,0 +1,91 @@
|
||||||
|
.. SPDX-License-Identifier: GPL-2.0
|
||||||
|
.. include:: <isonum.txt>
|
||||||
|
|
||||||
|
=======================================
|
||||||
|
Intel thermal throttle events reporting
|
||||||
|
=======================================
|
||||||
|
|
||||||
|
:Author: Srinivas Pandruvada <srinivas.pandruvada@linux.intel.com>
|
||||||
|
|
||||||
|
Introduction
|
||||||
|
------------
|
||||||
|
|
||||||
|
Intel processors have built in automatic and adaptive thermal monitoring
|
||||||
|
mechanisms that force the processor to reduce its power consumption in order
|
||||||
|
to operate within predetermined temperature limits.
|
||||||
|
|
||||||
|
Refer to section "THERMAL MONITORING AND PROTECTION" in the "Intel® 64 and
|
||||||
|
IA-32 Architectures Software Developer’s Manual Volume 3 (3A, 3B, 3C, & 3D):
|
||||||
|
System Programming Guide" for more details.
|
||||||
|
|
||||||
|
In general, there are two mechanisms to control the core temperature of the
|
||||||
|
processor. They are called "Thermal Monitor 1 (TM1) and Thermal Monitor 2 (TM2)".
|
||||||
|
|
||||||
|
The status of the temperature sensor that triggers the thermal monitor (TM1/TM2)
|
||||||
|
is indicated through the "thermal status flag" and "thermal status log flag" in
|
||||||
|
MSR_IA32_THERM_STATUS for core level and MSR_IA32_PACKAGE_THERM_STATUS for
|
||||||
|
package level.
|
||||||
|
|
||||||
|
Thermal Status flag, bit 0 — When set, indicates that the processor core
|
||||||
|
temperature is currently at the trip temperature of the thermal monitor and that
|
||||||
|
the processor power consumption is being reduced via either TM1 or TM2, depending
|
||||||
|
on which is enabled. When clear, the flag indicates that the core temperature is
|
||||||
|
below the thermal monitor trip temperature. This flag is read only.
|
||||||
|
|
||||||
|
Thermal Status Log flag, bit 1 — When set, indicates that the thermal sensor has
|
||||||
|
tripped since the last power-up or reset or since the last time that software
|
||||||
|
cleared this flag. This flag is a sticky bit; once set it remains set until
|
||||||
|
cleared by software or until a power-up or reset of the processor. The default
|
||||||
|
state is clear.
|
||||||
|
|
||||||
|
It is possible that when user reads MSR_IA32_THERM_STATUS or
|
||||||
|
MSR_IA32_PACKAGE_THERM_STATUS, TM1/TM2 is not active. In this case,
|
||||||
|
"Thermal Status flag" will read "0" and the "Thermal Status Log flag" will be set
|
||||||
|
to show any previous "TM1/TM2" activation. But since it needs to be cleared by
|
||||||
|
the software, it can't show the number of occurrences of "TM1/TM2" activations.
|
||||||
|
|
||||||
|
Hence, Linux provides counters of how many times the "Thermal Status flag" was
|
||||||
|
set. Also presents how long the "Thermal Status flag" was active in milliseconds.
|
||||||
|
Using these counters, users can check if the performance was limited because of
|
||||||
|
thermal events. It is recommended to read from sysfs instead of directly reading
|
||||||
|
MSRs as the "Thermal Status Log flag" is reset by the driver to implement rate
|
||||||
|
control.
|
||||||
|
|
||||||
|
Sysfs Interface
|
||||||
|
---------------
|
||||||
|
|
||||||
|
Thermal throttling events are presented for each CPU under
|
||||||
|
"/sys/devices/system/cpu/cpuX/thermal_throttle/", where "X" is the CPU number.
|
||||||
|
|
||||||
|
All these counters are read-only. They can't be reset to 0. So, they can potentially
|
||||||
|
overflow after reaching the maximum 64 bit unsigned integer.
|
||||||
|
|
||||||
|
``core_throttle_count``
|
||||||
|
Shows the number of times "Thermal Status flag" changed from 0 to 1 for this
|
||||||
|
CPU since OS boot and thermal vector is initialized. This is a 64 bit counter.
|
||||||
|
|
||||||
|
``package_throttle_count``
|
||||||
|
Shows the number of times "Thermal Status flag" changed from 0 to 1 for the
|
||||||
|
package containing this CPU since OS boot and thermal vector is initialized.
|
||||||
|
Package status is broadcast to all CPUs; all CPUs in the package increment
|
||||||
|
this count. This is a 64-bit counter.
|
||||||
|
|
||||||
|
``core_throttle_max_time_ms``
|
||||||
|
Shows the maximum amount of time for which "Thermal Status flag" has been
|
||||||
|
set to 1 for this CPU at the core level since OS boot and thermal vector
|
||||||
|
is initialized.
|
||||||
|
|
||||||
|
``package_throttle_max_time_ms``
|
||||||
|
Shows the maximum amount of time for which "Thermal Status flag" has been
|
||||||
|
set to 1 for the package containing this CPU since OS boot and thermal
|
||||||
|
vector is initialized.
|
||||||
|
|
||||||
|
``core_throttle_total_time_ms``
|
||||||
|
Shows the cumulative time for which "Thermal Status flag" has been
|
||||||
|
set to 1 for this CPU for core level since OS boot and thermal vector
|
||||||
|
is initialized.
|
||||||
|
|
||||||
|
``package_throttle_total_time_ms``
|
||||||
|
Shows the cumulative time for which "Thermal Status flag" has been set
|
||||||
|
to 1 for the package containing this CPU since OS boot and thermal vector
|
||||||
|
is initialized.
|
||||||
Loading…
Reference in New Issue