Skip to content

Add module_power_instant Metric#3

Merged
NanoNabla merged 1 commit intoscore-p:masterfrom
cvonelm:module_power_instant
Jul 29, 2025
Merged

Add module_power_instant Metric#3
NanoNabla merged 1 commit intoscore-p:masterfrom
cvonelm:module_power_instant

Conversation

@cvonelm
Copy link
Contributor

@cvonelm cvonelm commented Jul 17, 2025

This commit adds the module_power_instant metric.

On GraceHopper systems, this represents the instantaneous power consumption of the complete compute module (CPU + GPU).

This is exactly how "Module Power" is read by nvidia-smi.

The resulting data from this interface is much cleaner than trying to read the hwmon counters documented in [1] (less zero values, greatly decreased jitter of the power readings).

[1] https://docs.nvidia.com/grace-perf-tuning-guide/power-thermals.html

This commit adds the module_power_instant metric.

On GraceHopper systems, this represents the instantaneous power
consumption of the complete compute module (CPU + GPU).

This is exactly how "Module Power" is read by `nvidia-smi`.

The resulting data from this interface is much cleaner than trying to
read the hwmon counters documented in [1] (less zero values, greatly
decreased jitter of the power readings).

[1] https://docs.nvidia.com/grace-perf-tuning-guide/power-thermals.html
@NanoNabla NanoNabla merged commit 6952f17 into score-p:master Jul 29, 2025
1 of 4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants