Fix Warning & Operation metrics #14323

hamistao · 2024-10-22T12:27:01Z

lxd_operations_total and lxd_warnings_total are currently being taken clusterwide, instead of returning just the entities related to the Node responding the metrics request. This goes against the overall design of the metrics, that are supposed to be per node and queried on each node on a cluster.
To fix this, this PR filters the queries for those entities based on the Node.

tomponline

Do warnings always have a node?

hamistao · 2024-10-22T12:30:31Z

Do warnings always have a node?

They do not, this is marked as Draft while I/we decide on how to handle nodeless warnings, I am thinking just include them in all node's metrics.

tomponline · 2024-10-22T12:33:01Z

Do warnings always have a node?

They do not, this is marked as Draft while I/we decide on how to handle nodeless warnings, I am thinking just include them in all node's metrics.

Yeah, or just on the leader?

hamistao · 2024-10-22T12:39:22Z

Yeah, or just on the leader?

I think this makes even more sense. Since Prometheus is supposed to scrape them on every node, counting them for each node can have the effect of readundantly counting these warnings many times when aggregating the measurements.

lxd/api_metrics.go

Signed-off-by: hamistao <[email protected]>

Signed-off-by: hamistao <[email protected]> (cherry picked from commit 24f150cf451eef796d879f1a191004ef0504b301)

Signed-off-by: hamistao <[email protected]> (cherry picked from commit decddf5adfbd0c34c1f189c4eff16c19b3527abd)

Signed-off-by: Mark Laing <[email protected]> (cherry picked from commit 5b844ba)

Signed-off-by: Mark Laing <[email protected]> (cherry picked from commit 51526cb)

Signed-off-by: Mark Laing <[email protected]> (cherry picked from commit 2492cd5)

Filters query for Warnings on the metrics handler by Node. Since some Warnings do not have a node, nodeless Warnings are only being included if querying the metrics from the leader node. Signed-off-by: hamistao <[email protected]>

hamistao · 2024-10-23T10:05:23Z

@tomponline @markylaing I am thinking on waiting for #14261 before proceeding here since I intend to use the LeaderInfo function defined there. I have cherry picked some commits to test and it all works fine.

tomponline reviewed Oct 22, 2024

View reviewed changes

lxd/api_metrics.go Outdated Show resolved Hide resolved

hamistao and others added 7 commits October 23, 2024 06:48

lxd/api_metrics: Filter Operation query by node

495fca6

Signed-off-by: hamistao <[email protected]>

lxd/db/cluster/warnings: Allow filtering by Node only

83dfb78

Signed-off-by: hamistao <[email protected]> (cherry picked from commit 24f150cf451eef796d879f1a191004ef0504b301)

lxd/db/cluster: Run make update-schema

711f038

Signed-off-by: hamistao <[email protected]> (cherry picked from commit decddf5adfbd0c34c1f189c4eff16c19b3527abd)

lxd: Add method to get leader information.

2b0248a

Signed-off-by: Mark Laing <[email protected]> (cherry picked from commit 5b844ba)

lxd/state: Add LeaderInfo method to state.

6301d0d

Signed-off-by: Mark Laing <[email protected]> (cherry picked from commit 51526cb)

lxd: Set LeaderInfo on (*Daemon).State().

4fa9fb6

Signed-off-by: Mark Laing <[email protected]> (cherry picked from commit 2492cd5)

hamistao force-pushed the fix_warning_operation_metrics branch from 471cc6e to 880764a Compare October 23, 2024 10:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Warning & Operation metrics #14323

Fix Warning & Operation metrics #14323

hamistao commented Oct 22, 2024

tomponline left a comment

hamistao commented Oct 22, 2024

tomponline commented Oct 22, 2024

hamistao commented Oct 22, 2024

hamistao commented Oct 23, 2024

Fix Warning & Operation metrics #14323

Are you sure you want to change the base?

Fix Warning & Operation metrics #14323

Conversation

hamistao commented Oct 22, 2024

tomponline left a comment

Choose a reason for hiding this comment

hamistao commented Oct 22, 2024

tomponline commented Oct 22, 2024

hamistao commented Oct 22, 2024

hamistao commented Oct 23, 2024