-
Notifications
You must be signed in to change notification settings - Fork 122
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Multi-Cloud Metrics Server #281
Comments
will it focus on cpu/memory (resources metrics) + network ones (as external/custom) or will allow exposing others via for example prometheus ? |
We should be exposing cpu and memory by default, and possibly network and file io
This looks like a custom metrics resource based on promql or some other syntax, which looks good and is a good idea, and we can consider supporting it after we implement the basic features |
@Iceber what API you would like to implement first?
in first implementation ? As |
@grzesuav We will first implement a basic usable minimal functionality, and the first stage will not involve prometheus or opentelemery. And we will first implement the resource metrics(metrics.k8s.io) —— cpu and memory, which can be obtained directly from /api/v1/nodes/< node >/proxy/metrics/resource. (similar to the metrics server's logic) Then we'll use /api/v1/nodes/< node >/proxy/metrics/cadvisor to get more information, including network and io, etc In the third phase, we will start to implement more features based on opentelemetry or prometheus. More suggestions are welcome. |
/assign |
@Iceber how you distinguish from which cluster metrics come from ? |
@grzesuav In the early stage, the two implementations of the multicluster metrics server would be two separate modules, and users would have to choose between them to deploy. Then we would consider combining the two implementations to decide via crd whether to use thanos/otel or node proxy for metric api information for a cluster |
What would you like to be added?
The clusterpedia apiserver will have a built-in multi-cluster metrics server that will provide at least the following features:
kubectl --cluster <cluster> top node/pod
to view the resource consumption of a node or pod under a clusterWe will refer to the following project implementation
The multi-cloud metrics server will be implemented in two ways:
We plan to implement the first way first in Q3
Why is this needed?
Users could easily get the resource consumption through clusterpedia
The text was updated successfully, but these errors were encountered: