-
Notifications
You must be signed in to change notification settings - Fork 74
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Combining metrics profiles files #411
Combining metrics profiles files #411
Conversation
Can one of the admins verify this patch? |
@rsevilla87 @mohit-sheth @dry923 can you all please take a look when you get the chance |
0b429c2
to
4f0ee21
Compare
@rsevilla87 @dry923 are we able to get this PR merged? The default metrics profile doesn't collect all the data that is available when using all the touchstone-configs here
|
@rsevilla87 @jtaleric @dry923 any thoughts on this? Can we get this merged? the only difference in those 2 files is the podCPU and podMemory. This small difference is also causing discrepancies in the grafana dashboards in the node section because it’s looking for “podCPU” that isn’t captured in the metrics.yaml file that nodes that aren't for the UUID are being listed |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm - I am however concerned touchstone doesn't do a good job with error handling :(
Think the issue with touchstone is similar to : cloud-bulldozer/benchmark-comparison#57 not sure if it's the exact same thing though |
- query: sum(irate(container_cpu_usage_seconds_total{name!="",namespace=~"openshift-(etcd|oauth-apiserver|.*apiserver|ovn-kubernetes|sdn|ingress|authentication|.*controller-manager|.*scheduler|monitoring|logging|image-registry)"}[2m]) * 100) by (pod, namespace, node) | ||
metricName: podCPU | ||
|
||
- query: sum(container_memory_rss{name!="",namespace=~"openshift-(etcd|oauth-apiserver|.*apiserver|ovn-kubernetes|sdn|ingress|authentication|.*controller-manager|.*scheduler|monitoring|logging|image-registry)"}) by (pod, namespace, node) | ||
metricName: podMemory |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Now that we're using containerCPU and containerMemory metrics in touchstone I think we can skip add these two queries here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
with this being the only difference in these 2 files can we not just add these 2 queries and combine?
@@ -0,0 +1,21 @@ | |||
{ | |||
"elasticsearch": { | |||
"ripsaw-kube-burner": [ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why 4 configuration files rather than a single one holding all queries?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I sent a example on slack on how to consolidate this.
Hey @paigerube14, I managed to query CPU and memory metrics from pods using the new touchstone config files you've added:
I think we can skip podCPU and podMemory metrics in metrics.yaml |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
* combining metrics profiles * adding container touchstone configs * adding overall container metrics * deleting old files * adding metrics-ovn back
Description
Want to combine the metrics and metrics-ovn yaml files so the default metrics file has podCPU and podMemory that can be used when creating the benchmark-comparison for kube-burner
Fixes