-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Check the resource usage of Alloy #3724
Comments
Actual RAM usage for Alloy on
|
Our Alloy app is already up-to-date
|
So what's the outcome here then?
This sounds closer to what prometheus-agents had - so we improved here and closed the gap? |
Looking into Putting this into perspective with Note that this comparison is made across 2 different installations and might be inaccurate due the nature of the underlying data which might differ in terms of labels cardinality which plays a big part in the actual memory usage. |
I replaced Alloy with Prometheus agent on |
I struggle with reading those graphs.
So I'll refer to your numbers:
With
So,
That's a bit more RAM usage for Alloy (+10%), but I think we can accept that. |
Yes I think that's something we can accept. Maybe we should find ways to reduce the difference in resource usage between the 2 pods (I think it's KSM and we should maybe shard) but those are good results :) Is there anything else to do here? Maybe write those results down somewhere? |
There seems to be something wrong in the scaling for prometheus agent pods, as the number of observed timeseries and current shards do not match with what the operator is supposed to have configured I added more unit test cases in the operator to ensure the number of shards computed by the operator is correct, giantswarm/observability-operator#160 We'll leave this for now, and anyone in the future observing a similar difference is gladly welcomed to investigate further :) |
We configured the sharding value in config at 500.000 timeseries and not 1.000.000 anymore. That could explain the différence you see |
But as long as we're okay with thé résults we don't have a reason to keep this issue right? |
Having a lower threshold questions even more the results we are observing. But for now we are good here and we can close this. |
Motivation
We've seen some weird numbers when checking the golem installation on which alloy is already rolled out. It seems that alloy is using significantly more resources than promtail and prometheus-agent combined. This is not good.
Todo
Outcome
The text was updated successfully, but these errors were encountered: