-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[prometheus] "Error on ingesting out-of-order exemplars" message in logs #1795
Comments
@beeme1mr This issue in particular did not exists with prior versions of flagd I did an incremental change to 0.11.3, and saw this error stopped happening, so I suspect this is something very specific to the 0.11.3 -> 0.11.4 upgrade. The release notes for flagd, are not super obvious on why this error would occur outside of a prometheus client upgrade. |
We didn't make any telemetry-related changes in the last release but I'll look into it.
Sorry, reread your comment. |
I can't get the demo app running locally due to an unrelated error.
Another user reported this already, but the fix has been reverted. |
Possibly related. |
@beeme1mr the latest release doesn't have that anymore. |
Yeah, I'm trying to run the latest version of the demo in Ubuntu on WSL. |
Ah, I see. Re-reading your message it actually makes sense. Multiple users reported that they were facing issues to run with the rslave param. Could you check if updating docker solves for you? But ideally the demo would run in all setups. |
I'm running the latest version available through |
Looks like flagd 0.11.4 contained an updated of flagd/core to 0.10.3 which itself contained a change from 1.28.0 to 1.30.0 of the opentelemetry-go monorepo. This definitely seems like a possible cause of the issue. Not sure what all is in that change but it doesn't look like it is flagd's fault, since they're just using basic APIs and not doing anything overly fancy. Indeed, they're not doing anything specific to exemplars at all. @open-telemetry/go-maintainers is there any chance there is a known issue which may have caused this? |
Exemplars were enabled by default in 1.31.0, so that wouldn't have changed between 1.28.0 and 1.30.0. But if it was 1.31, that would possibly explain it.
So sounds like this could be out of order or a duplicate issue. Are we sending OTLP to Prometheus? Or are we exporting prometheus or PRW from the collector? |
we are sending OTLP to Prometheus |
Got it. So it is probably an issue with the implementation of exemplar translation in the OTLP receiver of the prometheus server. The exemplar validation code probably assumes things about exemplars that aren't correct for OTel exemplars. |
@puckpuck if you can add details of our setup to prometheus/prometheus#13933, that would be helpful. Some hypotheses to check:
|
Bug Report
The following error continues to show up in the Prometheus logs:
This error started happening after upgrading flagd to version 0.11.4 in this PR
The text was updated successfully, but these errors were encountered: