-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Query is only working with "Service" command in Alert page #195
Comments
I was able to reproduce this and from my initial investigation it seems like the AWS api that we're using only supports ID, service, and edge functions. I'll take another look tomorrow and see if there's something else we can use |
Took a look at this one and also am hitting interesting api limitations. I can use tracelist queries to create a list of filtered queries by status and service like so But I'm unable to create an alert on them because we don't have a Time field. I noticed that TraceList is using GetTraceSummaries which as far as I can tell should return to us a matchedTimeEvent field that I would think we could surface through grafana that we could then use as a Time field, but when I add that code in, I see at least with my test data it's always nil. I'm not sure if that's a setting or something I'm misunderstanding. Will follow up with AWS to see if they have other recommendations on how to query this kind of data. Or if you find other ways of using their api to get traces by service/status and with some kind of time field let us know! My only other thought as a weird work around would be to use transformations to make some kind of Time Column for this data and since we generally only get back 6 hours worth of traces the value for that time wouldn't necessarily be particularly relevant. But still enough to get alerting working? But this seems a bit of a hack. |
Spoke briefly with AWS and their suggestion was that we use GetTraceSummaries rather than GetTimeSeriesServiceStatistics to support querying by service and status code, which is interesting because I thought we supported querying by both (or at least I see references to both calls in our codebase) but I haven't taken a deeper look at it. Seems like something to look deeper into. Will backlog this issue for now, since we don't have specific timeline just yet on fixing this. But whenever we plan work for our next quarter we'll have a record of this without our findings to make improvements. |
I'm also running into this issue, do you have any timeline on when we can expect this to be looked at? 😄 |
It seems the only command that works in Query box in Alert definition is
Service
command.What happened:
I'm trying to create an alert which fires every time the "http status 403 error rate" increases.
I have a service called
grafana-poc-test-service
which is sending its traces to the AWS X-Ray and I can confirm theservice("grafana-poc-test-service")
query will return the data and can be used in alerts but it seemsservice
is the only key word accepted in the query box. Any simple or complex query other than that will raise aFailed to evaluate the query
error. I tested with bellow queries and all of them are raising the same exception:This query will work though:
service("grafana-poc-test-service")
How to reproduce it (as minimally and precisely as possible):
Screenshots
Working scenario
Anything else we need to know?:
Environment:
The text was updated successfully, but these errors were encountered: