Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(eap): Add confidence interval calculations for count #6568

Draft
wants to merge 3 commits into
base: master
Choose a base branch
from

Conversation

davidtsuk
Copy link
Contributor

@davidtsuk davidtsuk commented Nov 13, 2024

@volokluev
Copy link
Member

If I have multiple timeseries in one request. For example a sum, count, avg, p90

How would I know which confidence intervals are for which aggregations?

aggregation: AttributeAggregation,
) -> list[Expression]:
"""
Percentiles require special handling as they require sorting the data by the attribute value
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

and where is this sorting being done?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

still working on it, but it's gonna be done after the results are fetched from clickhouse

Copy link
Member

@volokluev volokluev Nov 14, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

so you're going to sort in-memory of the api server?

Copy link
Contributor Author

@davidtsuk davidtsuk Nov 14, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah that was my plan, I could also do it in the query, but the downside is that the query will use more memory. I figured that doing this sorting is gonna slow regardless of where we do it, but doing on the api server will at least reduce overall memory usage in the system. I think we should definitely rethink this in the future though.

@davidtsuk
Copy link
Contributor Author

If I have multiple timeseries in one request. For example a sum, count, avg, p90

How would I know which confidence intervals are for which aggregations?

Reliability is added for each data point in the time series, and data points directly correspond to a timeseries. I'm not sure if that answers your question.

@davidtsuk davidtsuk changed the title feat(eap): Add confidence interval calculations feat(eap): Add confidence interval calculations for count Nov 14, 2024
Copy link

codecov bot commented Nov 14, 2024

❌ 1 Tests Failed:

Tests completed Failed Passed Skipped
1 1 0 0
View the top 1 failed tests by shortest run time
::tests.admin.test_api
Stack Traces | 0s run time
No failure message available

To view more test analytics, go to the Test Analytics Dashboard
Got feedback? Let us know on Github

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants