Define ILM policy for Kibana APM data #124147
Comments
@nikulinivan Do you know who were the people involved in defining the ILM policy for APM data?
Data from all services will generally end up in the same data streams for See apm-data-streams for more details.
Thanks for the input. I think that as we move forward to collect data from customer deployments, we'll find the need to be able to customize this, but at the moment, I think this is not a high priority. 🙏🏻
Pinging @elastic/apm-ui (Team:apm)
We recently started collecting APM data for kibana and kibana front end.
At this stage we're collecting it for a subset of our monitoring deployments, but the longer term goal is to sample APM stats for all customer deployments, to allow us to monitor them better as well as troubleshoot performance issues in production.
Kibana APM data size
kibana-frontend transactions
kibana transactions are not sampled (100% reported)
In the us-east-1 region, kibana generates ~5m records a day. kibana-frontend generates a negligible number of records (usage is low for these clusters). Given an average document size of 1.5KB, this would result in APM data for kibana weighing 7.5 GB per day for a single region.
For reference, the allocator generates ~680m records a day (>100GB a day) in the same region. This means that the kibana data is negligible in size compared to the rest of the data in these indices.
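The estimate above can be reproduced with a quick back-of-the-envelope calculation. This is only a sketch: the function name `daily_volume_gb` is made up for illustration, and it assumes decimal units (1 GB = 1,000,000 KB) and the record counts and average document size quoted in this issue.

```python
# Rough estimate of daily APM data volume from record count and document size.
AVG_DOC_SIZE_KB = 1.5  # average document size quoted in this issue


def daily_volume_gb(records_per_day: int, avg_doc_kb: float = AVG_DOC_SIZE_KB) -> float:
    """Approximate daily volume in GB, assuming 1 GB = 1,000,000 KB."""
    return records_per_day * avg_doc_kb / 1_000_000


KIBANA_RECORDS_PER_DAY = 5_000_000       # ~5m records a day (us-east-1)
ALLOCATOR_RECORDS_PER_DAY = 680_000_000  # ~680m records a day (same region)

kibana_gb = daily_volume_gb(KIBANA_RECORDS_PER_DAY)
# Share of kibana records relative to allocator records, by count only.
record_ratio = KIBANA_RECORDS_PER_DAY / ALLOCATOR_RECORDS_PER_DAY

print(f"kibana: ~{kibana_gb:.1f} GB/day")
print(f"kibana records are ~{record_ratio:.2%} of allocator records")
```

By record count alone, kibana produces well under 1% of what the allocator produces, which is consistent with treating its footprint as negligible for now.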
ILM Policy
While it's important to set up an ILM policy for Kibana APM data, since its size is negligible in comparison to other services, we can ignore this for now. In the longer term, the cloud observability team plans to roll all data older than 7 days into searchable snapshots.
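For illustration only, a policy along those lines might look roughly like the body built below. This is a hedged sketch, not the policy the observability team will use: the helper `build_ilm_policy`, the repository name `my-repository`, and the 1-day rollover are assumptions. Only the 7-day threshold and the use of searchable snapshots come from this issue. The resulting JSON has the shape accepted by Elasticsearch's `PUT _ilm/policy/<policy-name>` API.

```python
import json


def build_ilm_policy(snapshot_repository: str, snapshot_after: str = "7d") -> dict:
    """Build an ILM policy body that moves data to a searchable snapshot.

    The rollover condition is a placeholder assumption; min_age in the cold
    phase is measured from rollover.
    """
    return {
        "policy": {
            "phases": {
                "hot": {
                    "actions": {
                        # Assumed rollover cadence, not taken from the issue.
                        "rollover": {"max_age": "1d"}
                    }
                },
                "cold": {
                    "min_age": snapshot_after,
                    "actions": {
                        "searchable_snapshot": {
                            "snapshot_repository": snapshot_repository
                        }
                    },
                },
            }
        }
    }


# "my-repository" is a hypothetical snapshot repository name.
policy = build_ilm_policy("my-repository")
print(json.dumps(policy, indent=2))
```

Keeping the threshold as a parameter would make it easy to give Kibana APM data a different retention from other services if that is ever needed.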
Some interesting questions to consider
Once we upgrade to 8.x and use data streams, each service will have its own stream. We would then be able to control each stream's ILM policy separately, if we choose to.
Need to identify owners
How are those deployed across the monitoring clusters?