Alerts and Recommended Actions
1. Infra Alerts and Recommended Actions
This section covers the configuration of alerts related to the system’s infrastructure. These alerts focus on hardware, network, and other critical infrastructure issues. It also includes recommended actions to resolve any issues that arise, such as scaling or resource reallocation.
2. Ingestion System Alerts and Recommended Actions
This section outlines the alerts related to the ingestion system. These include data input errors, performance issues, or system failures. Suggested actions are provided to address these issues, such as checking data sources, optimizing data flow, or reconfiguring ingestion pipelines.
3. Processing System Alerts and Recommended Actions
This section details the alerts that monitor the processing system, such as performance degradation, task failures, or delays in processing. Recommended actions might include reviewing processing pipelines, adjusting resource allocation, or analyzing logs to identify bottlenecks.
4. Querying System Alerts and Recommended Actions
Here, we focus on alerts related to query performance or failures. These include issues like long-running queries, high resource usage, downtime of queuing. The documentation provides recommended actions like optimizing queries, increasing system resources, or adjusting indexing strategies.
5. Storage System Alerts and Recommended Actions
This section outlines alerts related to the storage system, such as disk space running low, database failures, or read/write errors. The recommended actions include expanding storage, backing up data, or resolving connectivity issues between storage nodes.
Last updated
