Alerts and Recommended Actions

This section covers the configuration of infrastructure alerts, which focus on hardware, network, and other critical infrastructure issues. It also includes recommended actions for resolving those issues, such as scaling or reallocating resources.

This section outlines alerts for the ingestion system, including data input errors, performance issues, and system failures. Suggested actions include checking data sources, optimizing data flow, and reconfiguring ingestion pipelines.

This section details the alerts that monitor the processing system, such as performance degradation, task failures, or delays in processing. Recommended actions might include reviewing processing pipelines, adjusting resource allocation, or analyzing logs to identify bottlenecks.

This section focuses on alerts related to query performance and failures, such as long-running queries, high resource usage, or queuing downtime. Recommended actions include optimizing queries, increasing system resources, or adjusting indexing strategies.
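
As a minimal sketch of how a long-running-query alert could be evaluated, the Python below flags queries whose runtime exceeds a threshold. The `QueryStat` record, the `long_running` helper, and the 30-second default are illustrative assumptions, not part of the system described here.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class QueryStat:
    """Hypothetical record of one query's observed runtime."""
    query_id: str
    duration_s: float


def long_running(stats: Iterable[QueryStat], threshold_s: float = 30.0) -> List[QueryStat]:
    """Return the queries that ran longer than threshold_s, slowest first."""
    slow = [s for s in stats if s.duration_s > threshold_s]
    return sorted(slow, key=lambda s: s.duration_s, reverse=True)


if __name__ == "__main__":
    sample = [
        QueryStat("q1", 45.0),
        QueryStat("q2", 2.5),
        QueryStat("q3", 120.0),
    ]
    for stat in long_running(sample):
        print(f"ALERT long-running query {stat.query_id}: {stat.duration_s:.1f}s")
```

Sorting slowest first puts the queries most likely to benefit from optimization or index changes at the top of the alert.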

This section outlines alerts related to the storage system, such as disk space running low, database failures, or read/write errors. The recommended actions include expanding storage, backing up data, or resolving connectivity issues between storage nodes.
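
A low-disk-space alert of the kind mentioned above can be sketched with Python's standard library. The `classify_usage` and `disk_alert` helpers and the 80% and 90% thresholds are assumptions chosen for illustration; the actual alert thresholds are not stated here.

```python
import shutil
from typing import Tuple


def classify_usage(used_pct: float, warn_pct: float = 80.0, crit_pct: float = 90.0) -> str:
    """Map a disk-usage percentage to a severity label (assumed thresholds)."""
    if used_pct >= crit_pct:
        return "critical"
    if used_pct >= warn_pct:
        return "warning"
    return "ok"


def disk_alert(path: str, warn_pct: float = 80.0, crit_pct: float = 90.0) -> Tuple[str, float]:
    """Return (severity, used percentage) for the filesystem containing path."""
    usage = shutil.disk_usage(path)  # named tuple: total, used, free (bytes)
    used_pct = 100.0 * usage.used / usage.total
    return classify_usage(used_pct, warn_pct, crit_pct), used_pct


if __name__ == "__main__":
    severity, pct = disk_alert(".")
    print(f"disk usage {pct:.1f}% -> {severity}")
```

A "warning" result leaves time to plan a storage expansion, while "critical" suggests acting on the recommended actions immediately.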
