Getatlas 4p6hf8wn8p
Help CenterTroubleshootingDebugging Monitor Issues

Debugging Monitor Issues

Last updated August 29, 2024

While Rootly provides robust monitoring, occasional issues might arise with your monitor configurations. This guide provides troubleshooting steps and debugging techniques to help identify and resolve problems with your monitors.

Troubleshooting Monitor Issues

  • Verify Monitor Configuration: Start by double-checking the monitor's settings:
  • URL or Endpoint: Ensure the URL or API endpoint you're monitoring is correct, accessible, and functioning properly.
  • Frequency: Confirm that the monitor's check frequency aligns with your needs. If it's too frequent, it might overload your application or create unnecessary noise.
  • Thresholds: Review the alert thresholds to ensure they are set appropriately.
  • Type: Make sure the monitor type (uptime check, API check, custom metric) matches the service or metric you're trying to monitor.
  • Check for Network Issues: Monitor connectivity between your servers and the target service or endpoint:
  • Ping Tests: Use a ping test to verify that your server can reach the target service.
  • Traceroute: Perform a traceroute to identify potential network hops where the connection is failing.
  • Network Monitoring Tools: Use network monitoring tools to inspect network traffic and identify any connectivity bottlenecks.
  • Analyze Monitor Logs: Examine the monitor's logs for error messages or clues about the issue:
  • Rootly Logs: Access the logs associated with your monitor within the Rootly dashboard to understand the specific errors or reasons for failure.
  • Target Service Logs: Check the logs on the target service or application for any errors or warnings that might be related to the monitor's failures.
  • Test Manual Checks: Perform a manual check of your service or endpoint (e.g., visiting the website, making an API request) to see if the monitor is accurately reporting its status.
  • Examine Downtime History: Review the monitor's downtime history to identify patterns or trends:
  • Frequent Downtime: If downtime occurs frequently at specific times or days, investigate potential causes like server load, resource constraints, or scheduled maintenance.
  • Sudden Downtime: If downtime occurs suddenly, consider recent changes or deployments that might have impacted your service.
  • Simulate Incidents: Trigger a simulated incident to test your alert system and verify that your notifications are accurate and reaching the intended recipients.

Debugging Specific Monitoring Types

  • Uptime Checks: Verify that the URL or endpoint is accessible from your servers, confirm that you're using the correct HTTP method (GET, POST, etc.), and review the response codes.
  • API Checks: Ensure the API endpoint exists, confirm the request method and parameters are accurate, and check for any rate limiting or security issues.
  • Custom Metrics: Carefully review the scripts or integrations used to collect your custom metrics. Ensure the scripts are correctly configured and executing without errors.

By following these troubleshooting steps and analyzing the specific monitoring type in detail, you can effectively debug monitor issues and ensure your Rootly monitors provide accurate and reliable performance insights.

Was this article helpful?