# The AI SRE observability stack

30x cheaper than Datadog. Predictable pricing.  
Exceptional customer support.

Datadog bill too high? Migrate today, the rest of your contract is on us.  
Migration assistance and bespoke onboarding included. [Book a consultation](https://calendly.com/betterstack/consultation-migrating-from-datadog)

[eBPF-based service map](/tracing.md) [Log management](/log-management.md) [OpenTelemetry-native tracing](/tracing.md#otel) [Infrastructure monitoring](/infrastructure-monitoring.md) [AI-native error tracking](/error-tracking.md) [Incident management](/incident-management.md) [Status page](/status-page.md) [Agentic AI SRE](/tracing.md#sre) [Resolve incidents in Slack or Teams](/incident-management.md#slack)

## At a fraction of your current costs

Get an unrivaled price-to-performance ratio. Forget sampling and ingest all your data or decrease your costs by 80x.

Ingest up to 80x more data with the same budget, or save up to 98% of your costs.

| Provider | 1 TB traces + 1 TB logs + 1 TB metrics per month |
| --- | --- |
| Datadog | approx. $55,574 per month |
| Better Stack | $687 per month |

_An estimate only. Assumes annual payments, European data location, 1 responder with a Tera bundle, the average event size of 1 kB. Further assumes Datadog's $0.1 per ingested GB of spans & logs, and $2.5/million indexed spans & logs for 30 days._

[Explore pricing](/pricing.md#logs)

## AI SRE

[Explore AI SRE](/ai-sre.md)

### [Robust MCP server](https://betterstack.com/demos/uptime-api/#mcp-server)

Integrate your logs, metrics, traces, and errors into your existing LLM  
workflows with a top tier MCP server.

### [Get an AI SRE](/ai-sre.md#agent)

Claude Code-like UI with the knowledge of your infrastructure.

### [Smart incident merging](/incident-management.md#ai-features)

10 incidents created at the same time? Acknowledge them with a single tap and keep your phone from ringing while fixing the issue.

### [Explain with AI](/incident-management.md#ai-features)

Analyze MTR, trace route, SSL certificates or connection errors.

### [Linear tickets suggested by AI](/incident-management.md#ai-features)

Downtime? Create Linear tickets to fix the root cause using AI-based  
suggestions with a single tap.

### [AI written post-mortems](/incident-management.md#ai-features)

Get an automated post-mortem based on the incident timeline and Slack.

## Tracing

[Explore tracing](/tracing.md)

### [eBPF-based service map](/tracing.md#otel)

See network flows, auto-instrument databases, and track SLOs.

### [Instrument clusters with OpenTelemetry with no code change](/tracing.md#ebpf)

Gather traces, logs, and metrics using eBPF and OpenTelemetry. Remotely monitor collector’s throughput and adjust sampling, compression, and batching as needed.

### [Explore with "bubble up"](/tracing.md)

Investigate slow requests visually with drag & drop to find root cause.

### [Get an AI SRE: Claude Code with the knowledge of your telemetry](/tracing.md#sre)

Leverage automated root cause analysis based on the eBPF-based service map and log analysis. A human is always in charge.

### [Apdex and RED metrics](/tracing.md)

Understand the overall health of a service in a single dashboard.

## Incident management

[Explore incident management](/incident-management.md)

### [Slack-based incident management](/incident-management.md)

Get the right team members involved with powerful templated workflows directly in Slack and decrease your MTTR.

### [Smart incident merging](https://betterstack.com/docs/uptime/incident-grouping/)

10 incidents created at the same time? Acknowledge them with a single tap and keep your phone from ringing while fixing the issue.

### [AI post-mortems](/incident-management.md#ai-post-mortems)

Learn from every incident instead of manually rewriting what happened.

## Uptime monitoring

[Explore uptime monitoring](/uptime.md)

### [Screenshots for errors](/uptime.md#uptime-monitoring)

We record the API errors and take a screenshot of your app being down.

### [Traceroute & MTR for timeouts](/uptime.md#uptime-monitoring)

Understand connection timeouts and request timeouts with edge-based traceroute and MTR outputs.

### [Playwright-based transaction checks](/website-monitoring.md#transaction-monitoring)

Run tests with a real Chrome browser instance with a JavaScript runtime.

### [Phone call alerts & SMS included](https://betterstack.com/docs/uptime/monitoring-start/#step-2-choosing-the-alerting-options)

Unlimited global phone call alerts, sms, push notifications, and Slack notifications included with every Responder license.

## Log management

[Explore log management](/log-management.md)

### [Analyze raw logs at scale](/log-management.md)

Run ad-hoc queries on raw logs at scale with our built-in query time sampling.  
 Query with SQL, PromQL, Drag & drop or a simple log filtering.

### [Don’t get billed for useless logs](/log-management.md#query)

Mark irrelevant logs as spam to exclude them and don’t get charged.

### [One-click pattern filtering](/log-management.md#query)

Group similar logs, filter or exclude patterns with a single click.  
 See surrounding logs from noisy neighbors.

### [SQL via HTTP API and MCP server](/log-management.md#wide-events)

Query your logs, spans or metrics with SQL over HTTP API or our MCP server.

### [Store logs in your own S3 bucket](/log-management.md#wide-events)

No more ‘hot storage’ and ‘cold storage’. Access all your logs all the time.  
 No need to rehydrate your logs from S3 ever again.

### [Drag & drop dashboards](/log-management.md#query)

Drag & drop the metrics you want to visualize. No SQL code necessary.

## Infrastructure monitoring

[Explore infrastructure monitoring](/infrastructure-monitoring.md)

### [Anomaly detection alerts](/telemetry#resolve)

Trigger alerts in real-time based on anomalies in logs and metrics. No need to configure exact alert thresholds. Get alerts via Slack, e-mail, phone, SMS, and more.

### [Collaboration built-in](/telemetry)

Observe your teammates and comment on note-worthy data spikes.

### [Query with Drag & drop, SQL or PromQL](/telemetry#troubleshoot)

Get answers fast with a powerful SQL query builder. No need to learn a new querying language or ask your data analyst.

### [OpenTelemetry & Prometheus-native](/infrastructure-monitoring.md)

Connect metrics in minutes using existing open-source collectors.

## Error tracking

[Explore error tracking](/error-tracking.md)

### [Made for Slack, Linear, Microsoft Teams, and Jira](/error-tracking.md#features)

Integrate with tools you already use to get alerted with rich contextual notifications. Need an urgent fix? Leverage the built-in incident management.

### [Compatible with Sentry SDKs](/error-tracking.md#sentry-compatible)

Track errors from 100+ platforms with the industry-standard Sentry SDKs.

### [Integrate with Claude Code or Cursor](/error-tracking.md#ai-first)

Integrate Better Stack in seconds with a single prompt. Copy, run, and start tracking errors with Better Stack.

### [Terraform, API, and MCP server](/error-tracking.md#features)

Integrate new apps with Terraform and investigate errors with MCP.

### [Configurable exception grouping](/error-tracking.md#features)

Have a non-standard stack and want to group exceptions in an atypical way? We got you covered.

## Real user monitoring

[Explore real user monitoring](/real-user-monitoring.md)

### [Session replay](/real-user-monitoring.md#session-replay)

See how users interact with your product. Watch at 2x speed, skip pauses, filter for rage indicators.

### [Auto-capture user events](/real-user-monitoring.md#session-replay)

Collect clicks, form fills, and rage clicks with a single code snippet.

### [Product analytics with event funnels](/real-user-monitoring.md#product-analytics)

See what parts of the critical path of your onboarding need improving by evaluating user actions step-by-step.

### [Correlate frontend with backend](/real-user-monitoring.md#product-analytics)

See web events alongside backend traces and other log events.

### [Track web vitals](/real-user-monitoring.md#features)

See Largest Contentful Paint, Cumulative Layout Shift, and Interaction to Next Paint segmented per URL.

### [Real-time website data at scale](/real-user-monitoring.md#website-analytics)

Are you getting more visitors from ChatGPT or Gemini?

## Status page

[Explore status pages](/status-page.md)

### [Branded page on your own sub-domain](/status-page.md)

Beautifully designed status page. Fully customizable with CSS and Javascript.

### [Subscribe to status page updates](/status-page.md#branded)

Send automated updates to your customers when incident occurs. Let your customers subscribe to the entire status page or just selected components.

### [Translated into any language](/status-page.md)

Be perceived as a local by your foreign customers. Customize every translation.

### [Embed custom charts](/status-page.md)

Show pre-built charts with response times or add custom metrics with advanced visualizations directly to your status page.

## Warehouse

[Explore warehouse](/warehouse.md)

### [So affordable you’ll ask what’s wrong with it](/warehouse.md#cost-comparison)

Benefit from our economies of scale. We pass the savings on to you.  
Cheaper than self-hosting on AWS.

### [Get time series data warehouse as an API](/warehouse.md#ch-api)

Save query as a high-performance API you can securely call from frontend.

### [Built-in vector embeddings and approximate KNN](/warehouse.md#features)

Generate vector embeddings without having to call an external API with our  
built-in embedding model and query embeddings fast with vector indexes.

### [Open formats in your own S3 bucket](/warehouse.md#s3-bucket)

Store everything in your own cloud.  
No vendor lock-in.

### [Analyze trends with analytical queries](/warehouse.md#features)

Run arbitrary SQL queries at petabyte scale leveraging query time sampling.

## Happy customers, growing market presence

Ship higher-quality software faster. Be the hero of your engineering teams.
