Datadog

Connects your observability data to automated response workflows, triggering incident remediation and infrastructure updates based on real time metrics and logs.

Try Datadog in Ceven

Ask Ceven anything
Standard

Why use Ceven?

  1. AI native Datadog integration

    • Describe the outcome and Ceven picks the right Datadog calls, fills the parameters, and checks the result.
    • Structured, agent friendly tool schemas so each call runs reliably instead of by guesswork.
    • Rich coverage for reading, writing, and querying your Datadog data, across all 42 of its actions.
  2. Managed auth

    • Built in OAuth with automatic token refresh and rotation.
    • One place to manage, scope, and revoke Datadog access.
    • Per user and per environment credentials instead of shared keys.
  3. Agent optimized design

    • Actions are tuned from real success and error rates so reliability climbs over time.
    • Full execution logs so you always know what ran in Datadog, when, and on whose behalf.
    • The agent pauses and asks when Datadog is unclear instead of plowing ahead.
  4. Enterprise grade security

    • Fine grained access so you control which agents and people can reach Datadog.
    • Least privilege by default, read scopes first and only the writes a workflow needs.
    • A full audit trail of every Datadog action to support review and sign off.

Supported tools

Every action Ceven's agents can run on Datadog, and when to use it.

Create monitor
Use this to set up a new alert for metrics or logs with specific notification thresholds.
Search logs
Query Datadog logs using filters to find specific error patterns or transaction IDs.
Create downtime
Suppress alerts for a specific host or service during a maintenance window to avoid false alarms.
Query metrics
Pull time series data for specific metrics to analyze trends or build custom reports.
Create event
Log a deployment, configuration change, or outage event into the Datadog timeline.
Get trace by ID
Pull detailed span and timing information for a specific distributed trace.
Create SLO
Define a service level objective to track reliability targets over a set window.
List incidents
Pull a list of all current and past incidents to track resolution times and impact.
Mute monitor
Temporarily silence a monitor to stop notifications during an active investigation.
Create synthetic test
Set up an API test to monitor endpoint uptime and performance from global locations.
Get service dependencies
Pull the dependency map for a service to see upstream and downstream callers.
Submit metrics
Push custom application metrics or business KPIs directly into Datadog.
List hosts
Pull a list of all infrastructure hosts including their current status and tags.
Create dashboard
Build a new visualization board with widgets to monitor specific service health.
Create Synthetic API Test
Create a synthetic api test in datadog. creates a new synthetic api test that continuously monitors api endpoints from multiple locations worldwide. useful for proactive monitoring of api uptime, performance, and functionality.
Create Webhook
Create a webhook in datadog. webhooks enable you to receive notifications from datadog monitors and alerts to external services and applications.
Delete Dashboard
Delete a dashboard in datadog. permanently removes a dashboard from your organization. this action cannot be undone. use with caution.
Delete monitor
Deletes a datadog monitor permanently. use with caution as this action cannot be undone.
Get Dashboard
Get a specific dashboard from datadog. retrieves detailed information about a dashboard including its widgets, layout, template variables, and metadata.
Get monitor
Retrieves detailed information about a specific datadog monitor, including its current state, configuration, and any active downtimes.
Get Synthetics Locations
Tool to retrieve all available public and private locations for synthetic tests in datadog. use when you need a list of location identifiers for creating or managing synthetic tests.
Get host tags
Retrieves all tags associated with a specific host in datadog. useful for understanding host metadata and organizing infrastructure.
Get usage summary
Retrieves usage summary information from datadog including api calls, hosts, containers, and other billable usage metrics. useful for cost monitoring and usage analysis.
List All Tags
List all tags from datadog. tags help organize and filter your infrastructure and applications. this action shows all tags in use across your organization.
List API Keys
List api keys in datadog. retrieves all api keys in the organization for security auditing, access management, and key rotation planning. helps maintain security posture by tracking key usage and ownership.
List APM Services
List apm services from datadog. application performance monitoring (apm) provides deep visibility into your applications, helping you track performance, errors, and dependencies.
List AWS Integration
List aws integrations in datadog. retrieves all configured aws account integrations, showing which aws accounts are monitored by datadog and their configuration settings. useful for cloud infrastructure management and ensuring comprehensive
List dashboards
Lists all datadog dashboards with basic information. useful for dashboard management and getting an overview of available dashboards.
List events
Lists events from datadog within a specified time range. events track important occurrences like deployments, outages, and configuration changes.
List Log Indexes
Tool to retrieve a list of all log indexes configured in datadog. use when you need to get the names and configurations of log indexes.

30 actions · scroll to see them all

Frequently asked questions

Ceven implements an intelligent queuing system that respects Datadog API rate limits by tracking the X RateLimit headers in every response. If the agent detects it is approaching the limit for a specific endpoint, it automatically throttles outgoing requests and implements an exponential backoff strategy. This prevents your organization from being blocked during critical incident response windows. For high volume metric submissions, we batch requests into the largest possible payloads allowed by Datadog to minimize the total number of API calls. You can configure custom priority levels in your workflow to ensure that critical monitor reads always take precedence over non urgent dashboard updates.
Yes. The agent can use the Create Dashboard action to programmatically build visualizations. You can tell the agent to create a dashboard for a new microservice, and it will pull the relevant metrics, define the layout, and add the necessary widgets based on your standard observability templates. This is particularly useful for teams that spin up ephemeral environments or launch new services frequently. The agent can also update existing dashboards to add new monitors as your infrastructure evolves, ensuring that your visibility grows alongside your code without requiring manual configuration in the Datadog UI every time a new metric is added.
Ceven can both create and manage Synthetic API tests. You can instruct the agent to deploy a new test whenever a new API endpoint is merged into production. The agent handles the configuration of the test, including the request method, headers, and expected response codes. It can also monitor the results of these tests and trigger a remediation workflow if a test fails from a specific geographic location. This allows you to maintain a proactive monitoring posture where your tests are always in sync with your latest API specifications without needing to manually update the Synthetic suite in the Datadog console.
When an alert triggers, the agent uses the Search Traces action to find requests that match the error criteria. It filters by service, operation, and time window to isolate the problematic requests. Once it finds a relevant trace, it uses the Get Trace by ID action to dive into the individual spans. This allows the agent to pinpoint exactly which downstream service or database query is causing the latency or error. The agent then summarizes this technical data into a human readable format, providing the exact trace link and the identified bottleneck to the engineer on call.
The integration requires an API key and an application key with permissions scoped to the actions you intend to use. For read only workflows, you only need the read permissions for metrics and logs. However, to automate remediation, such as creating downtimes or muting monitors, the application key must have write access to those specific resources. We recommend creating a dedicated Datadog application key for Ceven rather than using a root user key. This follows the principle of least privilege and allows you to audit every action the agent takes in the Datadog audit logs under a unique identifier.
Yes, the agent can use the Get Usage Summary tool to pull billable usage metrics. You can set up a weekly workflow where the agent analyzes your host count, container usage, and API call volume. If the agent detects a sudden spike in usage that could lead to a budget overrun, it can alert your finance or platform team. It can also identify unused monitors or dashboards by analyzing access patterns and suggesting which ones can be deleted to clean up your environment. This turns your observability tool from a cost center into a managed resource with clear visibility into spend.
One specific quirk of the Datadog logs API is that the sort parameter is not supported and will cause the API request to fail. Ceven handles this by performing the sorting logic within the workflow layer after the data is retrieved. Additionally, time parameters must be provided in milliseconds, which the agent handles automatically by converting your natural language time frames into the required format. Users should also be aware that log search results are paginated, and for very large result sets, the agent will walk the cursor to ensure no critical log lines are missed during an investigation.
The agent can both create and monitor Service Level Objectives. You can define a target percentage for a specific metric, and the agent will track the error budget in real time. If the error budget is being consumed too quickly, the agent can trigger a high priority alert or even automatically trigger a rollback of the latest deployment if the SLO breach is linked to a recent change. This allows you to move toward a data driven reliability model where the agent manages the balance between feature velocity and system stability based on the actual performance data in Datadog.

Alternatives to Datadog

Other tools that solve a similar problem. Ceven supports these too, so you can switch or run more than one at once.

New Relic logoNew RelicDynatrace logoDynatraceSplunk logoSplunkGrafana logoGrafana

Try Ceven on your stack

Plug Ceven on top of the tools you already run. Connect Datadog and the rest of your stack, describe the outcome, and its agents handle the work end to end, days of it in minutes.

Get started for free