RunPod

Provisions GPU clusters and serverless endpoints on demand, manages container templates, and monitors pod health to keep your AI workloads running without manual console clicks.

Try RunPod in Ceven

Ask Ceven anything
Standard

Why use Ceven?

  1. AI native RunPod integration

    • Describe the outcome and Ceven picks the right RunPod calls, fills the parameters, and checks the result.
    • Structured, agent friendly tool schemas so each call runs reliably instead of by guesswork.
    • Rich coverage for reading, writing, and querying your RunPod data, across all 13 of its actions.
  2. Managed auth

    • Built in OAuth with automatic token refresh and rotation.
    • One place to manage, scope, and revoke RunPod access.
    • Per user and per environment credentials instead of shared keys.
  3. Agent optimized design

    • Actions are tuned from real success and error rates so reliability climbs over time.
    • Full execution logs so you always know what ran in RunPod, when, and on whose behalf.
    • The agent pauses and asks when RunPod is unclear instead of plowing ahead.
  4. Enterprise grade security

    • Fine grained access so you control which agents and people can reach RunPod.
    • Least privilege by default, read scopes first and only the writes a workflow needs.
    • A full audit trail of every RunPod action to support review and sign off.

Supported tools

Every action Ceven's agents can run on RunPod, and when to use it.

Create RunPod Cluster
Use this when you need to deploy multiple pods with shared configuration for parallel processing or ML training.
Create Secret
Store sensitive values like API keys or passwords that pods and endpoints need to access at runtime.
Delete Container Registry Authentication
Remove stored registry credentials when a private image source is no longer needed.
Delete Template
Remove a RunPod template that is not currently in use by any pods or serverless endpoints.
Get GPU Types
Pull available GPU options, pricing, and specifications to determine the best hardware for a workload.
Get authenticated user info
Retrieve the current user ID, email, and security status to verify account permissions.
Get Pod Details
Pull the current state, GPU count, and cost for a specific pod using its unique ID.
List CPU Types
Pull available CPU specifications to configure the non GPU portions of a pod.
Save Serverless Endpoint
Create or update a serverless endpoint with specific GPU and scaling configurations.
Save Container Registry Authentication
Store credentials for private Docker images so RunPod can pull your custom model images.
Save Template
Define a reusable pod configuration with specific images and environment variables.
Update Registry Auth
Modify the username or password for an existing container registry connection.
Update User Settings
Update account settings like the SSH public key to enable secure pod access.

13 actions · scroll to see them all

Frequently asked questions

Ceven interacts with RunPod secrets via the official API to ensure sensitive data never lives in plain text within your workflow logs. When you use the Create Secret action, the agent sends the encrypted payload directly to RunPod. These secrets are then injected into your pods as environment variables at the hardware level. Ceven does not store these secrets in its own long term memory; it only facilitates the transfer from your secure input to the RunPod vault. This ensures that your API keys and database passwords remain isolated within the RunPod environment where the compute actually happens.
Yes. You can build a workflow that first calls the Get GPU Types action to check real time availability and pricing. If your preferred GPU is out of stock in a specific region, the agent can be programmed to fallback to a similar tier, such as moving from an A100 to an H100 or multiple A6000s. The agent compares the specifications and price points to ensure the alternative meets your minimum VRAM requirements before triggering the cluster creation. This prevents your deployment pipelines from failing during peak demand periods when specific hardware is scarce.
Ceven does not handle your payments or credit balance but it provides the visibility needed to manage costs. By using the Get Pod Details action, the agent can monitor the hourly burn rate of every active instance. You can set up a Ceven workflow that polls your pod status every hour and sends an alert to Slack if the projected spend exceeds a specific threshold. It can even be authorized to terminate pods that have been idle or are costing too much, effectively acting as a cost guardrail for your GPU spend.
Templates in RunPod act as blueprints for your environments. Ceven uses the Save Template action to define the Docker image, volume mounts, and environment variables once. Instead of passing a massive configuration object every time you start a pod, the agent simply references the template ID. This makes your workflows much cleaner and allows you to update the underlying image in one place. When you update a template via Ceven, any new pods spun up will use the latest version, while existing pods remain unchanged until they are redeployed.
One specific quirk is that the RunPod API relies heavily on GraphQL for certain mutations, including template deletion. Because of this, some actions may have slightly different response formats than standard REST calls. Additionally, RunPod applies rate limits to API requests to prevent abuse. If you have a workflow that polls pod status every few seconds across hundreds of instances, you might hit these limits. Ceven manages this by implementing an exponential backoff strategy, meaning it will automatically pause and retry requests if it receives a rate limit error from the RunPod gateway.
Absolutely. To pull private images, RunPod needs registry authentication. Ceven uses the Save Container Registry Authentication action to securely pass your Docker Hub or GitHub Container Registry credentials to the platform. Once this is set, any template you create through the agent can reference those private images without failing. If you rotate your registry passwords, you can run a single Ceven command to update the credentials across your account, ensuring that your automated pod scaling doesn't break due to authentication failures.
When Ceven creates a pod, it uses the SSH public key stored in your RunPod user settings. You can use the Update User Settings action to upload your public key through the agent. Once the key is in place, every pod RunPod spins up for you will automatically include that key in its authorized keys file. This allows you to go from a Ceven prompt to a terminal session in seconds. The agent can provide you with the connection string and port number immediately after the pod reaches a running state.
Yes, this is a primary use case. While RunPod handles the low level scaling, Ceven can manage the high level configuration. By using the Save Serverless Endpoint action, the agent can adjust the minimum and maximum number of workers based on external signals. For example, if your application sees a spike in user sign ups, Ceven can trigger a workflow to increase the worker limit on your inference endpoint to maintain low latency. Once the traffic subsides, it can scale the endpoint back down to save on costs.

Alternatives to RunPod

Other tools that solve a similar problem. Ceven supports these too, so you can switch or run more than one at once.

Lambda Labs logoLambda LabsPaperspace logoPaperspaceCoreWeave logoCoreWeave

Try Ceven on your stack

Plug Ceven on top of the tools you already run. Connect RunPod and the rest of your stack, describe the outcome, and its agents handle the work end to end, days of it in minutes.

Get started for free