Honeyhive
Streams model inputs and outputs into your evaluation pipelines, automates the creation of gold datasets from production logs, and triggers evaluation runs when performance metrics dip.
Try Honeyhive in Ceven
Ask Ceven anything
Standard
Why use Ceven?
AI native Honeyhive integration
- Describe the outcome and Ceven picks the right Honeyhive calls, fills the parameters, and checks the result.
- Structured, agent friendly tool schemas so each call runs reliably instead of by guesswork.
- Rich coverage for reading, writing, and querying your Honeyhive data, across all 42 of its actions.
Managed auth
- Built in OAuth with automatic token refresh and rotation.
- One place to manage, scope, and revoke Honeyhive access.
- Per user and per environment credentials instead of shared keys.
Agent optimized design
- Actions are tuned from real success and error rates so reliability climbs over time.
- Full execution logs so you always know what ran in Honeyhive, when, and on whose behalf.
- The agent pauses and asks when Honeyhive is unclear instead of plowing ahead.
Enterprise grade security
- Fine grained access so you control which agents and people can reach Honeyhive.
- Least privilege by default, read scopes first and only the writes a workflow needs.
- A full audit trail of every Honeyhive action to support review and sign off.
Supported tools
Every action Ceven's agents can run on Honeyhive, and when to use it.
Add datapoints to dataset
Use this when you need to append multiple entries with specified input, ground truth, and history mappings to an existing set.
Create batch model events
Use this when you need to log a batch of model interactions to HoneyHive in one request to save on API overhead.
Create batch tool events
Use this to record multiple external API calls as tool events after gathering all event data.
Create dataset
Use this when you need to initialize a new dataset within a project for a new evaluation cycle.
Create tool
Use this when you need to register a new function or plugin for invocation tracking.
Delete datapoint
Use this when you need to remove a specific datapoint from HoneyHive after confirming its identifier.
Get datasets
Pull a list of datasets for a specific project with optional filters to find the right test set.
Get metrics
Retrieve all metrics for a specific project after obtaining the project context.
Retrieve events
Pull events based on filter criteria, date range, and pagination for analysis or export.
Retrieve experiment result
Pull the status, metrics, and datapoint level details of a completed experiment run.
Start evaluation run
Use this to initiate an evaluation run using external datasets and linked events.
Start session
Use this to initiate a new tracking session and retrieve a session id for event grouping.
Delete Dataset
Tool to delete a dataset by ID. Use when you need to remove a dataset after confirming its ID.
End Evaluation Run
Tool to mark an evaluation run as completed. Use after finishing manual evaluations to update the run status to completed.
Get Configurations
Tool to retrieve a list of configurations. Use when you need to fetch all configurations for a specific project before making changes.
Get Projects
Tool to retrieve projects. Use when you need to list all available projects.
List Tools
Tool to list all available Honeyhive tools. Use when you need to discover which functions or plugins are registered for use.
Retrieve Datapoint
Tool to retrieve a specific datapoint by its ID. Use when you have a datapoint ID and need its full details.
Retrieve Datapoints
Tool to retrieve a list of datapoints. Use when you need to fetch datapoints for a project with optional filters.
Update Datapoint
Tool to update a specific datapoint. Use when you need to modify fields of an existing datapoint.
Update Dataset
Tool to update an existing dataset. Use when you need to modify a dataset's details (name, description, datapoints, linked evaluations, or metadata) after confirming its ID.
Update Event
Tool to update an event. Use when updating event details by ID.
Update Metric
Tool to update an existing metric. Use when you need to modify a metric’s properties after creation. Ensure you retrieve the metric first to verify its current state.
Update Project
Tool to update a project's name or description. Use when you need to modify an existing project by its ID after creation.
24 actions · scroll to see them all
Frequently asked questions
Alternatives to Honeyhive
Other tools that solve a similar problem. Ceven supports these too, so you can switch or run more than one at once.
Try Ceven on your stack
Plug Ceven on top of the tools you already run. Connect Honeyhive and the rest of your stack, describe the outcome, and its agents handle the work end to end, days of it in minutes.
Get started for free