Diffbot
Turns any website into a structured data source by extracting articles, products, and discussions into your database without manual scrapers.
Try Diffbot in Ceven
Ask Ceven anything
Standard
Why use Ceven?
AI native Diffbot integration
- Describe the outcome and Ceven picks the right Diffbot calls, fills the parameters, and checks the result.
- Structured, agent friendly tool schemas so each call runs reliably instead of by guesswork.
- Rich coverage for reading, writing, and querying your Diffbot data, across all 35 of its actions.
Managed auth
- Built in OAuth with automatic token refresh and rotation.
- One place to manage, scope, and revoke Diffbot access.
- Per user and per environment credentials instead of shared keys.
Agent optimized design
- Actions are tuned from real success and error rates so reliability climbs over time.
- Full execution logs so you always know what ran in Diffbot, when, and on whose behalf.
- The agent pauses and asks when Diffbot is unclear instead of plowing ahead.
Enterprise grade security
- Fine grained access so you control which agents and people can reach Diffbot.
- Least privilege by default, read scopes first and only the writes a workflow needs.
- A full audit trail of every Diffbot action to support review and sign off.
Supported tools
Every action Ceven's agents can run on Diffbot, and when to use it.
Diffbot Search
Use this to query data extracted by crawl or bulk jobs using DQL queries after extraction is complete.
Get Account Details
Pull account details including plan information and usage statistics to verify daily quota status.
Diffbot Analyze
Use this when you have a URL and need Diffbot to automatically determine the content type and route it to the right extractor.
Get Article Data
Extract structured metadata from a web article URL including authors, publication dates, and images.
Get Discussion Thread
Extract structured discussion data from forums, comment sections, and review pages after identifying the URL.
Get Event Data
Use this to pull structured event details such as venue, date, and description from a web page.
Get Image Data
Extract detailed information about images including dimensions and recognition data for publicly accessible URLs.
Get Product Data
Pull structured product information including specifications, prices, availability, and reviews from a page.
Get Video Data
Extract structured video metadata including titles, descriptions, and embedded HTML from any web page.
List Bulk Jobs
Pull a list of all bulk jobs associated with a token to check the status of account jobs.
Resolve Lost ID
Map a lost identifier to its canonical counterpart in the knowledge graph for data consistency.
Start Bulk Job
Use this to process large numbers of URLs asynchronously through a bulk extract job.
Start Crawl Job
Spider a site for links and process them into a single collection using seed URLs.
Stop Bulk Job
Halt further processing of URLs in a job in progress using the specific job ID.
Get Diffbot Account Details
Tool to retrieve account details, including plan information and usage statistics. use after authenticating to verify subscription and daily quota status.
Diffbot Get Event
Tool to extract event details from web pages. use when you need structured event data such as venue, date, and description.
Diffbot Get Image
Tool to extract detailed information about images, including dimensions and recognition data. use after confirming the image url is publicly accessible.
Diffbot Get Product
Tool to extract product information such as specifications, prices, availability, and reviews. use when you need structured product data including specs, pricing, and reviews.
18 actions · scroll to see them all
Frequently asked questions
Alternatives to Diffbot
Other tools that solve a similar problem. Ceven supports these too, so you can switch or run more than one at once.
Try Ceven on your stack
Plug Ceven on top of the tools you already run. Connect Diffbot and the rest of your stack, describe the outcome, and its agents handle the work end to end, days of it in minutes.
Get started for free