DocRaptor

Converts HTML and web pages into professional PDF or Excel files and manages the delivery of generated documents to your customers or internal storage.

Try DocRaptor in Ceven

Ask Ceven anything
Standard

Why use Ceven?

  1. AI native DocRaptor integration

    • Describe the outcome and Ceven picks the right DocRaptor calls, fills the parameters, and checks the result.
    • Structured, agent friendly tool schemas so each call runs reliably instead of by guesswork.
    • Rich coverage for reading, writing, and querying your DocRaptor data, across all 10 of its actions.
  2. Managed auth

    • Built in OAuth with automatic token refresh and rotation.
    • One place to manage, scope, and revoke DocRaptor access.
    • Per user and per environment credentials instead of shared keys.
  3. Agent optimized design

    • Actions are tuned from real success and error rates so reliability climbs over time.
    • Full execution logs so you always know what ran in DocRaptor, when, and on whose behalf.
    • The agent pauses and asks when DocRaptor is unclear instead of plowing ahead.
  4. Enterprise grade security

    • Fine grained access so you control which agents and people can reach DocRaptor.
    • Least privilege by default, read scopes first and only the writes a workflow needs.
    • A full audit trail of every DocRaptor action to support review and sign off.

Supported tools

Every action Ceven's agents can run on DocRaptor, and when to use it.

Create Async Document
Use this when generating PDF, XLS, or XLSX documents that are too large for a synchronous call. You will need to poll for status or use a callback.
Create Document
Use this to generate a PDF or Excel document immediately from HTML content or a URL. The file is returned in the response.
Create Document from Referrer
Use this to convert web pages into documents using referrer based authentication. This avoids the need for a specific API key for registered domains.
Create Hosted Async Document
Use this to create a document asynchronously and have it hosted on DocRaptor servers. Track completion via the status API or a callback URL.
Download Async Document
Use this when you have a download id from a completed asynchronous request and need to retrieve the actual document file.
Get Async Document Status
Use this to monitor the progress of an asynchronous document request by polling for the current completion status.
List Documents JSON
Pull a paginated list of previously created documents and their metadata in JSON format to audit document history.
List Documents XML
Pull a paginated list of previously created documents and their metadata in XML format for legacy system compatibility.
List IPs JSON
Pull the list of IP addresses DocRaptor uses to download assets in JSON format. Use this for firewall configuration.
List IPs Text
Pull the list of IP addresses DocRaptor uses to download assets in plain text format for quick security reviews.

10 actions · scroll to see them all

Frequently asked questions

Ceven utilizes the asynchronous API endpoints for any document that exceeds standard timeout limits. Instead of waiting for a synchronous response, the agent sends the HTML to DocRaptor and receives a unique document ID. The agent then enters a polling loop using the Get Async Document Status tool, checking the progress at set intervals. Once the status returns as completed, the agent automatically triggers the Download Async Document action to retrieve the final file. This ensures that your workflows do not fail during the generation of hundred page reports or complex data tables that require more processing time on the DocRaptor servers.
Ceven can handle this using the Create Document from Referrer tool if the target domain is registered and configured. For other protected pages, you must provide the necessary authentication headers or cookies within the HTML request payload. The agent can pull these credentials from your secure vault and inject them into the API call. If the site uses complex JavaScript based login flows, it is recommended to pass the raw HTML content directly to DocRaptor rather than a URL to ensure the rendered PDF contains the authenticated view of the page without session timeouts.
DocRaptor enforces different rate limits based on your subscription tier. If Ceven receives a 429 Too Many Requests error, the agent is programmed to use an exponential backoff strategy. It will pause the workflow and retry the request after a short delay, increasing the wait time between attempts. For very large batches, you can configure the agent to stagger the requests over a longer period to stay within your plan limits. If you consistently hit these limits, you can check your current usage via the DocRaptor dashboard and upgrade your tier to increase the number of concurrent requests allowed.
DocRaptor supports both PDF and Excel formats. When the agent calls the document creation tools, it specifies the desired output format in the request. For Excel, DocRaptor converts the HTML table structures into XLS or XLSX files. This is particularly useful for workflows that need to turn a web based data grid into a downloadable spreadsheet for finance teams. The agent can handle the mapping of HTML data to these formats and then deliver the resulting spreadsheet to your cloud storage or send it as an email attachment to the end user.
When DocRaptor renders a PDF, it must download images and CSS files from your servers. If your servers are behind a firewall, they will block these requests unless the DocRaptor IPs are allowed. Ceven can use the List IPs tools to pull the current set of DocRaptor IP addresses. You can then set up a workflow that periodically checks for IP changes and updates your cloud firewall rules automatically via your infrastructure API. This ensures that your documents always render with the correct images and styles without leaving your server completely open to the public internet.
No, DocRaptor does not provide permanent storage for your documents. The hosted async documents are stored temporarily to allow for retrieval after the rendering process is complete. Once you download the file using the download id, you should move it to your own permanent storage like Amazon S3 or Google Drive. Ceven automates this by immediately triggering a upload action to your storage provider as soon as the document is ready. If you wait too long to download a hosted document, it will be purged from the DocRaptor system according to their data retention policy.
Yes, DocRaptor has limits on the size of the request payload for synchronous calls. If your HTML content is exceptionally large, the request may be rejected or timeout before the PDF is generated. This is a known quirk of the synchronous API. To solve this, Ceven automatically switches to the asynchronous flow for payloads that exceed a certain size threshold. By using the Create Async Document tool, the agent avoids the synchronous request limit and allows the Prince engine to process the large HTML file in the background, notifying the workflow only when the final document is ready for download.

Alternatives to DocRaptor

Other tools that solve a similar problem. Ceven supports these too, so you can switch or run more than one at once.

Puppeteer logoPuppeteerpdfcrowd logopdfcrowdBrowserless logoBrowserless

Try Ceven on your stack

Plug Ceven on top of the tools you already run. Connect DocRaptor and the rest of your stack, describe the outcome, and its agents handle the work end to end, days of it in minutes.

Get started for free