Question 1

How does Ceven handle large batches of documents in Aryn?

Accepted Answer

Ceven manages large uploads through Aryn's asynchronous task system. When you trigger a batch upload, the agent creates the necessary docsets and pushes the files, then polls the List Async Tasks endpoint to monitor progress. Instead of timing out, the workflow pauses and resumes once Aryn confirms that parsing is complete. You can set up a notification in Ceven to alert you via Slack or email as soon as the full batch is processed and ready for extraction. This means even batches of thousands of pages are handled reliably without anyone having to watch the Aryn dashboard.
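The poll-until-done pattern described above can be sketched roughly as follows. This is only an illustration, not Ceven's actual implementation: `list_tasks` is a hypothetical callable standing in for a call to the List Async Tasks endpoint, and the status names `pending` and `running` are assumptions.

```python
import time

# Assumed status names; the real endpoint may use different ones.
PENDING_STATES = {"pending", "running"}


def wait_for_batch(list_tasks, poll_interval=5.0, timeout=3600.0,
                   sleep=time.sleep, clock=time.monotonic):
    """Poll until no task is pending, then return the final task list.

    list_tasks is a hypothetical callable returning an iterable of
    dicts that each carry a "status" key.
    """
    deadline = clock() + timeout
    while True:
        tasks = list(list_tasks())
        if not any(task["status"] in PENDING_STATES for task in tasks):
            return tasks
        if clock() >= deadline:
            raise TimeoutError("batch did not finish before the timeout")
        sleep(poll_interval)
```

Passing `sleep` and `clock` as parameters keeps the loop easy to test without real waiting.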

Question 2

Can I review the extraction logic before running it on my data?

Accepted Answer

Yes. The agent uses the Generate plan tool to show you exactly how Aryn intends to query your documents. This logical plan outlines the steps the AI will take to find the requested data points. You can ask the Ceven agent to explain the plan in plain English or suggest modifications to the query parameters to improve accuracy. Once you are satisfied with the proposed approach, you can give the agent a command to execute the actual query, which prevents wasting processing credits on poorly formulated requests.

Question 3

What happens if Aryn cannot find a field in a document?

Accepted Answer

When Aryn fails to locate a specific field, it returns a null value or a low confidence score. Ceven is configured to detect these gaps and can trigger a fallback workflow. For example, the agent can move that document into a manual review queue or email the document provider to request a clearer scan. You can define the confidence threshold in your workflow settings so that only high-confidence data is pushed to your CRM, while everything else is flagged for human verification.
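As a rough illustration of the threshold logic, the sketch below splits extracted fields into accepted values and fields flagged for review. The field layout (a dict with `value` and `confidence` keys) and the default threshold of 0.8 are assumptions made for the example, not Aryn's documented response format.

```python
from typing import Any, Dict, List, Tuple


def route_extraction(fields: Dict[str, Dict[str, Any]],
                     threshold: float = 0.8) -> Tuple[Dict[str, Any], List[str]]:
    """Return (accepted values, names of fields needing human review).

    Assumed input shape: {"field_name": {"value": ..., "confidence": 0.93}}.
    """
    accepted: Dict[str, Any] = {}
    flagged: List[str] = []
    for name, field in fields.items():
        value = field.get("value")
        confidence = field.get("confidence", 0.0)
        # A null value or a score below the threshold goes to review.
        if value is None or confidence < threshold:
            flagged.append(name)
        else:
            accepted[name] = value
    return accepted, flagged
```

Only the `accepted` values would be pushed to the CRM; the `flagged` names would drive the fallback workflow.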

Question 4

Are there any limits on document sizes or types?

Accepted Answer

Aryn supports most common document formats, including PDF, JPG, and PNG. Be aware, however, that Aryn imposes strict rate limits on the number of concurrent asynchronous tasks, and the limit depends on your subscription tier. If a Ceven workflow attempts to launch too many simultaneous extraction jobs, Aryn returns a 429 error. Ceven handles this with an exponential backoff strategy: it queues the remaining tasks and processes them sequentially so that no data is lost, though this can increase the total processing time for very large datasets.
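A minimal sketch of exponential backoff on a 429 response is shown below. The `RateLimited` exception and the `submit` callable are hypothetical stand-ins for whatever client code detects the 429, and the retry count and delays are illustrative defaults rather than Ceven's real settings.

```python
import time


class RateLimited(Exception):
    """Hypothetical stand-in for an HTTP 429 response."""


def submit_with_backoff(submit, job, max_retries=5, base_delay=1.0,
                        sleep=time.sleep):
    """Call submit(job), doubling the wait after each rate-limit error."""
    for attempt in range(max_retries + 1):
        try:
            return submit(job)
        except RateLimited:
            if attempt == max_retries:
                raise  # give up after the final retry
            sleep(base_delay * 2 ** attempt)  # 1s, 2s, 4s, ...


def process_sequentially(submit, jobs, **kwargs):
    """Work through queued jobs one at a time so none are dropped."""
    return [submit_with_backoff(submit, job, **kwargs) for job in jobs]
```

Production code would typically also add random jitter to the delay so that many workflows do not retry at the same instant.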

Question 5

How does the binary retrieval work for audit trails?

Accepted Answer

For compliance and audit purposes, you often need the original file alongside the extracted data. Ceven uses the Get Document Binary action to pull the original file from Aryn and can simultaneously upload it to your own secure storage like AWS S3 or Google Drive. This creates a permanent link between the structured data in your database and the source document. The agent can automatically name these files using the extracted metadata, making it easy to find the original source of any specific data point during an audit.
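To illustrate the naming step, here is one way a storage filename could be built from extracted metadata. The metadata keys (`vendor`, `invoice_number`, `date`) and the naming pattern are assumptions chosen for the example; your own workflow would use whatever fields it extracts.

```python
import re


def audit_filename(metadata, doc_id, extension="pdf",
                   keys=("vendor", "invoice_number", "date")):
    """Build a storage-safe filename from metadata plus the document ID."""
    parts = [str(metadata[key]) for key in keys if metadata.get(key)]
    parts.append(doc_id)  # keeps the name unique and traceable
    # Replace anything other than letters, digits, and hyphens.
    cleaned = [re.sub(r"[^A-Za-z0-9-]+", "-", part).strip("-") for part in parts]
    slug = "_".join(part for part in cleaned if part)
    return f"{slug}.{extension}"
```

Including the document ID in the name means the stored file can still be matched to its record even when a metadata field is missing.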

Question 6

Can Ceven automate the creation of new docsets?

Accepted Answer

Absolutely. You can build a workflow where the agent creates a new Aryn docset based on a trigger, such as a new project being created in your project management tool. The agent calls the Create DocSet tool, names it according to your naming convention, and then begins routing all related project documents into that specific container. This ensures that your data remains organized and that queries are scoped to the correct set of documents, which improves both the speed and the accuracy of the AI extraction process.
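The trigger-to-docset flow might look something like the sketch below. Everything here is illustrative: `create_docset` is a hypothetical callable standing in for the Create DocSet tool, and the project fields and naming template are assumptions.

```python
def docset_name(project, template="{client}-{project_id}-docs"):
    """Apply an assumed naming convention to a project record."""
    return template.format(**project).lower().replace(" ", "-")


class DocsetRouter:
    """Remembers which docset belongs to which project."""

    def __init__(self, create_docset):
        # create_docset: hypothetical callable taking a name, returning an ID.
        self._create_docset = create_docset
        self._docset_by_project = {}

    def on_project_created(self, project):
        """Trigger handler: create the docset and record the mapping."""
        docset_id = self._create_docset(docset_name(project))
        self._docset_by_project[project["project_id"]] = docset_id
        return docset_id

    def docset_for(self, project_id):
        """Look up where a project's documents should be routed."""
        return self._docset_by_project[project_id]
```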

Question 7

Is my data used to train Aryn models?

Accepted Answer

Data privacy is handled at the Aryn account level. Generally, Aryn provides enterprise options to ensure that your uploaded documents and the resulting extractions are not used to train their global models. When you connect Aryn to Ceven, we only access the data necessary to execute your workflows. We do not store your documents on our own servers; we simply act as the orchestration layer that moves the data from Aryn to your destination system. You should verify your specific data processing agreement with Aryn to confirm your privacy settings.

Question 8

How do I handle documents that require multi-page analysis?

Accepted Answer

Aryn is designed to handle multi-page documents by treating the docset as a cohesive unit. When you run a query, the agent can request data that spans different pages or even different documents within the same set. For instance, you can ask for the sum of all invoice totals in a docset. Ceven manages the iteration, calling the necessary Aryn endpoints to gather the data and then performing the final calculation before delivering the result to your chosen output destination.
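The final aggregation step for the invoice example could be sketched like this. The record shape (a `doc_id` and a `total` per document) is an assumption for illustration; `Decimal` is used so that currency amounts are summed without floating-point rounding error.

```python
from decimal import Decimal


def total_invoices(extractions, field="total"):
    """Sum one field across documents; report documents missing it.

    Assumed record shape: {"doc_id": "a", "total": "100.10"}.
    """
    total = Decimal("0")
    missing = []
    for doc in extractions:
        raw = doc.get(field)
        if raw is None:
            missing.append(doc.get("doc_id"))
            continue
        total += Decimal(str(raw))
    return total, missing
```

Returning the list of documents with no value makes it clear when a grand total is incomplete, rather than silently treating missing amounts as zero.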
