Bright Data

Triggers large-scale web scraping jobs, bypasses anti-bot blocks, and pipes structured website data directly into your database or CRM.

Try Bright Data in Ceven


Why use Ceven?

  1. AI-native Bright Data integration

    • Describe the outcome and Ceven picks the right Bright Data calls, fills the parameters, and checks the result.
    • Structured, agent-friendly tool schemas so each call runs reliably instead of by guesswork.
    • Rich coverage for reading, writing, and querying your data in Bright Data, across all 10 of its actions.
  2. Managed auth

    • Built-in OAuth with automatic token refresh and rotation.
    • One place to manage, scope, and revoke Bright Data access.
    • Per-user and per-environment credentials instead of shared keys.
  3. Agent-optimized design

    • Actions are tuned from real success and error rates so reliability climbs over time.
    • Full execution logs so you always know what ran in Bright Data, when, and on whose behalf.
    • The agent pauses and asks when Bright Data is unclear instead of plowing ahead.
  4. Enterprise-grade security

    • Fine-grained access so you control which agents and people can reach Bright Data.
    • Least privilege by default: read scopes first, then only the writes a workflow needs.
    • A full audit trail of every Bright Data action to support review and sign off.

Supported tools

Every action Ceven's agents can run on Bright Data, and when to use it.

Trigger Site Crawl
Use this when you need to start a crawl for a given dataset and list of URLs to extract content across multiple pages.
Browse Available Scrapers
Use this when you need to browse available data sources for structured scraping in the marketplace.
Filter Dataset
Apply custom filter criteria to a marketplace dataset to generate a filtered snapshot of the data.
Get Available Cities
Pull a list of available static network cities for a given country to configure proxy endpoints.
Get Available Countries
List all available countries and their codes to configure zones before provisioning proxies.
Download Scraped Data
Retrieve the collected data from a completed crawl job using a specific snapshot ID.
Check Crawl Status
Check the processing status of a crawl job to ensure data collection is complete before downloading.
List Unlocker Zones
View your configured Web Unlocker zones and proxy endpoints for bot protection bypass.
SERP Search
Perform search engine results page searches to retrieve trending topics or competitive analysis data.
Web Unlocker
Use this when you need to scrape websites that block automated access or require JavaScript rendering.


Frequently asked questions

Ceven connects to Bright Data using your account API token. When a workflow requires a specific proxy zone or a Web Unlocker endpoint, the agent retrieves the zone credentials from your Bright Data dashboard and injects them into the request header. This ensures that your traffic is routed through the correct geographic location or residential network without you having to manually copy and paste passwords into every single workflow step. All credentials are encrypted and are only used to authenticate the session with the Bright Data gateway. You can rotate your API token in the Bright Data console at any time, which will require a quick update in the Ceven integration settings to resume service.
Yes. By using the Web Unlocker action, Ceven routes the request through Bright Data's specialized infrastructure, which handles browser fingerprinting and CAPTCHA solving automatically. The agent does not see the CAPTCHA; it simply receives the fully rendered HTML or JSON response once the block is bypassed. This is particularly useful for sites with aggressive anti-bot measures that would normally block a standard cloud server IP. The workflow can be set to retry with different proxy zones if the initial attempt fails, which raises the success rate for data extraction tasks that feed your daily business intelligence reports.
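The retry-across-zones behavior mentioned above could be sketched roughly as follows. This is an illustration only, not Ceven's or Bright Data's implementation: `fetch_with_zone_fallback`, the `fetch` callable, and the zone names are hypothetical and stand in for whatever performs a single request through a given zone.

```python
from __future__ import annotations

from typing import Callable, Sequence


class AllZonesFailed(Exception):
    """Raised when every zone in the list has been tried without success."""


def fetch_with_zone_fallback(
    url: str,
    zones: Sequence[str],
    fetch: Callable[[str, str], str],
) -> tuple[str, str]:
    """Try each zone in order and return (zone, body) for the first success.

    `fetch(url, zone)` is a hypothetical callable that performs one request
    through the given zone and raises an exception when the request fails.
    """
    errors: list[str] = []
    for zone in zones:
        try:
            return zone, fetch(url, zone)
        except Exception as exc:  # a real workflow would catch narrower errors
            errors.append(f"{zone}: {exc}")
    # Every zone failed: surface all collected errors in one exception.
    raise AllZonesFailed("; ".join(errors))
```

Passing the request function in as a parameter keeps the fallback logic independent of any particular HTTP client or endpoint.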
Ceven does not keep a connection open for the entire duration of a crawl. Instead, it uses an asynchronous polling pattern. The agent triggers the crawl and receives a snapshot ID. It then schedules a Check Crawl Status call at regular intervals. Once the status returns as complete, the agent automatically triggers the Download Scraped Data action. This means your workflow can pause for hours or even days without consuming active compute resources. You can configure the agent to send you a Slack or email notification the moment the data is ready and has been successfully pushed to your destination database.
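The trigger, poll, and download sequence described above can be sketched as below. This is a minimal illustration under stated assumptions: the three callables stand in for the Trigger Site Crawl, Check Crawl Status, and Download Scraped Data actions, and the status strings `"complete"` and `"failed"`, the function name `run_crawl`, and the default intervals are hypothetical rather than taken from the Bright Data API.

```python
import time
from typing import Callable


def run_crawl(
    trigger: Callable[[], str],
    check_status: Callable[[str], str],
    download: Callable[[str], list],
    poll_interval: float = 60.0,
    max_polls: int = 1000,
    sleep: Callable[[float], None] = time.sleep,
) -> list:
    """Trigger a crawl, poll until it completes, then download the snapshot."""
    snapshot_id = trigger()
    for _ in range(max_polls):
        status = check_status(snapshot_id)
        if status == "complete":  # assumed status value
            return download(snapshot_id)
        if status == "failed":  # assumed status value
            raise RuntimeError(f"crawl {snapshot_id} failed")
        # Nothing is held open between polls; the caller just waits.
        sleep(poll_interval)
    raise TimeoutError(f"crawl {snapshot_id} not complete after {max_polls} polls")
```

The `sleep` parameter is injectable so that a scheduler, or a test, can replace the blocking wait with its own mechanism.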
Ceven is limited by the specific plan and tier gating of your Bright Data account. For example, some marketplace datasets are only available on higher tiers, and certain proxy zones have strict bandwidth caps. If a workflow attempts to trigger a crawl that exceeds your remaining data balance, Bright Data will return a 403 error. Ceven captures this error and can be configured to alert you that your balance is low. It is important to monitor your Bright Data usage dashboard to ensure you have enough credits for the volume of requests your Ceven agents are making on your behalf.
The SERP Search action allows the agent to query Google, Bing, or Yahoo and receive structured results. Unlike a standard scrape, this returns a clean JSON object containing organic results, ads, and knowledge graph data. You can build a workflow where Ceven monitors specific keywords and triggers an alert when your brand or a competitor moves up or down in the rankings. Because this uses a dedicated API, it is much faster and more reliable than scraping the search page directly, as it avoids the frequent layout changes and aggressive rate limiting typically associated with search engine results pages.
Yes. You can use the Get Available Countries and Get Available Cities actions to dynamically configure your proxy settings. For example, if you are tracking local pricing for a retail chain, the agent can iterate through a list of cities, update the proxy zone to match that location, and then trigger the scrape. This ensures that the website serves the localized version of the content. Ceven manages the mapping between the city name and the ISO code required by the Bright Data API, making it easy to scale your data collection across different global regions without manual configuration.
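The per-city loop described above might look like the following sketch. It assumes two hypothetical callables: `list_cities`, standing in for the Get Available Cities action, and `scrape`, standing in for whichever action fetches the page through a proxy configured for that location. Neither name nor signature comes from Ceven or Bright Data.

```python
from typing import Callable, Dict, Iterable


def collect_by_city(
    url: str,
    country_code: str,
    list_cities: Callable[[str], Iterable[str]],
    scrape: Callable[[str, str, str], str],
) -> Dict[str, str]:
    """Fetch `url` once per available city and return {city: response body}."""
    results: Dict[str, str] = {}
    for city in list_cities(country_code):
        # Each request is made with the proxy location set to this city,
        # so the site serves its localized content.
        results[city] = scrape(url, country_code, city)
    return results
```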
When dealing with massive datasets, Ceven uses the Filter Dataset action to reduce the payload size before downloading. Instead of pulling millions of rows, the agent applies your specific criteria, such as a date range or a category filter, to create a smaller snapshot. Once the snapshot is ready, the agent streams the data in chunks to your destination to avoid memory overflows. This approach is essential for maintaining system stability when working with the scale of data Bright Data provides, ensuring that your internal tools are not overwhelmed by a single massive file upload.
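Streaming a snapshot in chunks, as described above, can be illustrated with a small generator. This is a sketch under assumptions: `rows` is any iterable of records, `write_chunk` is a hypothetical callable that writes one batch to the destination, and the default chunk size is arbitrary.

```python
from __future__ import annotations

from itertools import islice
from typing import Callable, Iterable, Iterator


def iter_chunks(rows: Iterable[dict], chunk_size: int) -> Iterator[list[dict]]:
    """Yield successive lists of at most `chunk_size` rows."""
    if chunk_size < 1:
        raise ValueError("chunk_size must be at least 1")
    iterator = iter(rows)
    while True:
        chunk = list(islice(iterator, chunk_size))
        if not chunk:
            return
        yield chunk


def stream_to_destination(
    rows: Iterable[dict],
    write_chunk: Callable[[list[dict]], None],
    chunk_size: int = 1000,
) -> int:
    """Write rows in batches and return the total number of rows written."""
    total = 0
    for chunk in iter_chunks(rows, chunk_size):
        write_chunk(chunk)  # only one chunk is held in memory at a time
        total += len(chunk)
    return total
```

Because `rows` can be a lazy iterator, memory use is bounded by the chunk size rather than by the size of the snapshot.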
Bright Data provides the infrastructure for accessing public data, but the responsibility for compliance lies with the user. Ceven simply acts as the orchestrator. We recommend that you configure your agents to respect robots.txt files and avoid scraping private or password-protected areas of a website. Bright Data includes tools to help you stay compliant, and you should use the agent to implement rate limiting and delays that mimic human behavior. Always review the terms of service of the target website to ensure your data collection activities are permitted and align with your legal requirements.

Alternatives to Bright Data

Other tools that solve a similar problem. Ceven supports these too, so you can switch or run more than one at once.

Smartproxy
Apify

Try Ceven on your stack

Plug Ceven on top of the tools you already run. Connect Bright Data and the rest of your stack, describe the outcome, and its agents handle the work end to end, finishing days of work in minutes.

Get started for free