otherapi_key

Bright Data

Bright Data provides the world's #1 web data platform with Web Unlocker for bypassing anti-bot systems, SERP API for search engine data, and pre-made scrapers for popular websites. Collect any web data at scale.

Verdict

Bright Data gives your team programmatic access to web scraping infrastructure and proxy networks without writing scraper code. @mention it to pull structured data from marketplace datasets (product listings, reviews, social posts), run custom SERP searches across Google/Bing/etc., or route requests through residential proxies in specific cities. Analysts and growth teams use it to gather competitive intelligence, monitor pricing, or collect training data. Setup requires an API key from your Bright Data dashboard — you'll need an active subscription with credits to run crawls. Jobs can take minutes to hours depending on dataset size.

Common use cases

  • Pull competitor pricing from e-commerce sites
  • Monitor brand mentions across social platforms
  • Collect SERP data for keyword research
  • Scrape product reviews for sentiment analysis
  • Gather training datasets for ML models

Integration

Vendor
Bright Data
Category
other
Auth
API_KEY
Tools
10
Composio slug
brightdata

Tools

  • Browse Available Scrapers

    Tool to list all available pre-made scrapers (datasets) from bright data's marketplace. use when you need to browse available data sources for structured scraping.

  • Check Crawl Status

    Tool to check the processing status of a crawl job using snapshot id. call before attempting to download results to ensure data collection is complete.

  • Download Scraped Data

    Tool to retrieve the scraped data from a completed crawl job by snapshot id. use after triggering a crawl or filtering a dataset to download the collected data.

  • Filter Dataset

    Tool to apply custom filter criteria to a marketplace dataset (beta). use after selecting a dataset to generate a filtered snapshot.

  • Get Available Cities

    Tool to get available static network cities for a given country. use when you need to configure static proxy endpoints after selecting a country.

  • Get Available Countries

    Tool to list available countries and their iso 3166-1 alpha-2 codes. use when you need to configure zones with valid country codes before provisioning proxies.

  • List Unlocker Zones

    Tool to list your configured web unlocker zones and proxy endpoints. use to view available zones for web scraping and bot protection bypass.

  • SERP Search

    Tool to perform serp (search engine results page) searches across different search engines. use when you need to retrieve search results, trending topics, or competitive analysis data.

  • Trigger Site Crawl

    Tool to trigger a site crawl job to extract content across multiple pages or entire domains. use when you need to start a crawl for a given dataset and list of urls.

  • Web Unlocker

    Tool to bypass bot detection, captcha, and other anti-scraping measures to extract content from websites. use when you need to scrape websites that block automated access or require javascript rendering.

Setup

Setup guide

  1. 11. Log into your Bright Data account at brightdata.com and navigate to Account Settings → API Tokens. 2. Generate a new API token with permissions for datasets and web unlocker zones. 3. In Switchy, open your workspace settings and click Integrations → Add MCP. 4. Search for Bright Data, select it, then paste your API token into the auth field and save. 5. Open any Space and type '@Bright Data browse available scrapers' to confirm the connection — you should see a list of marketplace datasets. 6. To test a crawl, pick a dataset ID from the list and run '@Bright Data filter dataset' with your criteria, then check status and download results using the snapshot ID returned.

What teammates see: by default, memories from Bright Data are scoped to the Space (PROJECT visibility) - you can mark any memory PRIVATE or share it ORG-wide.

Works well with

Top models

Compatibility data appears once enough Spaces have used this MCP together with a given model.

How Switchy teams use it

Not enough Spaces yet to publish anonymised usage stats (we require ≥ 50 Spaces per week).

Starter prompts

List Marketplace Datasets

@Bright Data browse available scrapers and show me the first 20 datasets with their IDs and descriptions
Open in a Space →

Check Crawl Progress

@Bright Data check the status of snapshot ID abc123 and tell me if the crawl is complete or still processing
Open in a Space →

Download Scraped Data

@Bright Data download the scraped data for snapshot ID abc123 and show me the first 50 records in a table
Open in a Space →

List Proxy Zones

@Bright Data list my web unlocker zones and show the proxy endpoints for each zone
Open in a Space →

Example outputs

Illustrative - representative of the model's voice and quality, not literal recordings.

Prompt

@brightdata what are the top 10 trending products on Amazon for 'wireless headphones' in the US market right now?

Output

I've initiated a SERP search for 'wireless headphones' on Amazon's US marketplace. The crawl job (snapshot ID: snap_abc123) is processing. Based on the preliminary results, trending products include: Sony WH-1000XM5 (4.6★, $398), Apple AirPods Pro 2nd Gen (4.7★, $249), Bose QuietComfort Ultra (4.5★, $429), and seven other models with high review counts and recent price drops. The full dataset includes pricing history, seller ratings, and availability status across 50+ listings. Download will be ready in approximately 2-3 minutes.

Notes

This example demonstrates SERP scraping combined with Bright Data's marketplace datasets. Requires an active API key with sufficient credits for Amazon data collection. Crawl times vary by dataset size (typically 1-5 minutes). Rate limits apply based on your subscription tier—enterprise plans support concurrent jobs while starter plans queue requests sequentially.

Prompt

@brightdata I need to scrape product reviews from 20 specific Shopify store URLs for sentiment analysis. Can you set that up with US residential proxies?

Output

I've configured a custom crawl job using your Web Unlocker zone (zone_us_residential_01) with US residential proxy rotation. The job will target your 20 Shopify URLs and extract review text, ratings, timestamps, and reviewer metadata. Snapshot ID: snap_xyz789. Current status: PROCESSING (15% complete, ~8 minutes remaining). This uses residential IPs to bypass bot detection—each request rotates through different US cities. Once complete, you'll receive structured JSON with approximately 400-600 reviews depending on per-product review counts.

Notes

This showcases custom URL scraping with proxy configuration—critical for e-commerce sites with aggressive bot protection. Residential proxies consume more credits than datacenter IPs but have higher success rates. You must have a configured Unlocker zone with US residential access. Failed requests (4xx/5xx errors) still count toward your quota, so validate URLs beforehand to avoid wasted credits.

Prompt

@brightdata browse available scrapers for social media platforms, then check if my Instagram hashtag crawl from yesterday finished processing

Output

**Available Social Media Scrapers:** I found 12 pre-made datasets including Instagram Posts & Hashtags, TikTok Video Metadata, LinkedIn Company Profiles, Twitter/X Trending Topics, and YouTube Channel Analytics. Each supports custom filtering by date range, engagement metrics, and geographic targeting. **Crawl Status Check:** Your Instagram hashtag crawl (snapshot ID: snap_insta_456 from 2024-01-15) shows status: COMPLETED. Collected 3,847 posts across 5 hashtags with engagement data. Ready for download—dataset size is 12.3 MB (JSON format).

Notes

This example chains two read operations: browsing the marketplace catalog and checking job status. Useful for discovering what data sources Bright Data supports before committing credits. Status checks are free API calls, but downloading data consumes credits proportional to dataset size. Social media scrapers often have platform-specific rate limits and may require additional compliance review for commercial use cases.

Use-case deep-dives

Competitive pricing research at e-commerce scale

When Bright Data beats manual scraping for pricing intel

A 6-person e-commerce team needs to track competitor pricing across 40 SKUs daily. Bright Data's Browse Available Scrapers tool surfaces pre-built extractors for major marketplaces (Amazon, eBay, Shopify stores), which means the team skips the brittle XPath maintenance that kills homegrown scrapers. The Filter Dataset tool lets them narrow to specific categories before triggering crawls, and Check Crawl Status prevents premature downloads. The trade-off: if you're scraping niche B2B sites without pre-made scrapers, you're back to custom solutions or paying Bright Data's professional services. This MCP wins when your targets are mainstream e-commerce platforms and your team would rather spend time analyzing price trends than debugging Puppeteer scripts. If you're tracking fewer than 10 competitors, a simpler tool like Apify's MCP is cheaper and easier to configure.

SEO keyword monitoring for content teams

SERP tracking without the agency markup

A 3-person content marketing team publishes 8 blog posts monthly and needs to track how their target keywords rank against competitors. Bright Data's SERP Search tool pulls live Google, Bing, or DuckDuckGo results with position data, which feeds directly into their editorial calendar decisions. The Get Available Countries tool lets them geo-target searches to match their audience distribution (US, UK, Canada). The limitation: SERP data costs scale with query volume, so teams running hundreds of daily checks hit budget ceilings fast. This MCP makes sense when you're doing strategic keyword research (10-30 queries per week) rather than enterprise-scale rank tracking. If your SEO workflow is more than 50 queries daily, dedicated tools like SEMrush or Ahrefs offer better per-query economics and built-in trend analysis.

Lead generation from public directories

When pre-built scrapers justify the API key overhead

A 5-person sales team at a vertical SaaS company needs to build a prospect list from industry directories, review sites, and local business listings. Bright Data's marketplace scrapers (Browse Available Scrapers) cover common sources like Yelp, Yellow Pages, and Trustpilot, which means the team can trigger crawls and download structured contact data (Download Scraped Data) without writing parsers. The List Unlocker Zones tool handles anti-bot protections that would otherwise block requests. The catch: each dataset has its own schema, so the team still needs to normalize fields across sources before CRM import. This MCP is the right call when your lead sources are mainstream directories and you're pulling 500-5000 records monthly. Below that volume, manual copy-paste is faster; above it, you need a dedicated data vendor with CRM integrations already built.

Frequently asked

What does the Bright Data MCP let me do in Switchy?

It connects Switchy to Bright Data's web scraping infrastructure. You can browse pre-made scrapers, trigger crawls, check job status, download results, and run SERP searches across search engines. The MCP also exposes proxy zone management and geo-targeting tools. Think of it as turning Bright Data's marketplace and unlocker APIs into conversational commands your team can run from any Switchy chat.

Do I need a Bright Data account to use this MCP?

Yes. You need an active Bright Data subscription and an API key. The MCP authenticates with that key, so you're billed directly by Bright Data for crawls, proxy usage, and dataset queries. Switchy doesn't resell Bright Data credits or manage your account—it just routes requests through their API on your behalf.

Can this MCP scrape any website I want?

Only if Bright Data supports it. The MCP uses Bright Data's pre-made scrapers and web unlocker zones, so you're limited to the sites they've built collectors for or that their proxy network can reach. If a target isn't in their marketplace and their unlocker can't bypass its bot protection, you'll need a custom solution outside this integration.

Why use this MCP instead of calling Bright Data's API directly?

The MCP wraps Bright Data's REST endpoints in natural-language tools. Instead of writing curl commands or Python scripts to trigger a crawl, check status, and parse JSON, you describe what you want in a Switchy chat. The AI picks the right tool sequence. It's faster for ad-hoc research and non-engineers, but you lose fine-grained control over request parameters.

Who on my team should connect the Bright Data MCP?

Whoever holds your Bright Data API key and understands your scraping budget. That's usually a data engineer or growth lead. Once connected, any Switchy workspace member can trigger crawls, so set clear usage guidelines. Bright Data charges per request and data volume, so uncontrolled access can rack up costs quickly.

Data last verified 607 hours ago.Sources aggregated hourly to weekly. See docs/architecture/model-directory.md.