Bright Data
Bright Data provides the world's #1 web data platform with Web Unlocker for bypassing anti-bot systems, SERP API for search engine data, and pre-made scrapers for popular websites. Collect any web data at scale.
Verdict
Common use cases
- Pull competitor pricing from e-commerce sites
- Monitor brand mentions across social platforms
- Collect SERP data for keyword research
- Scrape product reviews for sentiment analysis
- Gather training datasets for ML models
Integration
- Vendor
- Bright Data
- Category
- other
- Auth
- API_KEY
- Tools
- 10
- Composio slug
brightdata
Tools
- Browse Available Scrapers
Tool to list all available pre-made scrapers (datasets) from bright data's marketplace. use when you need to browse available data sources for structured scraping.
- Check Crawl Status
Tool to check the processing status of a crawl job using snapshot id. call before attempting to download results to ensure data collection is complete.
- Download Scraped Data
Tool to retrieve the scraped data from a completed crawl job by snapshot id. use after triggering a crawl or filtering a dataset to download the collected data.
- Filter Dataset
Tool to apply custom filter criteria to a marketplace dataset (beta). use after selecting a dataset to generate a filtered snapshot.
- Get Available Cities
Tool to get available static network cities for a given country. use when you need to configure static proxy endpoints after selecting a country.
- Get Available Countries
Tool to list available countries and their iso 3166-1 alpha-2 codes. use when you need to configure zones with valid country codes before provisioning proxies.
- List Unlocker Zones
Tool to list your configured web unlocker zones and proxy endpoints. use to view available zones for web scraping and bot protection bypass.
- SERP Search
Tool to perform serp (search engine results page) searches across different search engines. use when you need to retrieve search results, trending topics, or competitive analysis data.
- Trigger Site Crawl
Tool to trigger a site crawl job to extract content across multiple pages or entire domains. use when you need to start a crawl for a given dataset and list of urls.
- Web Unlocker
Tool to bypass bot detection, captcha, and other anti-scraping measures to extract content from websites. use when you need to scrape websites that block automated access or require javascript rendering.
Setup
Setup guide
- 11. Log into your Bright Data account at brightdata.com and navigate to Account Settings → API Tokens. 2. Generate a new API token with permissions for datasets and web unlocker zones. 3. In Switchy, open your workspace settings and click Integrations → Add MCP. 4. Search for Bright Data, select it, then paste your API token into the auth field and save. 5. Open any Space and type '@Bright Data browse available scrapers' to confirm the connection — you should see a list of marketplace datasets. 6. To test a crawl, pick a dataset ID from the list and run '@Bright Data filter dataset' with your criteria, then check status and download results using the snapshot ID returned.
What teammates see: by default, memories from Bright Data are scoped to the Space (PROJECT visibility) - you can mark any memory PRIVATE or share it ORG-wide.
Works well with
Top models
Compatibility data appears once enough Spaces have used this MCP together with a given model.
How Switchy teams use it
Starter prompts
List Marketplace Datasets
@Bright Data browse available scrapers and show me the first 20 datasets with their IDs and descriptionsOpen in a Space →
Run SERP Search
@Bright Data perform a SERP search for 'project management software' on Google US and return the top 10 organic results with titles and URLsOpen in a Space →
Check Crawl Progress
@Bright Data check the status of snapshot ID abc123 and tell me if the crawl is complete or still processingOpen in a Space →
Download Scraped Data
@Bright Data download the scraped data for snapshot ID abc123 and show me the first 50 records in a tableOpen in a Space →
List Proxy Zones
@Bright Data list my web unlocker zones and show the proxy endpoints for each zoneOpen in a Space →
Example outputs
Illustrative - representative of the model's voice and quality, not literal recordings.
@brightdata what are the top 10 trending products on Amazon for 'wireless headphones' in the US market right now?
I've initiated a SERP search for 'wireless headphones' on Amazon's US marketplace. The crawl job (snapshot ID: snap_abc123) is processing. Based on the preliminary results, trending products include: Sony WH-1000XM5 (4.6★, $398), Apple AirPods Pro 2nd Gen (4.7★, $249), Bose QuietComfort Ultra (4.5★, $429), and seven other models with high review counts and recent price drops. The full dataset includes pricing history, seller ratings, and availability status across 50+ listings. Download will be ready in approximately 2-3 minutes.
This example demonstrates SERP scraping combined with Bright Data's marketplace datasets. Requires an active API key with sufficient credits for Amazon data collection. Crawl times vary by dataset size (typically 1-5 minutes). Rate limits apply based on your subscription tier—enterprise plans support concurrent jobs while starter plans queue requests sequentially.
@brightdata I need to scrape product reviews from 20 specific Shopify store URLs for sentiment analysis. Can you set that up with US residential proxies?
I've configured a custom crawl job using your Web Unlocker zone (zone_us_residential_01) with US residential proxy rotation. The job will target your 20 Shopify URLs and extract review text, ratings, timestamps, and reviewer metadata. Snapshot ID: snap_xyz789. Current status: PROCESSING (15% complete, ~8 minutes remaining). This uses residential IPs to bypass bot detection—each request rotates through different US cities. Once complete, you'll receive structured JSON with approximately 400-600 reviews depending on per-product review counts.
This showcases custom URL scraping with proxy configuration—critical for e-commerce sites with aggressive bot protection. Residential proxies consume more credits than datacenter IPs but have higher success rates. You must have a configured Unlocker zone with US residential access. Failed requests (4xx/5xx errors) still count toward your quota, so validate URLs beforehand to avoid wasted credits.
@brightdata browse available scrapers for social media platforms, then check if my Instagram hashtag crawl from yesterday finished processing
**Available Social Media Scrapers:** I found 12 pre-made datasets including Instagram Posts & Hashtags, TikTok Video Metadata, LinkedIn Company Profiles, Twitter/X Trending Topics, and YouTube Channel Analytics. Each supports custom filtering by date range, engagement metrics, and geographic targeting. **Crawl Status Check:** Your Instagram hashtag crawl (snapshot ID: snap_insta_456 from 2024-01-15) shows status: COMPLETED. Collected 3,847 posts across 5 hashtags with engagement data. Ready for download—dataset size is 12.3 MB (JSON format).
This example chains two read operations: browsing the marketplace catalog and checking job status. Useful for discovering what data sources Bright Data supports before committing credits. Status checks are free API calls, but downloading data consumes credits proportional to dataset size. Social media scrapers often have platform-specific rate limits and may require additional compliance review for commercial use cases.
Use-case deep-dives
When Bright Data beats manual scraping for pricing intel
A 6-person e-commerce team needs to track competitor pricing across 40 SKUs daily. Bright Data's Browse Available Scrapers tool surfaces pre-built extractors for major marketplaces (Amazon, eBay, Shopify stores), which means the team skips the brittle XPath maintenance that kills homegrown scrapers. The Filter Dataset tool lets them narrow to specific categories before triggering crawls, and Check Crawl Status prevents premature downloads. The trade-off: if you're scraping niche B2B sites without pre-made scrapers, you're back to custom solutions or paying Bright Data's professional services. This MCP wins when your targets are mainstream e-commerce platforms and your team would rather spend time analyzing price trends than debugging Puppeteer scripts. If you're tracking fewer than 10 competitors, a simpler tool like Apify's MCP is cheaper and easier to configure.
SERP tracking without the agency markup
A 3-person content marketing team publishes 8 blog posts monthly and needs to track how their target keywords rank against competitors. Bright Data's SERP Search tool pulls live Google, Bing, or DuckDuckGo results with position data, which feeds directly into their editorial calendar decisions. The Get Available Countries tool lets them geo-target searches to match their audience distribution (US, UK, Canada). The limitation: SERP data costs scale with query volume, so teams running hundreds of daily checks hit budget ceilings fast. This MCP makes sense when you're doing strategic keyword research (10-30 queries per week) rather than enterprise-scale rank tracking. If your SEO workflow is more than 50 queries daily, dedicated tools like SEMrush or Ahrefs offer better per-query economics and built-in trend analysis.
When pre-built scrapers justify the API key overhead
A 5-person sales team at a vertical SaaS company needs to build a prospect list from industry directories, review sites, and local business listings. Bright Data's marketplace scrapers (Browse Available Scrapers) cover common sources like Yelp, Yellow Pages, and Trustpilot, which means the team can trigger crawls and download structured contact data (Download Scraped Data) without writing parsers. The List Unlocker Zones tool handles anti-bot protections that would otherwise block requests. The catch: each dataset has its own schema, so the team still needs to normalize fields across sources before CRM import. This MCP is the right call when your lead sources are mainstream directories and you're pulling 500-5000 records monthly. Below that volume, manual copy-paste is faster; above it, you need a dedicated data vendor with CRM integrations already built.
Frequently asked
What does the Bright Data MCP let me do in Switchy?
It connects Switchy to Bright Data's web scraping infrastructure. You can browse pre-made scrapers, trigger crawls, check job status, download results, and run SERP searches across search engines. The MCP also exposes proxy zone management and geo-targeting tools. Think of it as turning Bright Data's marketplace and unlocker APIs into conversational commands your team can run from any Switchy chat.
Do I need a Bright Data account to use this MCP?
Yes. You need an active Bright Data subscription and an API key. The MCP authenticates with that key, so you're billed directly by Bright Data for crawls, proxy usage, and dataset queries. Switchy doesn't resell Bright Data credits or manage your account—it just routes requests through their API on your behalf.
Can this MCP scrape any website I want?
Only if Bright Data supports it. The MCP uses Bright Data's pre-made scrapers and web unlocker zones, so you're limited to the sites they've built collectors for or that their proxy network can reach. If a target isn't in their marketplace and their unlocker can't bypass its bot protection, you'll need a custom solution outside this integration.
Why use this MCP instead of calling Bright Data's API directly?
The MCP wraps Bright Data's REST endpoints in natural-language tools. Instead of writing curl commands or Python scripts to trigger a crawl, check status, and parse JSON, you describe what you want in a Switchy chat. The AI picks the right tool sequence. It's faster for ad-hoc research and non-engineers, but you lose fine-grained control over request parameters.
Who on my team should connect the Bright Data MCP?
Whoever holds your Bright Data API key and understands your scraping budget. That's usually a data engineer or growth lead. Once connected, any Switchy workspace member can trigger crawls, so set clear usage guidelines. Bright Data charges per request and data volume, so uncontrolled access can rack up costs quickly.