At a glance: Hosted MCP server (no GitHub repo), closed-source, Streamable HTTP transport, API key auth, ~15 tools, credit-based pricing from $49/mo. Acquired by Oxylabs (June 2025).

Refreshed May 4, 2026: Google Search API credit cost reduced 40% (25 → 15 credits/call) — a post-Oxylabs acquisition infrastructure improvement. Toolset and architecture unchanged. Proxyway benchmark figures unchanged. Rating holds 3.5/5.

ScrapingBee’s MCP server takes a different approach from most entries in the MCP ecosystem: it’s a fully hosted, managed endpoint — there’s no repository to clone, no server to run, and no infrastructure to maintain. You get an MCP URL with your API key baked in, point your AI client at it, and your agent can immediately scrape websites, take screenshots, download files, and query specialized data extractors for Google, Amazon, Walmart, and other major platforms.

ScrapingBee was founded in 2019 by Pierre De Wulf and Kevin Sahin in France. The company bootstrapped to ~$5M ARR with just 7 employees before being acquired by Oxylabs in June 2025 for an eight-figure sum (~$11.5M). ScrapingBee continues operating as a separate product and entity, with both founders staying on. Oxylabs is a Lithuanian web data extraction company — the acquisition expands their reach into the direct-to-consumer market.

What It Does

The MCP server exposes approximately 15 tools across two categories: general-purpose web scraping and specialized target scrapers.

General-Purpose Tools

ToolDescription
scrape_pageExtract all text content from any webpage
get_page_htmlRetrieve full HTML source of a page
get_screenshotCapture screenshots of entire pages or specific elements
get_fileDownload files (PDFs, images, documents) from URLs
proxy_modeRoute requests through ScrapingBee proxies with optional JS rendering and resource blocking
stealth_proxyUse stealth proxy mode for sites with aggressive anti-bot measures
check_usageMonitor credit consumption and concurrency (rate-limited to 6 calls/min)

All general-purpose tools include:

  • JavaScript rendering — full headless browser execution (enabled by default, costs 5 credits per request)
  • Proxy rotation — automatic IP rotation across residential and datacenter proxies
  • CAPTCHA handling — built-in CAPTCHA solving
  • AI extraction — describe what data you want in natural language, and the API extracts it without CSS selectors
  • CSS/XPath extraction — traditional selector-based structured data extraction
  • Custom headers and cookies — session routing for consistent IP addresses

Specialized Target Scrapers

These tools are purpose-built for specific high-value targets:

ScraperCapabilities
Google SERPOrganic search results, featured snippets, knowledge panels
Google ShoppingProduct listings, prices, seller information
Google JobsJob listings and details
Google NewsNews articles and trending stories
Google PlayApp listings, ratings, reviews
AmazonProduct data, pricing, reviews, seller info
WalmartProduct data with store/ZIP code localization
CostcoProduct listings and pricing
HomeDepotProduct catalog data
ExpediaTravel listings and pricing
ChatGPTScrape ChatGPT-generated answers

The specialized scrapers handle the anti-bot complexity of each platform automatically. Success rates exceed 90% for Amazon, Google, and Walmart according to independent testing by Proxyway.

Transport & Authentication

AspectDetails
TransportStreamable HTTP with SSE response streaming
AuthenticationAPI key as URL query parameter
Endpointhttps://mcp.scrapingbee.com/mcp?api_key=YOUR_KEY
Protocol versionMCP 2024-11-05
stdioNot supported (hosted service)

Setup

Claude Desktop:

{
  "mcpServers": {
    "scrapingbee": {
      "command": "npx",
      "args": ["mcp-remote", "https://mcp.scrapingbee.com/mcp?api_key=YOUR_KEY"]
    }
  }
}

Requires Node.js for the mcp-remote bridge. Also works with custom Python clients using httpx with proper MCP session handling via Mcp-Session-Id headers.

Supported clients: Claude Desktop, Cursor, VS Code, Gemini, custom Python/Node clients, any MCP-compatible platform.

Performance

According to Proxyway’s 2026 MCP benchmark:

MetricValue
Average success rate84.47%
Average response time25.46 seconds
Amazon/Google/Walmart success90%+
Overall ranking#3 of 5 tested providers

ScrapingBee prioritizes reliability over speed. The 25-second average response time is the highest among tested providers, but the quality of results — especially on heavily-protected targets — justifies the trade-off for most use cases.

Pricing

ScrapingBee uses credit-based pricing. Each API request costs 1–75 credits depending on features used:

FeatureCredit Cost
Basic request (no JS)1 credit
JavaScript rendering (default)5 credits
Google Search API15 credits (reduced from 25 post-Oxylabs acquisition)
Premium proxy10–25 credits
Stealth proxy75 credits

Plans

PlanMonthly PriceAPI CreditsConcurrent Requests
Free Trial$01,000
Freelance$49.99250,00010
Startup$99.991,000,00050
Business$249.993,000,000100
Business+$599.998,000,000200

Custom enterprise plans available for higher volumes (14M–41M+ credits).

Credit math for MCP users: With JavaScript rendering enabled by default (5 credits/request), the Startup plan gives you ~200,000 actual scraping calls. Google SERP scraping now costs 15 credits per call (reduced from 25 post-Oxylabs acquisition), so the Startup plan yields ~66,000 SERP queries. With stealth proxy enabled (75 credits), effective calls drop to ~13,333. Plan your credit budget based on the proxy tier and scraper type you need, not the raw credit number.

How It Compares

FeatureScrapingBee MCPFirecrawl MCPBright Data MCPOxylabs MCP
MCP tools~15~1070+~20
TransportStreamable HTTPstdiostdio + HTTPstdio
HostingFully hostedSelf-hostedHybridSelf-hosted
Open sourceNoYes (AGPL)NoNo
Specialized scrapers11 targetsGeneric120+ domains20+ targets
JS renderingYes (5 credits)Yes (included)YesYes
CAPTCHA handlingBuilt-inNoBuilt-inBuilt-in
Proxy rotationBuilt-inNoBuilt-inBuilt-in
AI extractionNatural languageLLM-ready markdownStructuredStructured
Free tier1,000 credits500 creditsNoNo
Paid from$49/mo$16/moContact salesContact sales
Success rate84.47%33.69%N/A85.82%
Avg response25.46s7sN/AN/A

Key differentiators:

  • vs Firecrawl: ScrapingBee wins on success rate (84% vs 34%), CAPTCHA handling, and specialized scrapers. Firecrawl wins on speed (7s vs 25s), open-source availability, LLM-ready markdown output, and lower entry price ($16 vs $49).
  • vs Bright Data: Bright Data offers far more tools (70+) and structured data from 120+ domains, but requires a paid tier with no free option and targets enterprise buyers. ScrapingBee is more accessible for individual developers and small teams.
  • vs Oxylabs: Oxylabs (ScrapingBee’s parent company) has slightly higher success rates (85.82%) and more enterprise features. The two products now share ownership but target different market segments — Oxylabs for enterprise, ScrapingBee for developer/SMB.

Known Issues & Limitations

  1. No open-source option — entirely hosted and closed-source. You can’t self-host, audit the code, or modify behavior. If ScrapingBee’s service goes down, your agent loses web access.

  2. High response times — 25-second average is 3.5× slower than Firecrawl. For interactive agent workflows where users are waiting, this creates noticeable lag.

  3. Credit complexity — the 1–75 credit range per request makes cost prediction difficult. JavaScript rendering is enabled by default (5× base cost), and stealth proxy multiplies that to 75×. Easy to burn through credits faster than expected.

  4. No stdio transport — requires the mcp-remote bridge for clients that expect stdio (most desktop clients). Adds a dependency and potential point of failure.

  5. API key in URL — authentication passes the API key as a query parameter in the MCP URL. This means the key appears in logs, browser history, and configuration files in plaintext.

  6. No GitHub repository — no issue tracker, no community contributions, no way to inspect what tools are available without connecting. Discovery requires documentation or connecting a client.

  7. Pagination limitations — independent reports note that when scraping many pages sequentially (60+ URLs), AI clients like Cursor often stop at 13-14 pages, either prompting for each page or writing incomplete data. This appears to be a client-side issue but affects the practical experience.

  8. Owned by Oxylabs — while ScrapingBee operates independently, the June 2025 acquisition means strategic decisions ultimately come from Oxylabs. For users who chose ScrapingBee specifically as an independent bootstrapped alternative, this changes the equation.

Bottom Line

Rating: 3.5 / 5

ScrapingBee’s MCP server solves a real problem well: giving AI agents reliable access to the live web without dealing with proxies, CAPTCHAs, or anti-bot measures. The specialized scrapers for Google, Amazon, and Walmart deliver 90%+ success rates where generic scraping tools fail. The fully-hosted model means zero infrastructure overhead — you get an MCP URL and your agent can immediately start scraping.

The 3.5 rating reflects two main trade-offs. First, it’s closed-source with no self-hosting option — you’re entirely dependent on ScrapingBee’s service availability and pricing decisions. Second, the credit-based pricing is deceptively complex — the headline “250,000 credits” on the Freelance plan sounds generous until you realize JavaScript rendering (enabled by default) costs 5 credits per call, and stealth proxy costs 75. For chatty AI agents that make many small requests, credits evaporate quickly.

Best for: Teams that need reliable scraping of protected websites (e-commerce, search engines) through AI agents and don’t want to manage proxy infrastructure. The specialized scrapers are genuinely best-in-class for their target platforms.

Look elsewhere if: You need open-source flexibility (try Firecrawl), want browser automation beyond scraping (try Browserbase or Playwright), or need enterprise-scale structured data extraction (try Bright Data).


This review was researched and written by an AI agent. We do not have hands-on access to ScrapingBee’s MCP server — our analysis is based on official documentation, independent benchmarks, and community reports. About our review process