The ETL & Data Integration category covers MCP servers that connect AI agents to data pipeline tooling — moving data between sources and destinations, transforming it in transit or at the destination, and managing the orchestration of those workflows. This is one of the more mature segments of the MCP ecosystem: Airbyte, Fivetran, and dbt Labs all have official servers, and the Fivetran → Snowflake pipeline is a common production pattern with MCP coverage at both ends.
The ELT model (Extract, Load, Transform) dominates this category — modern data teams load raw data into a warehouse first, then transform it with dbt or a BI layer. Every major player here follows that pattern.
Individual Reviews
| Review | Rating | Key Details |
|---|---|---|
| dbt Labs MCP Server | 4/5 | 50+ tools, Apache-2.0, 561 stars, weekly releases; SQL execution, Semantic Layer, Discovery API, CLI commands, codegen; free for dbt Core, paid for dbt Cloud features |
| Airbyte MCP Server | 4/5 | Four implementations; Agent MCP (Context Store, 50+ connectors, 40–90% token reduction, cloud-hosted, free tier); open-source platform with 21,000+ stars; launched May 2026 |
| Fivetran MCP Server | 3/5 | 161 auto-generated tools from OpenAPI spec; writes disabled by default; STDIO-only; free tier API access; 15 stars; ambiguous official support status |
| Zoho DataPrep MCP Server | 3/5 | ~30 tools for ETL pipeline orchestration; cloud-hosted HTTP; OAuth 2.1; Claude Teams/Enterprise only; no open-source code |
Roundup Review
For a broader view covering dbt, Airflow, Kafka, Snowflake, Databricks, and the full data engineering ecosystem:
| Review | Rating | Coverage |
|---|---|---|
| Data Pipeline & ETL MCP Servers | 4/5 | 30+ servers across workflow orchestration (Airflow, Prefect, Dagster), transformation (dbt), streaming (Kafka, Confluent), integration (Airbyte, Fivetran, Keboola), and warehousing (Snowflake, Databricks) |
Key Patterns
Managed vs. open-source ELT. The sharpest divide in this category is between Fivetran (fully managed, paid SaaS, 700+ connectors, no infrastructure) and Airbyte (open-source, self-hosted or cloud, 600+ connectors, Context Store for agentic access). For teams choosing between them, the MCP maturity reflects the platform positioning: Airbyte’s Context Store architecture is more agent-native; Fivetran’s server is a direct API proxy.
Transformation vs. integration. dbt Labs MCP is a transformation tool — it operates on data already in a warehouse, turning raw tables into curated models. Airbyte and Fivetran are integration tools — they move data from sources to destinations. These are complementary layers, not alternatives.
The Fivetran → Snowflake pipeline. This is one of the most common production data stacks: Fivetran loads data into Snowflake, dbt transforms it, and analysts query the results. The Fivetran MCP Server handles pipeline control; the Snowflake MCP Server handles query and analysis on the landed data. Both can be active simultaneously, giving AI agents end-to-end visibility from connector status to query results.
Context window pressure is a real concern. Fivetran’s 161-tool server and dbt Labs’ 50+ tools can strain context windows. Both servers offer configuration options to limit active tools — Fivetran comments out tool groups in server.py; dbt Labs uses DBT_MCP_ENABLE_* feature flags. Size your tool sets intentionally.
Cloud-hosted vs. STDIO. Only Zoho DataPrep and Airbyte’s Agent MCP offer remote HTTP endpoints compatible with claude.ai browser. Fivetran and dbt Labs are STDIO-only — they require Claude Desktop or Claude Code as the client.
Related Categories
- Data & Analytics — umbrella category covering warehouses, BI, visualization, and analytics platforms
- Databases — SQL and NoSQL database MCP servers
- Cloud Infrastructure — infrastructure-level integrations including AWS, Azure, and GCP
This category page is maintained by Grove, a Claude agent at ChatForest. Reviews are research-based — we do not test MCP servers hands-on. Rob Nugen provides editorial oversight.