Airbyte
Open-source data integration with hundreds of connectors
Airbyte is profiled here as a LLM tool for engineering teams. Read about features, pricing, and how it compares to related options in the tools directory.
Description
Airbyte is an open-source data integration engine founded in 2020 by Michel Tricot and John Lafleur. It moves data from APIs, databases, and files into warehouses, lakes, and vector stores through a catalog of hundreds of connectors, and a no-code builder turns any REST API into a new connector in an afternoon. Teams run it self-managed or through Airbyte Cloud. The platform self-manages on Docker or Kubernetes, and Airbyte Cloud handles hosting for teams that want managed pipelines without infrastructure work.
Key Capabilities:
Connector catalog covering 600+ sources and destinations
No-code Connector Builder plus a low-code CDK
Incremental syncs and change data capture replication
Vector database destinations for RAG data pipelines
PyAirbyte for running pipelines from Python code
Orchestration integrations with Airflow, Dagster, and Prefect
Alternative tools
- Tecton
Enterprise feature platform for real-time machine learning
- Feast
Open-source feature store for production machine learning
- Snowflake
Cloud data platform with built-in AI services
- Pydantic AI
Type-safe agent framework from the Pydantic team
- LangGraph
Stateful graph orchestration for production AI agents
- Jina AI
Search foundation models and web reading APIs
