25
loading...
This website collects cookies to deliver better user experience
How is Airbyte different from other ETL / ELT tools?
Traditional tools, such as Fivetran or StitchData, are closed-source and cloud-based. Your prices are indexed according to the volume of data you replicate. Open source allows Airbyte to offer better prices and cover all your connector needs, given the long tail of integrations, and all without any blocking mechanism.
How long does it take to replicate data with Airbyte?
There are several ways to deploy Airbyte. For example, using Docker compose, it takes 2 minutes to replicate data from Salesforce to Snowflake.
git clone https://github.com/airbytehq/airbyte.git
cd airbyte
docker-compose up
Sync schedule: when triggering a data synchronization.
Destination Namespace and stream names: where the data will end up being written.
A catalog selection: which streams and fields to replicate from the source.
Sync mode: how streams should be copied (read and write).
Optional transformations: Converting data from Airbyte protocol messages (raw JSON blob) into some other data representations.
{
"make": "alfa romeo",
"model": "4C coupe",
"horsepower": "247"
}
CREATE TABLE “_airbyte_raw_cars” (
-- metadata added by airbyte
“_airbyte_ab_id” VARCHAR,
-- uuid value assigned by connectors to each row of the data written in the destination.
“_airbyte_emitted_at” TIMESTAMP_WITH_TIMEZONE,
-- time at which the record was emitted.
“_airbyte_data” JSONB — data stored as a Json Blob.
);
CREATE TABLE "cars" (
"_airbyte_ab_id" VARCHAR,
"_airbyte_emitted_at" TIMESTAMP_WITH_TIMEZONE,
"_airbyte_cars_hashid" VARCHAR,
"_airbyte_normalized_at" TIMESTAMP_WITH_TIMEZONE, -- data from source
"make" VARCHAR,
"model" VARCHAR,
"horsepower" INTEGER
);
Incremental CDC only supports tables with primary keys.
A CDC source can still choose to replicate tables without primary keys like “Full Refresh” or a non-CDC source can be configured for the same database to replicate tables without primary keys using standard incremental replication.
Data must be in tables, not views.
The modifications you are trying to capture must be done using “DELETE / INSERT / UPDATE” for example, changes made to “TRUNCATE / ALTER” will not appear in the logs; therefore, in your destination.
Airbyte does not automatically support schema changes for “CDC” fonts. I recommend resetting and resynchronizing the data if you make a schema change.
There are database-specific limitations. See the individual connector documentation pages for more information.
Records produced by “DELETE” statements contain only primary keys. All other data fields are undefined.
“With Great Powers Come Great Responsibilities” -Stan Lee.