Change Data Capture (CDC) Definition
Change Data Capture (CDC)
Change data capture (CDC) tracks changes in a source dataset and automatically transfers those changes to a target dataset.
Changes are synced instantly or near-instantly. In practice, CDC is often used to replicate data between databases in real-time.
On a core level, change data capture: constantly tracks changes in a source database, immediately updates the target database, and uses stream processing to ensure instant changes.
With CDC, data sources include operational databases, applications, ERP mainframes, and other systems that record transactions or business occurrences. Targets include data lakes and data warehouses.
Change data capture enables teams to replicate data instantly and incrementally. CDC ensures data is always up to date. The source database and target database are continuously synced.