Serverless Change Data Capture (CDC) and Replication Service that allows you to synchronize data across heterogeneous databases, storage systems, and applications reliably and with minimal latency.
Based on gigabytes (GBs) processed. Usage is billed in per-byte increments, on a per-stream basis, and is stated in GB (500 MB is 0.5 GB, for example). Bytes are counted based on raw (uncompressed) data for CDC and Backfill.
In addition to Datastream costs, you're billed for resources used to transfer, store, or process data: Cloud Storage, Dataflow, and Networking.
Data streams from databases and Software-as-a-Service (SaaS) cloud services can feed a near-real-time data integration pipeline by loading data into BigQuery via Dataflow or Cloud Data Fusion.
Changes in databases are ingested into streaming pipelines such as with Dataflow for fraud detection, security event processing, and anomaly detection.
Availability of data changes in near-real-time powers artificial intelligence and machine learning applications to prevent churn or increase engagement via marketing efforts or by feeding back into production systems.
Each change in a source system is captured and recorded in a data store. This is helpful in cases where it is important to know all changes over time and not just the state of the database at the time of data extraction.
Datastream ingests a lot of data in real time from a variety of sources, and makes the data available for consumption in the destination.
Because Datastream natively writes only to Cloud Storage, replicating to other destinations requires a Dataflow template workaround. Supported replication destinations include: Cloud SQL, Spanner, BigQuery, Bigtable, MongoDB, and Databricks.
Q1. What is the native destination for Datastream output?
Q2. Which Oracle CDC method does Datastream use that avoids additional licenses?
Q3. If you need to replicate data from Datastream to BigQuery, what service is required as an intermediary?
Q4. What is the alternative to CDC if you only need occasional access and joining of external data?