Best Practices for Loading Real-time Data into Distributed Systems with Change Data Capture

Best Practices for Loading Real-time Data into Distributed Systems with Change Data Capture

Change Data Capture (CDC) has become a very efficient way to minimize resources required for an ETL process, as well as a useful instrument allowing to organize efficient replication schemas. We will cover the fundamental principals and restrictions of the CDC, look at some examples of how CDC is implemented in real life. This talk will be useful for developers and architects trying to achieve incremental updates of large data sets either in batches or in real-time. By the end of this session you will understand:

  • CDC design principles, architecture, and algorithms
  • Best practices for integrating with horizontally scalable systems
  • How to perform CDC from and to Apache Ignite / GridGain

Schedule:

Schedule

Room:

Room
Regency Ballroom B

Tracks:

Speakers
Alexey
Goncharuk
Chief Architect
at
GridGain Systems

Slides & Recordings

   Download Slides