Experts,

Can you please suggest the strategies with some code samples for doing Change 
Data Capture in Hive?

Right now, we are staging lot of tables from Oracle to Oracle with change data 
capture (mainly Type 2).   We are planning to migrate all the staging tables to 
Hadoop.
We do bulk import on day 1 and then followed by incremental imports which 
contains inserts, updates and deletes.

How can we apply these inserts, updates and deletes on Hive tables to achieve 
Type2, Type1 and Type 0 based CDC?

CDC Type 0: New Records only; Discard Changes & Deletes
CDC Type 1: Overwrite
CDC Type 2: Expire and Create New Record

Regards,
Venkat Ankam

Reply via email to