eisig edited a comment on issue #779: HoodieDeltaStreamer may install duplicate 
record?
URL: https://github.com/apache/incubator-hudi/issues/779#issuecomment-510828768
 
 
    I have restarted the job several times
   and added --disable-compaction.
   The other results also seem wrong.
   
   ```
   select count(*) count1, count(distinct id) count2 from hive200.test.t_order_mor03_rt;
   select count(*) count3, count(distinct id) count4 from hive200.test.t_order_mor03;
   ```
   count1 == count3
   count2 == count4
   count1 != count2 and count3 != count4
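   
   Since count(*) is larger than count(distinct id), some ids must appear more than once. A minimal sketch of a follow-up query to list them is below. It assumes the standard Hudi metadata columns `_hoodie_partition_path` and `_hoodie_file_name` are exposed by this table, and the `limit 20` is arbitrary. If `partitions` is greater than 1 for an id, the copies live in different partitions.
   
   ```
   -- list ids that occur more than once, and how widely they are spread
   select id,
          count(*) copies,
          count(distinct _hoodie_partition_path) partitions,
          count(distinct _hoodie_file_name) files
   from hive200.test.t_order_mor03_rt
   group by id
   having count(*) > 1
   order by copies desc
   limit 20
   ```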
   
   ```
   select (select max(_hoodie_commit_time) from hive200.test.t_order_mor03),
   (select max(_hoodie_commit_time) from hive200.test.t_order_mor03_rt)
   ```
   The max _hoodie_commit_time values are always the same.
   
   ```
   select count(*) count
   from  hive200.test.t_order_mor03_rt rt 
    join hive200.test.t_order_mor03 ro
   on ro.id = rt.id
   where rt.modify_date!=ro.modify_date
   ```
   This count keeps going up.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services
