eisig edited a comment on issue #779: HoodieDeltaStreamer may install duplicate record? URL: https://github.com/apache/incubator-hudi/issues/779#issuecomment-510828768 I have restart the job several times, and add --disable-compaction Ohter results seems wrong. ``` select count(*) count1, count(distinct id) count2 from hive200.test.t_order_mor03_rt select count(*) count3, count(distinct id) count4 from hive200.test.t_order_mor03 ``` count1 == count3 count2 == count4 count1 != count2 and count3 != count4 ``` select (select max(_hoodie_commit_time) from hive200.test.t_order_mor03), (select max(_hoodie_commit_time) from hive200.test.t_order_mor03_rt) ``` _hoodie_commit_time are always the some. ``` select count(*) count from hive200.test.t_order_mor03_rt rt join hive200.test.t_order_mor03 ro on ro.id = rt.id where rt.modify_date!=ro.modify_date ``` count is going up
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services
