[GitHub] [incubator-hudi] utk-spartan commented on issue #1384: [SUPPORT] Hudi datastore missing updates for many records

2020-03-11 Thread GitBox
utk-spartan commented on issue #1384: [SUPPORT] Hudi datastore missing updates 
for many records
URL: https://github.com/apache/incubator-hudi/issues/1384#issuecomment-597750294
 
 
   Can this have some relation with 
https://issues.apache.org/jira/browse/HUDI-409 , as we recently encountered 
parquet corruption errors (magic numbers mismatch) while reading from presto on 
a fresh hudi table.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] [incubator-hudi] utk-spartan commented on issue #1384: [SUPPORT] Hudi datastore missing updates for many records

2020-03-11 Thread GitBox
utk-spartan commented on issue #1384: [SUPPORT] Hudi datastore missing updates 
for many records
URL: https://github.com/apache/incubator-hudi/issues/1384#issuecomment-597748228
 
 
   This is for COW tables, upon analyzing the data , missing record updates 
were below 0.01 % for old updated data but have recently increased to around 
20-30%.
   
   Can't find any failures in spark logs.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services