Prashant Wason created HUDI-2013:
------------------------------------

             Summary: Fallback to file listing may lead to data loss
                 Key: HUDI-2013
                 URL: https://issues.apache.org/jira/browse/HUDI-2013
             Project: Apache Hudi
          Issue Type: Bug
            Reporter: Prashant Wason
            Assignee: Prashant Wason


When fallback to file listing mode is enabled (hoodie.metadata.fallback.enable, 
default is true), the normal file-system listing is used whenever listing from 
the metadata table throws an exception.
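
The fallback behaviour described above can be sketched roughly as follows. This is a minimal illustration, not Hudi's actual code: the class FallbackListingSketch, the method listFiles, and the fallbackEnabled parameter (standing in for hoodie.metadata.fallback.enable) are hypothetical names chosen for this example.

```java
import java.util.List;
import java.util.function.Supplier;

// Hypothetical sketch: none of these names come from the Hudi code base.
class FallbackListingSketch {

    // Tries the metadata-table listing first. If it throws and fallback is
    // enabled, the error is only logged and the file-system listing is used.
    static List<String> listFiles(Supplier<List<String>> metadataListing,
                                  Supplier<List<String>> fileSystemListing,
                                  boolean fallbackEnabled) {
        try {
            return metadataListing.get();
        } catch (RuntimeException e) {
            if (!fallbackEnabled) {
                // With fallback disabled the caller sees the failure.
                throw e;
            }
            // With fallback enabled the failure is visible only in the log.
            System.err.println("Metadata listing failed, falling back: " + e.getMessage());
            return fileSystemListing.get();
        }
    }

    public static void main(String[] args) {
        Supplier<List<String>> brokenMetadata = () -> {
            throw new IllegalStateException("metadata table inconsistent");
        };
        Supplier<List<String>> fileSystem = () -> List.of("f1.parquet", "f2.parquet");

        // The caller gets a listing and never learns that the metadata table failed.
        System.out.println(listFiles(brokenMetadata, fileSystem, true));
    }
}
```

In this sketch, passing fallbackEnabled = false makes the same failure propagate to the caller instead of being swallowed.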

Metadata table listing may fail because the table is inconsistent or because of bugs. 
Falling back to file listing has the following downsides:
 # It masks the issue as the commit does not fail (only an exception is logged).
 # By the time the issue is discovered, logs may have been lost.
 # There is no guarantee that all the commits wrote/updated the correct files.

Since listing from the metadata table is per-partition, the issue is further 
complicated when listing succeeds for some partitions (file list retrieved from 
the metadata table) and fails for others (file list retrieved from the 
file system), so a single operation can mix both sources.
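
To make the mixed-source case concrete, here is a small sketch under stated assumptions. The class MixedSourceListingSketch, the Source enum, and the methods resolveSources and isMixed are hypothetical and exist only for illustration; they are not part of Hudi. The sketch records which source would serve each partition when the metadata lookup fails for only some of them.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch: none of these names come from the Hudi code base.
class MixedSourceListingSketch {

    enum Source { METADATA_TABLE, FILE_SYSTEM }

    // For each partition, records which source would serve its file list
    // when the metadata lookup fails only for the given partitions.
    static Map<String, Source> resolveSources(List<String> partitions,
                                              Set<String> failingPartitions) {
        Map<String, Source> sources = new LinkedHashMap<>();
        for (String partition : partitions) {
            Source source = failingPartitions.contains(partition)
                    ? Source.FILE_SYSTEM      // fallback path
                    : Source.METADATA_TABLE;  // normal path
            sources.put(partition, source);
        }
        return sources;
    }

    // True when a single operation ends up reading from both sources.
    static boolean isMixed(Map<String, Source> sources) {
        return sources.values().stream().distinct().count() > 1;
    }

    public static void main(String[] args) {
        Map<String, Source> sources = resolveSources(
                List.of("2021/06/01", "2021/06/02", "2021/06/03"),
                Set.of("2021/06/02"));
        System.out.println(sources);
        System.out.println("mixed sources: " + isMixed(sources));
    }
}
```

A check like isMixed is one possible way to surface the condition; whether failing the operation or only reporting it is appropriate is left open here.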

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
