Ma Jian created HUDI-8001:
-----------------------------

             Summary: Insert overwrite failed due to missing 'path' property 
when using Spark 3.5.1 and Hudi 1.0.0
                 Key: HUDI-8001
                 URL: https://issues.apache.org/jira/browse/HUDI-8001
             Project: Apache Hudi
          Issue Type: Bug
            Reporter: Ma Jian


The issue with Spark 3.5.1 arises because the {{InsertIntoHoodieTableCommand}} 
chain calls the initialization of the {{HoodieFileIndex}} class. For v1 tables, 
the path is stored in 
{{{}CatalogTable#CatalogStorageFormat#storageProperties{}}}, but not in 
{{{}CatalogTable#properties{}}}. When Spark reloads the table, it removes the 
path key from {{{}CatalogTable#CatalogStorageFormat#storageProperties{}}}. 
Consequently, {{InsertIntoHoodieTableCommand}} in Hudi cannot retrieve the path 
from either {{CatalogTable#CatalogStorageFormat#storageProperties}} or 
{{CatalogTable#properties}} during {{{}deduceOverwriteConfig{}}}. This absence 
of the path key in {{combinedOpts}} leads to an error when initializing 
{{{}HoodieFileIndex{}}}.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to