bhat-vinay commented on issue #10110:
URL: https://github.com/apache/hudi/issues/10110#issuecomment-2025074505
Thanks for trying all the suggestions. I am unable to reproduce this in my
environment. I will spend some time next week to setup the exact
environment/versions you are using. In the mean time, would it be possible for
you to run your tests against the latest master branch of Hudi and reproduce
this? From the stack-trace below, I am not sure why the index build process is
stripping the full path of the data file in your environment.
```
24/03/28 07:36:06 WARN ScheduleIndexActionExecutor: Following partitions
already exist or inflight: [files]. Going to schedule indexing of only these
partitions: [func_index_]
24/03/28 07:36:06 ERROR HoodieBackedTableMetadataWriter: Bootstrap on
func_index_ partition failed for
file:/Users/soumilshah/Desktop/hudidemo/.hoodie/metadata
org.apache.spark.sql.AnalysisException: [PATH_NOT_FOUND] Path does not
exist:
file:/2c1b72c3-5de3-4f19-b7dc-e67b3a345c53-0_5-25-0_20240328073553477.parquet.
at
org.apache.spark.sql.errors.QueryCompilationErrors$.dataPathNotExistError(QueryCompilationErrors.scala:1419)
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]