liucongjy commented on PR #12659: URL: https://github.com/apache/hudi/pull/12659#issuecomment-2606214565
> > Is the code you submitted modified based on Hudi 1.0? When I tested the submitted code on Hudi 0.15, I still got the same error as before.
>
> The main change is in the constructor of `org.apache.hudi.hadoop.avro.HoodieAvroParquetReader`, where the exception happens as you posted in the previous issue, and it's not newly introduced in 1.0. Can you verify that your case actually reaches this part?

The constructor of `HoodieAvroParquetReader` is indeed executed; I added a log statement there to confirm. The logged `internalSchemaOption` parameter is empty, so the test still hits the same exception as before. It seems that the `InternalSchema` returned by `TableSchemaResolver.getTableInternalSchemaFromCommitMetadata()` is empty, which suggests that the schema information was not retrieved from the commit files on the table's commit timeline. However, if data is written to the table again after the fields are added, a subsequent query no longer triggers the exception.
