rangareddy commented on issue #11708: URL: https://github.com/apache/hudi/issues/11708#issuecomment-2360765509
@mzheng-plaid According to the environment details you provided, you are running **EMR 6.10.1**, which supports **Hudi version 0.12.2** [1]. **Hudi 0.12.2** is compatible with **Parquet version 1.10.1** [2]. However, the `ColumnReaderBase` class is _not available_ in **Parquet 1.10.1** [3], so your application may be picking up a different version of Parquet, which could be contributing to the issue.

As an alternative, you can try **Open Source Hudi** instead of the EMR Hudi version and see whether the issue still reproduces.

A similar issue was also reported on the Parquet side, [PARQUET-246](https://issues.apache.org/jira/browse/PARQUET-246), which was resolved in version **1.8.0**.

**Reference(s):**

1. https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-6101-release.html
2. https://github.com/apache/hudi/blob/release-0.12.2/pom.xml#L105C5-L105C46
3. https://github.com/apache/parquet-java/tree/apache-parquet-1.10.1/parquet-column/src/main/java/org/apache/parquet/column/impl
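
One way to check which Parquet jar is actually on the classpath is to ask the JVM where it loaded `ColumnReaderBase` from. The following is only a minimal sketch: `ClassOrigin` and `locate` are hypothetical names made up for illustration, and the fully qualified class name is assumed from the package path in reference [3]. It would need to run with the same classpath as the failing job for the result to be meaningful.

```java
import java.security.CodeSource;

// Hypothetical helper (not part of Hudi or Parquet): reports where a class was loaded from.
public class ClassOrigin {

    static String locate(String className) {
        try {
            // Load without initializing, so no static initializers run.
            Class<?> cls = Class.forName(className, false, ClassOrigin.class.getClassLoader());
            CodeSource source = cls.getProtectionDomain().getCodeSource();
            // JDK classes loaded by the bootstrap loader have no code source.
            return source == null
                    ? "bootstrap/platform class loader"
                    : source.getLocation().toString();
        } catch (ClassNotFoundException | LinkageError e) {
            return "not on classpath";
        }
    }

    public static void main(String[] args) {
        // Assumed class name, based on the package path in reference [3].
        String name = "org.apache.parquet.column.impl.ColumnReaderBase";
        System.out.println(name + " -> " + locate(name));
    }
}
```

If this prints a jar path, the version in that jar's file name shows which Parquet release is being picked up. If it prints `not on classpath`, the class is not visible to that class loader.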
