glory9211 commented on issue #6194: URL: https://github.com/apache/hudi/issues/6194#issuecomment-1198346968
> @KnightChess I added `spark.sql.parquet.mergeSchema true` to the spark properties file, then reconnected to the hudi-cli and re-tried the repair command again. The result was the same: > > org.apache.spark.sql.AnalysisException: cannot resolve '_hoodie_record_key' given input columns: [] > > I am pretty confident that there is no schema diff in the data files. Can you verify if the duplicates are in the same file? hoodie_file column has the same value or different value -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
