alex-shchetkov commented on issue #4086: URL: https://github.com/apache/hudi/issues/4086#issuecomment-986937571
I have been running into the same issue with Hudi 0.8 and 0.9 on PySpark. This seems relatively critical, since it causes batches of data to be silently dropped. To see the actual error message (at least on EMR), you have to edit the `/etc/spark/conf/log4j.properties` file. EMR automatically adds `log4j.logger.org.apache.hudi=WARN` there, which keeps the errors from being printed.
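For anyone who wants to script the `log4j.properties` change, here is a minimal sketch of one way to do it. The helper name `set_hudi_log_level` is hypothetical, and `INFO` as the target level is an assumption: the comment above does not say which level makes the errors visible, so adjust it as needed.

```python
import re

# Logger key that EMR adds to /etc/spark/conf/log4j.properties (from the comment above).
HUDI_LOGGER = "log4j.logger.org.apache.hudi"


def set_hudi_log_level(properties_text: str, level: str = "INFO") -> str:
    """Return the properties text with the Hudi logger set to `level`.

    Hypothetical helper. The default level "INFO" is an assumption, not
    something confirmed in the issue thread.
    """
    # Match only the exact logger key, not child loggers such as
    # log4j.logger.org.apache.hudi.foo, and never span multiple lines.
    pattern = re.compile(
        r"^[ \t]*" + re.escape(HUDI_LOGGER) + r"[ \t]*=.*$",
        re.MULTILINE,
    )
    new_line = f"{HUDI_LOGGER}={level}"
    if pattern.search(properties_text):
        # Overwrite the existing entry in place.
        return pattern.sub(new_line, properties_text)
    # No entry yet: append one, adding a newline separator if needed.
    sep = "" if (not properties_text or properties_text.endswith("\n")) else "\n"
    return properties_text + sep + new_line + "\n"


# Example usage (commented out because it modifies a system file and
# typically needs root privileges):
#
# from pathlib import Path
# path = Path("/etc/spark/conf/log4j.properties")
# path.write_text(set_hudi_log_level(path.read_text(), "INFO"))
```

The change should only affect Spark applications started after the file is edited, so the job would need to be resubmitted to see the additional log output.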
