zaminhassnain06 opened a new issue, #5887: URL: https://github.com/apache/hudi/issues/5887
I'm running Hudi and creating an external table from files on S3. Inserting into this table fails with the exception org.apache.hudi.exception.HoodieException: The value of <field> not found in record. It was working fine until yesterday. I have checked all the data in the file, and the field is present. The <field> is lastModifiedDate, and it is the precombine key.

I am creating the cluster from an Airflow DAG. When I ran the code manually from a Jupyter notebook, it processed the file successfully. The error occurs only when the job runs from the Airflow DAG.

Following is the EMR error:

py4j.protocol.Py4JJavaError: An error occurred while calling o103.save.
: org.apache.spark.SparkException: Job aborted due to stage failure: Task 0 in stage 0.0 failed 4 times, most recent failure: org.apache.hudi.exception.HoodieException: LastModifiedDate(Part -LastModifiedDate) field not found in record. Acceptable fields were :
