[GitHub] [hudi] mithalee commented on issue #3336: [SUPPORT] Delete not functioning with deltastreamer

2021-08-09 Thread GitBox
mithalee commented on issue #3336: URL: https://github.com/apache/hudi/issues/3336#issuecomment-895405470 @nsivabalan @codope Hi, I found the issue. Spark on K8 3.1.1 has some bugs and throws the "Premature Content" error. I tried the Spark on K8 3.1.2 which is the latest release and the

[GitHub] [hudi] mithalee commented on issue #3336: [SUPPORT] Delete not functioning with deltastreamer

2021-08-05 Thread GitBox
mithalee commented on issue #3336: URL: https://github.com/apache/hudi/issues/3336#issuecomment-893711383 @nsivabalan @codope I came across this issue: https://issues.apache.org/jira/browse/HADOOP-17338 This may be the root cause of the error I am running into. I am running into

[GitHub] [hudi] mithalee commented on issue #3336: [SUPPORT] Delete not functioning with deltastreamer

2021-08-03 Thread GitBox
mithalee commented on issue #3336: URL: https://github.com/apache/hudi/issues/3336#issuecomment-892112742 @codope Hi, I did try the HoodieDeltaStreamer on below version of EMR: Release label:emr-6.3.0 Hadoop distribution:Amazon 3.2.1 Applications:Tez 0.9.2, Spark 3.1.1, Hive 3.1.2,

[GitHub] [hudi] mithalee commented on issue #3336: [SUPPORT] Delete not functioning with deltastreamer

2021-07-28 Thread GitBox
mithalee commented on issue #3336: URL: https://github.com/apache/hudi/issues/3336#issuecomment-888699402 > @mithalee Can you try with the latest master branch? I built the master code and tried to reproduce the scenario in a local docker environment. It runs fine. For example, after

[GitHub] [hudi] mithalee commented on issue #3336: [SUPPORT] Delete not functioning with deltastreamer

2021-07-26 Thread GitBox
mithalee commented on issue #3336: URL: https://github.com/apache/hudi/issues/3336#issuecomment-887202873 Does the source ordering config and precombine field has to be a timestamp field?

[GitHub] [hudi] mithalee commented on issue #3336: [SUPPORT] Delete not functioning with deltastreamer

2021-07-26 Thread GitBox
mithalee commented on issue #3336: URL: https://github.com/apache/hudi/issues/3336#issuecomment-887201790 I changed the input data set a bit so that I can provide 3 different fields for the 3 differnt mentioned configs. ./spark-submit --master