[
https://issues.apache.org/jira/browse/SPARK-29037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17102837#comment-17102837
]
Afroz Baig commented on SPARK-29037:
spark.hadoop.mapreduce.fileoutputcommitter.algorithm.version=2
[
https://issues.apache.org/jira/browse/SPARK-29037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17095434#comment-17095434
]
t oo commented on SPARK-29037:
--
with spark 2.3.4 and hadoop 2.8.5: i am facing this doing simple Overwrite
[
https://issues.apache.org/jira/browse/SPARK-29037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16929916#comment-16929916
]
feiwang commented on SPARK-29037:
-
[~advancedxy] Hi, I found that even with dynamicPartitionOverwrite,
[
https://issues.apache.org/jira/browse/SPARK-29037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16928674#comment-16928674
]
feiwang commented on SPARK-29037:
-
[~advancedxy]
I just checked the code, as shown below.
[
https://issues.apache.org/jira/browse/SPARK-29037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16928656#comment-16928656
]
feiwang commented on SPARK-29037:
-
In detail, I think we need change the logic of
[
https://issues.apache.org/jira/browse/SPARK-29037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16928650#comment-16928650
]
Xianjin YE commented on SPARK-29037:
> About output check, I think it is not appropriate, because
[
https://issues.apache.org/jira/browse/SPARK-29037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16928598#comment-16928598
]
feiwang commented on SPARK-29037:
-
The implementation of InsertIntoHiveTable prevent reuse same
[
https://issues.apache.org/jira/browse/SPARK-29037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16928596#comment-16928596
]
feiwang commented on SPARK-29037:
-
[~advancedxy]
1. We re-submit the same application again.
We meet
[
https://issues.apache.org/jira/browse/SPARK-29037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16928561#comment-16928561
]
Xianjin YE commented on SPARK-29037:
[~hzfeiwang] by rerun the application, do you mean re-submit
[
https://issues.apache.org/jira/browse/SPARK-29037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16928503#comment-16928503
]
Wenchen Fan commented on SPARK-29037:
-
[~advancedxy] can you take a look?
> [Core] Spark gives
[
https://issues.apache.org/jira/browse/SPARK-29037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16928394#comment-16928394
]
feiwang commented on SPARK-29037:
-
But for the version 2, it may produce partial result when we kill an
[
https://issues.apache.org/jira/browse/SPARK-29037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16928375#comment-16928375
]
feiwang commented on SPARK-29037:
-
If we set
[
https://issues.apache.org/jira/browse/SPARK-29037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16928180#comment-16928180
]
feiwang commented on SPARK-29037:
-
[~cloud_fan]
> [Core] Spark gives duplicate result when an
[
https://issues.apache.org/jira/browse/SPARK-29037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16928170#comment-16928170
]
feiwang commented on SPARK-29037:
-
If we have several applications, which insert overwrite a partition
[
https://issues.apache.org/jira/browse/SPARK-29037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16928168#comment-16928168
]
feiwang commented on SPARK-29037:
-
This committedTaskPath is hard coded in FileOutputCommitter class.
>
[
https://issues.apache.org/jira/browse/SPARK-29037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16928165#comment-16928165
]
feiwang commented on SPARK-29037:
-
This is the unit test log.
!screenshot-1.png!
We can see that, the
16 matches
Mail list logo