[ https://issues.apache.org/jira/browse/MAPREDUCE-7448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17763063#comment-17763063 ]
ASF GitHub Bot commented on MAPREDUCE-7448: ------------------------------------------- hadoop-yetus commented on PR #6038: URL: https://github.com/apache/hadoop/pull/6038#issuecomment-1711545185 :confetti_ball: **+1 overall** | Vote | Subsystem | Runtime | Logfile | Comment | |:----:|----------:|--------:|:--------:|:-------:| | +0 :ok: | reexec | 0m 29s | | Docker mode activated. | |||| _ Prechecks _ | | +1 :green_heart: | dupname | 0m 0s | | No case conflicting files found. | | +0 :ok: | codespell | 0m 0s | | codespell was not available. | | +0 :ok: | detsecrets | 0m 0s | | detect-secrets was not available. | | +0 :ok: | markdownlint | 0m 0s | | markdownlint was not available. | | +1 :green_heart: | @author | 0m 0s | | The patch does not contain any @author tags. | |||| _ trunk Compile Tests _ | | +1 :green_heart: | mvninstall | 34m 44s | | trunk passed | | +1 :green_heart: | mvnsite | 0m 34s | | trunk passed | | +1 :green_heart: | shadedclient | 57m 15s | | branch has no errors when building and testing our client artifacts. | |||| _ Patch Compile Tests _ | | +1 :green_heart: | mvninstall | 0m 25s | | the patch passed | | +1 :green_heart: | blanks | 0m 0s | | The patch has no blanks issues. | | +1 :green_heart: | mvnsite | 0m 27s | | the patch passed | | +1 :green_heart: | shadedclient | 27m 22s | | patch has no errors when building and testing our client artifacts. | |||| _ Other Tests _ | | +1 :green_heart: | asflicense | 0m 29s | | The patch does not generate ASF License warnings. | | | | 90m 8s | | | | Subsystem | Report/Notes | |----------:|:-------------| | Docker | ClientAPI=1.43 ServerAPI=1.43 base: https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-6038/1/artifact/out/Dockerfile | | GITHUB PR | https://github.com/apache/hadoop/pull/6038 | | Optional Tests | dupname asflicense mvnsite codespell detsecrets markdownlint | | uname | Linux cc6860143ecf 4.15.0-213-generic #224-Ubuntu SMP Mon Jun 19 13:30:12 UTC 2023 x86_64 x86_64 x86_64 GNU/Linux | | Build tool | maven | | Personality | dev-support/bin/hadoop.sh | | git revision | trunk / 74ccab5348357559c963a54be1e5fcbcd8c368a3 | | Max. process+thread count | 709 (vs. ulimit of 5500) | | modules | C: hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core U: hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core | | Console output | https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-6038/1/console | | versions | git=2.25.1 maven=3.6.3 | | Powered by | Apache Yetus 0.14.0 https://yetus.apache.org | This message was automatically generated. > Inconsistent Behavior for FileOutputCommitter V1 to commit successfully many > times > ---------------------------------------------------------------------------------- > > Key: MAPREDUCE-7448 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-7448 > Project: Hadoop Map/Reduce > Issue Type: Bug > Reporter: ConfX > Priority: Critical > Attachments: reproduce.sh > > > h2. What happened > I turned on {{mapreduce.fileoutputcommitter.cleanup.skipped=true}} and then > the version 1 of {{FileOutputCommitter}} can commit several times, which is > unexpected. > h2. Where's the problem > In {{{}FileOutputCommitter.commitJobInternal{}}}, > {noformat} > if (algorithmVersion == 1) { > for (FileStatus stat: getAllCommittedTaskPaths(context)) { > mergePaths(fs, stat, finalOutput, context); > } > } if (skipCleanup) { > LOG.info("Skip cleanup the _temporary folders under job's output " + > "directory in commitJob."); > ...{noformat} > Here if we skip cleanup, the _temporary folder would not be deleted and the > _SUCCESS file would also not be created, which cause the {{mergePaths}} next > time to not fail. > h2. How to reproduce > # set {{{}mapreduce.fileoutputcommitter.cleanup.skipped{}}}={{{}true{}}} > # run > {{org.apache.hadoop.mapred.TestFileOutputCommitter#testCommitterWithDuplicatedCommitV1}} > you should observe > {noformat} > java.lang.AssertionError: Duplicate commit successful: wrong behavior for > version 1. > at org.junit.Assert.fail(Assert.java:89) > at > org.apache.hadoop.mapred.TestFileOutputCommitter.testCommitterWithDuplicatedCommitInternal(TestFileOutputCommitter.java:295) > at > org.apache.hadoop.mapred.TestFileOutputCommitter.testCommitterWithDuplicatedCommitV1(TestFileOutputCommitter.java:269){noformat} > For an easy reproduction, run the reproduce.sh in the attachment. > We are happy to provide a patch if this issue is confirmed. -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org