[
https://issues.apache.org/jira/browse/MAPREDUCE-7366?focusedWorklogId=670082&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-670082
]
ASF GitHub Bot logged work on MAPREDUCE-7366:
---------------------------------------------
Author: ASF GitHub Bot
Created on: 26/Oct/21 13:55
Start Date: 26/Oct/21 13:55
Worklog Time Spent: 10m
Work Description: steveloughran commented on pull request #3582:
URL: https://github.com/apache/hadoop/pull/3582#issuecomment-951965390
one more thing. Simplest to leave everything under _temporary, but add a
switch telling the CleanupStage to not delete that dir, only the job attempt
dir and that of any previous attempts (MapReduce). Provided the spark version
correctly generates unique job IDs, the use of separate paths would be
automatic...you wouldn't need to change the prefix for every job.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
Issue Time Tracking
-------------------
Worklog Id: (was: 670082)
Time Spent: 1h 10m (was: 1h)
> FileOutputCommitter Enable Concurent Writes
> --------------------------------------------
>
> Key: MAPREDUCE-7366
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-7366
> Project: Hadoop Map/Reduce
> Issue Type: Improvement
> Components: mrv2
> Affects Versions: 3.3.1
> Reporter: ismail
> Priority: Major
> Labels: pull-request-available
> Time Spent: 1h 10m
> Remaining Estimate: 0h
>
> is it possible to make `{{PENDING_DIR_NAME}}` configurable?
> That will enable concurrent writes to same location. current if two spark
> processes write same destination one of them is failing.
> current
> {code:java}
> public static final String PENDING_DIR_NAME = "_temporary";{code}
> new:
> {code:java}
> PENDING_DIR_NAME = conf.get("mapreduce.fileoutputcommitter.pending.dir",
> "_temporary");{code}
> here is custom commiter doing it:
> https://gist.github.com/ismailsimsek/33c55d8e1fcfc79160483c38a978edbd
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]