[ https://issues.apache.org/jira/browse/MAPREDUCE-7366?focusedWorklogId=670198&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-670198 ]
ASF GitHub Bot logged work on MAPREDUCE-7366: --------------------------------------------- Author: ASF GitHub Bot Created on: 26/Oct/21 16:17 Start Date: 26/Oct/21 16:17 Worklog Time Spent: 10m Work Description: steveloughran commented on pull request #3582: URL: https://github.com/apache/hadoop/pull/3582#issuecomment-952099811 (Avoids having to stop worrying about people creating pending dirs like "year=2021" and wondering why data is lost... I see this is actually a duplicate of MAPREDUCE-7331 & I had proposed fixing it, just not done it in the current pr -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org Issue Time Tracking ------------------- Worklog Id: (was: 670198) Time Spent: 1.5h (was: 1h 20m) > FileOutputCommitter Enable Concurent Writes > -------------------------------------------- > > Key: MAPREDUCE-7366 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-7366 > Project: Hadoop Map/Reduce > Issue Type: Improvement > Components: mrv2 > Affects Versions: 3.3.1 > Reporter: ismail > Priority: Major > Labels: pull-request-available > Time Spent: 1.5h > Remaining Estimate: 0h > > is it possible to make `{{PENDING_DIR_NAME}}` configurable? > That will enable concurrent writes to same location. current if two spark > processes write same destination one of them is failing. > current > {code:java} > public static final String PENDING_DIR_NAME = "_temporary";{code} > new: > {code:java} > PENDING_DIR_NAME = conf.get("mapreduce.fileoutputcommitter.pending.dir", > "_temporary");{code} > here is custom commiter doing it: > https://gist.github.com/ismailsimsek/33c55d8e1fcfc79160483c38a978edbd -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org