[ 
https://issues.apache.org/jira/browse/HADOOP-17977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HADOOP-17977:
------------------------------------
    Labels: committers easyfix easytask pull-request-available  (was: 
committers easyfix easytask)

> FileOutputCommitter Enable Concurent Writes 
> --------------------------------------------
>
>                 Key: HADOOP-17977
>                 URL: https://issues.apache.org/jira/browse/HADOOP-17977
>             Project: Hadoop Common
>          Issue Type: Improvement
>            Reporter: ismail
>            Priority: Major
>              Labels: committers, easyfix, easytask, pull-request-available
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> is it possible to make `{{PENDING_DIR_NAME}}` configurable? 
> That will enable concurrent writes to same location. current if two spark 
> processes write same destination one of them is failing.
> current
> {code:java}
>  public static final String PENDING_DIR_NAME = "_temporary";{code}
> new:
> {code:java}
> PENDING_DIR_NAME = conf.get("mapreduce.fileoutputcommitter.pending.dir", 
> "_temporary");{code}
> here is custom commiter doing it: 
> https://gist.github.com/ismailsimsek/33c55d8e1fcfc79160483c38a978edbd



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to