[ https://issues.apache.org/jira/browse/HADOOP-17977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
ASF GitHub Bot updated HADOOP-17977:
------------------------------------
Labels: committers easyfix easytask pull-request-available (was:
committers easyfix easytask)
> FileOutputCommitter Enable Concurrent Writes
> --------------------------------------------
>
> Key: HADOOP-17977
> URL: https://issues.apache.org/jira/browse/HADOOP-17977
> Project: Hadoop Common
> Issue Type: Improvement
> Reporter: ismail
> Priority: Major
> Labels: committers, easyfix, easytask, pull-request-available
> Time Spent: 10m
> Remaining Estimate: 0h
>
> Is it possible to make `{{PENDING_DIR_NAME}}` configurable?
> That would enable concurrent writes to the same location. Currently, if two
> Spark processes write to the same destination, one of them fails.
> Current:
> {code:java}
> public static final String PENDING_DIR_NAME = "_temporary";{code}
> Proposed:
> {code:java}
> PENDING_DIR_NAME = conf.get("mapreduce.fileoutputcommitter.pending.dir",
> "_temporary");{code}
> Here is a custom committer that does this:
> https://gist.github.com/ismailsimsek/33c55d8e1fcfc79160483c38a978edbd
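
The effect of the proposal can be illustrated with a small, Hadoop-free sketch. It assumes `java.util.Properties` as a stand-in for Hadoop's `Configuration`, and the class and method names (`PendingDirSketch`, `pendingDirName`, `pendingPath`) are hypothetical. The property key is the one proposed in the issue, not an existing Hadoop setting. The sketch only shows that two jobs configured with different pending directory names resolve to distinct staging paths under the same output directory; it is not the committer's actual implementation.

```java
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Properties;

public class PendingDirSketch {
    // Key proposed in the issue; assumed here, not an existing Hadoop property.
    static final String PENDING_DIR_KEY = "mapreduce.fileoutputcommitter.pending.dir";
    // The value currently hard-coded as PENDING_DIR_NAME.
    static final String DEFAULT_PENDING_DIR = "_temporary";

    // Properties stands in for org.apache.hadoop.conf.Configuration.
    static String pendingDirName(Properties conf) {
        return conf.getProperty(PENDING_DIR_KEY, DEFAULT_PENDING_DIR);
    }

    // Staging directory a job would use under the shared output directory.
    static Path pendingPath(Path outputDir, Properties conf) {
        return outputDir.resolve(pendingDirName(conf));
    }

    public static void main(String[] args) {
        Path out = Paths.get("data", "out");

        Properties jobA = new Properties();
        jobA.setProperty(PENDING_DIR_KEY, "_temporary_job_a");

        Properties jobB = new Properties();
        jobB.setProperty(PENDING_DIR_KEY, "_temporary_job_b");

        // Unset key: falls back to the current default.
        Properties legacy = new Properties();

        System.out.println(pendingPath(out, jobA));
        System.out.println(pendingPath(out, jobB));
        System.out.println(pendingPath(out, legacy));
    }
}
```

With distinct names per job, neither job's cleanup of its own pending directory would touch the other's staging files. A job that leaves the key unset keeps today's `_temporary` behavior.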