[
https://issues.apache.org/jira/browse/ACCUMULO-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13815670#comment-13815670
]
Josh Elser commented on ACCUMULO-1854:
--------------------------------------
I was talking to Christopher tonight about this. He did bring up the good point
about why not to use the AccumuloMultiTableInputFormat. One point we came to
was that making these changes would allow single M/R jobs to talk to separate
Accumulo clusters instead of a single cluster.
I did settle on a change that I'm not completely happy about that is reliant on
the fact that splits are generated by one host in serial. If they were
generated in parallel, my approach would break. However, given that the
InputFormata can't rely on getting the same Configuration object in each
invocation of getSplits, the only other reliable approach I could come up with
was to use something like HDFS which has its own sort of concurrency issues.
Since it's not an issue now, I've punted on worrying about it.
> Accumulo{Input,Output}Format can't handle multiple configurations
> -----------------------------------------------------------------
>
> Key: ACCUMULO-1854
> URL: https://issues.apache.org/jira/browse/ACCUMULO-1854
> Project: Accumulo
> Issue Type: Bug
> Affects Versions: 1.4.4, 1.5.0
> Reporter: Josh Elser
> Assignee: Josh Elser
> Fix For: 1.4.5, 1.5.1, 1.6.1
>
>
> I noticed that I was unable to properly use MultipleInputs (or any code which
> uses a similar approach) with the AccumuloInputFormat class because of the
> way it builds up information in the Configuration object.
> It would be useful to be able to have multiple instances of AIF (and AOF)
> configured within one Job (Configuration).
--
This message was sent by Atlassian JIRA
(v6.1#6144)