[ 
https://issues.apache.org/jira/browse/ACCUMULO-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Josh Elser updated ACCUMULO-1854:
---------------------------------

    Description: 
The AccumuloInputFormat required that the same exact Configuration object which 
was provided to its static configuration methods is also provided to its 
getSplits() and createRecordReader() methods.

In practice, some tools, like MultipleInputs or Pig, don't guarantee that the 
same Configuration object is provided to getSplits() and createRecordReader(). 
This tends to be a common method of interaction to ensure that multiple 
InputFormats don't collide with one another in the same Configuration object 
(e.g. FileInputFormat). By serialization the necessary information from the 
Configuration regarding how to query Accumulo into the InputSplit 
AccumuloInputFormat creates, we can alleviate this requirement and make client 
interaction a little more standard.

  was:
I noticed that I was unable to properly use MultipleInputs (or any code which 
uses a similar approach) with the AccumuloInputFormat class because of the way 
it builds up information in the Configuration object.

It would be useful to be able to have multiple instances of AIF (and AOF) 
configured within one Job (Configuration).


> AccumuloInputFormat can be less strict about contents of Configuration
> ----------------------------------------------------------------------
>
>                 Key: ACCUMULO-1854
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-1854
>             Project: Accumulo
>          Issue Type: Bug
>    Affects Versions: 1.4.4, 1.5.0
>            Reporter: Josh Elser
>            Assignee: Josh Elser
>             Fix For: 1.4.5, 1.5.1, 1.6.0
>
>
> The AccumuloInputFormat required that the same exact Configuration object 
> which was provided to its static configuration methods is also provided to 
> its getSplits() and createRecordReader() methods.
> In practice, some tools, like MultipleInputs or Pig, don't guarantee that the 
> same Configuration object is provided to getSplits() and 
> createRecordReader(). This tends to be a common method of interaction to 
> ensure that multiple InputFormats don't collide with one another in the same 
> Configuration object (e.g. FileInputFormat). By serialization the necessary 
> information from the Configuration regarding how to query Accumulo into the 
> InputSplit AccumuloInputFormat creates, we can alleviate this requirement and 
> make client interaction a little more standard.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to