On Jan. 11, 2014, 1:56 a.m., Josh Elser wrote:
> > That much better, thanks.
> >
> > Caster is added. But for AccumuloBinaryConverter, we shall retain the data
> > in binary style. Eg, toBytes(Integer), we shall not convert the integer
> > into String, then get bytes from string, but retain in 4 bytes integer
> > style (like Bytes.toBytes(Integer) in hbase, not sure what's best in
> > Accumulo). Also in addition to specify caster in construct parameter,
> > better to have a configuration entry for that. I assume user prefer
> > BinaryConverter will always use BinaryConverter, they don't want to specify
> > the option every time.
> >
> > I still feel the syntax for AccumuloStorage is not straightforward. Can
> > user use the following style (similar to HBaseStorage)?
> >
> > a = load 'accumulo://....' using AccumuloStorage('info:age info2:address',
> > 'other options');
> > -- produce a 3 item tuple (key, info:age, info2:address). info/info2 is
> > colFam, age/address is colQual
> > -- user might optionally specify info:*, which will produce a map including
> > every colQual in this colFam
> > -- I see aggregate flag in the code, but I cannot think of much use case
> > that will be useful, how do you feel?
> >
> > store x into 'accumulo://....' using AccumuloStorage('info:age
> > info2:address', 'other options');
> > -- input tuple contains 3 items: (key, info:age, info2:address)
> > -- mirror what's in input side, user can pass a map instead, then he needs
> > to use wild card in construct AccumuloStorage('info:*');
>
> Josh Elser wrote:
> I was planning to have a new patch uploaded tonight, but I just found a
> bug so I'll fix that up tmrw and should get a new patch early in the day.
> Thanks for the push to mimic HBaseStorage's columns; I think it did clean up
> things from a usage perspective.
>
> I also added a few things over what HBaseStorage currently does, mainly
> because Accumulo tables can have any number of column families without
> altering the table. I'll go into details when I post the patch.
>
> re: configuration entry for specific caster, don't lines ~180-184 in
> AbstractAccumuloStorage address a specific caster without needing to
> configure it for every AccumuloStorage invocation?
>
> re: AccumuloBinaryConverter, yeah, I was being lazy. I'll write something
> that doesn't rely on String to serialize the numerics.
>
>
>
Oh yes, I miss 180-184 in AbstractAccumuloStorage, caster property is good.
- Daniel
-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/16533/#review31537
-----------------------------------------------------------
On Jan. 10, 2014, 7:20 p.m., Josh Elser wrote:
>
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/16533/
> -----------------------------------------------------------
>
> (Updated Jan. 10, 2014, 7:20 p.m.)
>
>
> Review request for pig.
>
>
> Bugs: PIG-3573
> https://issues.apache.org/jira/browse/PIG-3573
>
>
> Repository: pig-git
>
>
> Description
> -------
>
> Provides basic StoreFunc and LoadFunc implementations. Based off of code that
> was in an Accumulo contrib project.
>
>
> Diffs
> -----
>
> build.xml 575c9ae
> ivy.xml 180eb2c
> ivy/libraries.properties 14abdf8
> src/org/apache/pig/backend/hadoop/accumulo/AbstractAccumuloStorage.java
> PRE-CREATION
> src/org/apache/pig/backend/hadoop/accumulo/AccumuloBinaryConverter.java
> PRE-CREATION
> src/org/apache/pig/backend/hadoop/accumulo/AccumuloStorage.java
> PRE-CREATION
> src/org/apache/pig/backend/hadoop/accumulo/AccumuloStorageOptions.java
> PRE-CREATION
> src/org/apache/pig/backend/hadoop/accumulo/FixedByteArrayOutputStream.java
> PRE-CREATION
> src/org/apache/pig/backend/hadoop/accumulo/Utils.java PRE-CREATION
>
> test/org/apache/pig/backend/hadoop/accumulo/AbstractAccumuloStorageTest.java
> PRE-CREATION
> test/org/apache/pig/backend/hadoop/accumulo/AccumuloPigClusterTest.java
> PRE-CREATION
>
> test/org/apache/pig/backend/hadoop/accumulo/AccumuloStorageConfigurationTest.java
> PRE-CREATION
> test/org/apache/pig/backend/hadoop/accumulo/AccumuloStorageOptionsTest.java
> PRE-CREATION
> test/org/apache/pig/backend/hadoop/accumulo/AccumuloStorageTest.java
> PRE-CREATION
>
> Diff: https://reviews.apache.org/r/16533/diff/
>
>
> Testing
> -------
>
> Local tests reading, writing and JOIN'ing Accumulo tables. Tested against
> Hadoop-1.0.4 and 2.2.0, with Accumulo 1.5.0
>
>
> Thanks,
>
> Josh Elser
>
>