Re: Fwd: Extacting ALL Data using multiple java processes

2016-10-16 Thread Josh Elser
There should be a static setRanges(Configuration, Collection) method somewhere in the type hierarchy of AccumuloInputFormat which lets you specify the Range[s]. Not using the TimestampFilter (not being able to use the timestamp for this filtering), you have two options to perform

Re: Fwd: Extacting ALL Data using multiple java processes

2016-10-16 Thread Bob Cook
Josh, Thanks. I was able to get TimestampFilter to work for my needs. But I originally wanted "createdDate", since our application creates that date, which is known to the user and may differ from the Accumulo timestamp depending on when the data actually got processed into Accumulo. So if I wanted to

Re: Fwd: Extacting ALL Data using multiple java processes

2016-10-16 Thread Josh Elser
The TimestampFilter will return only the Keys whose timestamps fall in the range you specify. The timestamp is an attribute on every Key, a long value which, when not set by the client at write time, is the number of millis since the epoch. You specify the numeric range of timestamps you want.
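
To make the "numeric range of timestamps" concrete, here is a minimal plain-Java sketch of turning two calendar instants into epoch-millisecond bounds and testing a Key's timestamp against them. It does not call Accumulo at all: the class name TimestampWindow and its methods are hypothetical, and the inclusive-start / exclusive-end choice is an assumption of this sketch, not a statement about the filter's defaults. The two long values it computes are the kind of numbers you would hand to the filter's configuration.

```java
import java.time.Instant;

/** Hypothetical helper: models a window of epoch-millisecond timestamps. */
final class TimestampWindow {
    private final long startMillis; // inclusive lower bound (assumption of this sketch)
    private final long endMillis;   // exclusive upper bound (assumption of this sketch)

    TimestampWindow(Instant start, Instant end) {
        if (!start.isBefore(end)) {
            throw new IllegalArgumentException("start must be before end");
        }
        // Same unit as a default Key timestamp: milliseconds since the epoch.
        this.startMillis = start.toEpochMilli();
        this.endMillis = end.toEpochMilli();
    }

    long startMillis() { return startMillis; }

    long endMillis() { return endMillis; }

    /** True if a Key carrying this timestamp would fall inside the window. */
    boolean accepts(long keyTimestamp) {
        return keyTimestamp >= startMillis && keyTimestamp < endMillis;
    }

    public static void main(String[] args) {
        // One UTC day, chosen only as an example.
        TimestampWindow day = new TimestampWindow(
                Instant.parse("2016-10-15T00:00:00Z"),
                Instant.parse("2016-10-16T00:00:00Z"));
        System.out.println("start=" + day.startMillis() + " end=" + day.endMillis());
        long noon = Instant.parse("2016-10-15T12:00:00Z").toEpochMilli();
        System.out.println("noon accepted: " + day.accepts(noon));
    }
}
```

Note that this only works for data whose Key timestamps were left at their default or set in the same unit by the writer; an application-level date stored in a Value is a different matter.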

Fwd: Extacting ALL Data using multiple java processes

2016-10-15 Thread Bob Cook
All, I'm new to Accumulo and inherited this project to extract all data from Accumulo (assembled as a "document" by RowID) into another web service. So I started with SimpleReadClient.java to "scan" all data, and built a "document" based on the RowID, ColumnFamily and Value. Sending this