Re: Using AccumuloOutputFormat, All Records Stored In One Tablet (Node)

David Medinets Mon, 16 Apr 2012 12:01:35 -0700

I'll ask another basic question. The row id values are stored as
strings. So "1" and "1111" are sorted together. Let's say that I have
five nodes. Would I run this?


addsplits 2 4 6 8 -t table
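
To illustrate what those four split points would do, here is a minimal sketch in plain Python (not the Accumulo API). It assumes each tablet covers the range (previous split, split], with the split point itself as the inclusive end row, and that row ids are ASCII strings, so Python's string comparison matches the byte-wise ordering. The sample row ids and the helper name tablet_index are made up for illustration.

```python
from bisect import bisect_left
from collections import Counter

# Split points from the addsplits command above; the list must stay sorted.
SPLITS = ["2", "4", "6", "8"]


def tablet_index(row_id, splits=SPLITS):
    """Return the index of the tablet whose range (prev_split, split] holds row_id.

    Comparison is lexicographic on strings, not numeric, so "1111" sorts
    before "2" and lands in the same tablet as "1".
    """
    return bisect_left(splits, row_id)


# Hypothetical row ids, chosen only to show the string ordering.
rows = ["1", "1111", "2", "20", "35", "5", "77", "8", "9", "9999"]

counts = Counter(tablet_index(r) for r in rows)
for i in range(len(SPLITS) + 1):
    print(f"tablet {i}: {counts[i]} rows")
```

Four split points give five tablets, which matches the five nodes. Whether the rows end up spread evenly depends on how the leading characters of the row ids are distributed, which is the point about knowing the distribution of the data.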

On Mon, Apr 16, 2012 at 2:55 PM, David Medinets
<[email protected]> wrote:
> argh ... Just to be clear. The splits are essentially partitions of the
> row id?
>
> Can I add splits after the data is ingested? If so, how can I redistribute?
>
> On Mon, Apr 16, 2012 at 2:45 PM, Eric Newton <[email protected]> wrote:
>> Create the table with splits, but this requires you to know something about
>> the distribution of your data.
>>
>> -Eric
>>
>>
>> On Mon, Apr 16, 2012 at 2:38 PM, David Medinets <[email protected]>
>> wrote:
>>>
>>> Hopefully I am doing something wrong that can be easily rectified. I
>>> have an hadoop job that is sending well over 200M entries into
>>> accumulo. But every entry is being sent to a single node. The table
>>> was created by the hadoop job.
>>>
>>> How can I get the entries to be spread over several nodes?
>>
>>
