On Monday, April 16, 2012 2:55:48 PM, "David Medinets" <[email protected]> wrote: > argh ... Just to be clear. The splits are essentially partitions of > the row id?
Yes, specified by the end of the range. > Can I add splits after the data is ingested? If so, how can I > redistribute? Yes. You can either add specific split points, or you can lower the split threshold based on the size of the table. For example, if the table size is S bytes, and you ideally want to have T tablets, then set the table's split threshold to S/T. These calculations are rarely exact, so I would start high on the split threshold, let it split out, see if the number of tablets is ok, then lower again if necessary. Billie > On Mon, Apr 16, 2012 at 2:45 PM, Eric Newton <[email protected]> > wrote: > > Create the table with splits, but this requires you to know > > something about > > the distribution of your data. > > > > -Eric > > > > > > On Mon, Apr 16, 2012 at 2:38 PM, David Medinets > > <[email protected]> > > wrote: > >> > >> Hopefully I am doing something wrong that can be easily rectified. > >> I > >> have an hadoop job that is sending well over 200M entries into > >> accumulo. But every entry is being sent to a single node. The table > >> was created by the hadoop job. > >> > >> How can I get the entries to be spread over several nodes? > > > >
