Hey Andrew
Do we have plans to include setMaxFileSize for the shell,thrift,rest?

So non java users can change this as needed with out having to learn java.

Billy

"Andrew Purtell" <[EMAIL PROTECTED]> wrote in message news:[EMAIL PROTECTED]
Hello David,

Current trunk (upcoming 0.2.0) has support for per-table metadata. See https://issues.apache.org/jira/browse/HBASE-42 and https://issues.apache.org/jira/browse/HBASE-62.

So maybe you can set the split threshold quite low for the table in question?

The default is 256MB (268435456), set globally for all tables in the HBase configuration as "hbase.hregion.max.filesize". However it's reasonable to set it as low as the DFS blocksize. The guidance for a typical HBase installation is to set the DFS blocksize to 8MB (8388608), instead of the default 64MB.

At create time:

 HTableDescriptor htd = new HTableDescriptor("foo");
 htd.setMaxFileSize(8388608);
 ...
 HBaseAdmin admin = new HBaseAdmin(hconf);
 admin.createTable(htd);

If the table already exists:

 HTable table = new HTable(hconf, "foo");
 admin.disableTable("foo");
 // make a read-write descriptor
 HTableDescriptor htd =
   new HTableDescriptor(table.getTableDescriptor());
 htd.setMaxFileSize(83388608);
 admin.modifyTableMeta("foo", htd);
 admin.enableTable("foo");

Hope this helps,

  - Andy

From: David Alves <[EMAIL PROTECTED]>
Subject: Region Splits
To: "[email protected]" <[email protected]>
Date: Thursday, July 31, 2008, 6:06 AM
[...]
I use hbase (amongst other things) to crawl some repos of infomation
and util now I've been using the Nutch segment generation paradigm.
I would very much like to skip the segment generation step using
hbase as source and sink directly but in order to do that I would
need to either allow more that one split to be generated for a
single region or make the regions in this particular table split
with much less entries than other tables.
[...]







Reply via email to