Dear Wiki user, You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.
The following page has been changed by stack: http://wiki.apache.org/hadoop/Hbase/NewFileFormat ------------------------------------------------------------------------------ * Always-on General bloomfilter. We know how many entries a file will have when we go to flush it so we can optimally size a bloomfilter. The small amount of memory a bloomfilter occupies will pay for itself many-fold in the seeks saved trying to figure is a file contains an asked for key. * Optimal random-access * Iterate over keys only, rather than mapfiles currenty key+values always. This'd be useful when trying to find closest. TFile and SequenceFile can do this (Its not exposed in !MapFile). + * Smart getClosest and getClosestAtOrBefore [https://issues.apache.org/jira/browse/HBASE-792 hbase-792] === Index === TODO, but the TFile block-based rather than !MapFile interval-based would seem better for us; indices then are of predicatable size; a seek to the index position will load at an amenable spot when blocks are compressed. === Nice-to-haves === - * Don't write out the family portion of column when writing keys. + * Don't write out the family portion of column when writing keys [https://issues.apache.org/jira/browse/HBASE-68 HBASE-68] === Excercise ===
