Re: what's the roadmap of secondary index of hbase?

Lars George Mon, 28 Feb 2011 22:09:10 -0800

Hi,

What is the reason not to use JDs replication WLA tracking (using ZK)
and add another "scope" like attribute to the coldefs to define the
indexing? Just curious because when I read the questions on the Wiki
it mentions issues that JD I think had to solve already, being "here
did we left off?", "no aggressive reapplying edits" etc.


Lars

On Tue, Mar 1, 2011 at 6:24 AM, Dhruba Borthakur <dhr...@gmail.com> wrote:
> Hi Jonathan,
>
> Nice wiki. You mention that
>> 2. Generate a new, special kind of WALEdit for secondary table update
>
> is it possible to store this secondary-wal-edits as the contents on another
> hbase table(say indexTable)? The advantage is that then this can be
> implemented as a pure layer wrapping the regionserver (via co-processors and
> such). The disadvantage is that a non-remote log (for indexTable) might need
> to be updated. Is there a easy way to enhance hbase to co-locate regions on
> the same machine?
>
> thanks,
> dhruba
>
> On Mon, Feb 28, 2011 at 12:05 PM, Jonathan Gray <jg...@fb.com> wrote:
>
>> I've started a wiki page:
>> http://wiki.apache.org/hadoop/Hbase/SecondaryIndexing
>>
>> I gave a basic description of the idea I had and the open questions.
>>
>> Let's get all our thoughts in there.
>>
>> > -----Original Message-----
>> > From: saint....@gmail.com [mailto:saint....@gmail.com] On Behalf Of
>> Stack
>> > Sent: Friday, February 25, 2011 4:07 PM
>> > To: dev@hbase.apache.org
>> > Subject: Re: what's the roadmap of secondary index of hbase?
>> >
>> > The MegaStore paper,
>> > http://www.cidrdb.org/cidr2011/Papers/CIDR11_Paper32.pdf, in section
>> > 3.2.2, lists secondary indexing options MegaStore provides on top of
>> > BigTable.  For example, MS allows specifying secondary index on protobuf
>> > cell content or duplicating data into secondary index so you have the
>> data to
>> > hand to satisfy first query and only if the client wants more do you go
>> dig in
>> > the primary table.  It also talks about how secondary indices can be
>> described
>> > using their schema which might be of use.  Might be worth a gander.
>> >
>> > St.Ack
>> >
>> > On Fri, Feb 25, 2011 at 3:32 PM, Stack <st...@duboce.net> wrote:
>> > > On Fri, Feb 25, 2011 at 1:47 PM, Eugene Koontz <ekoo...@hiro-tan.org>
>> > wrote:
>> > >> I'm thinking that we could use a coprocessor that watches the
>> > >> Write-Ahead Log (using the WAL-edit operations
>> > >>  https://issues.apache.org/jira/browse/HBASE-3257 "Coprocessors:
>> > >> Extend server side integration API to include HLog operations"). This
>> > >> coprocessor would write these edits, perhaps filtering or
>> > >> transforming them, and enqueing the results in a global queue. A
>> > >> separate process would be responsible for pulling operations off the
>> > >> queue and using HBase client operations to do the insert into a
>> > >> secondary index table appropriate for that operation.
>> > >>    Perhaps we could use some of the work that the Lily people have
>> > >> done with HBase indexing (see
>> > >> http://www.lilyproject.org/lily/about/playground/hbaseindexes.html)
>> > >> in order to do the edit->hbase operation transformations and the
>> > >> secondary index table creation.
>> > >
>> > > This sounds good as first approach (including lily part).
>> > >
>> > > St.Ack
>> > >
>>
>
>
>
> --
> Connect to me at http://www.facebook.com/dhruba
>

Re: what's the roadmap of secondary index of hbase?

Reply via email to