Deploy filter on per table basis

2014-09-09 Thread Jianshi Huang
Hi, According to the HBase definitive guide, I need to change hbase-env.sh and put my jars in HBase's classpath, then I also need to restart the HBase daemon to make my customized filters effective. In the Coprocessor loading section, it is also mentioned that a coprocessor can be set up and

Re: Deploy filter on per table basis

2014-09-09 Thread Ted Yu
Please take a look at HBASE-1936. Cheers On Mon, Sep 8, 2014 at 11:26 PM, Jianshi Huang jianshi.hu...@gmail.com wrote: Hi, According to the HBase definitive guide, I need to change hbase-env.sh and put my jars in HBase's classpath, then I also need to restart the HBase daemon to make

Updating an HBase KeyValue using bulk upload

2014-09-09 Thread Prakhar Srivastava
Hi, I have a MapReduce job which creates a StoreFile that I can load into HBase using LoadIncrementalHFiles. I am also using the timestamp component of the KeyValue in my mapper to maintain versions in a custom manner. But when I try to overwrite the same version using the bulk import, it is

Re: Deploy filter on per table basis

2014-09-09 Thread Jianshi Huang
Thanks Ted! Jianshi On Tue, Sep 9, 2014 at 10:39 PM, Ted Yu yuzhih...@gmail.com wrote: Please take a look at HBASE-1936. Cheers On Mon, Sep 8, 2014 at 11:26 PM, Jianshi Huang jianshi.hu...@gmail.com wrote: Hi, According to the HBase definitive guide, I need to change

Re: Deploy filter on per table basis

2014-09-09 Thread Ted Yu
Kudos go to Jimmy, not me. Cheers On Tue, Sep 9, 2014 at 8:17 AM, Jianshi Huang jianshi.hu...@gmail.com wrote: Thanks Ted! Jianshi On Tue, Sep 9, 2014 at 10:39 PM, Ted Yu yuzhih...@gmail.com wrote: Please take a look at HBASE-1936. Cheers On Mon, Sep 8, 2014 at 11:26 PM, Jianshi

HBase custom filter protocol buffers

2014-09-09 Thread Kevin
Hi, I'm making the switch from 0.92.1 to 0.98.1, and I'm in the process of updating all my custom filters to conform to the new HBase Filter API. I have quite a few custom filters, so my question is: Must I create a custom protocol buffer for each of my filters or can I reuse the custom logic

Re: HBase custom filter protocol buffers

2014-09-09 Thread Ted Yu
For each of your filters that carries custom information (limit, range, etc.), you need to create a corresponding protobuf entity. See hbase-protocol/src/main/protobuf/Filter.proto for examples. Cheers On Tue, Sep 9, 2014 at 12:55 PM, Kevin kevin.macksa...@gmail.com wrote: Hi, I'm making the

Re: Nested data structures examples for HBase

2014-09-09 Thread Michael Segel
You do realize that everything you store in HBase is a byte array, right? That is, each cell is a blob. So you have the ability to create nested structures like… JSON records? ;-) So to your point. You can have a column A which represents a set of values. This is one reason why you shouldn’t

Re: HBase - Performance issue

2014-09-09 Thread Michael Segel
So you have large RS and you have large regions. Your regions are huge relative to your RS memory heap. (Not ideal.) You have slow drives (5400rpm) and you have a 1GbE network. You didn’t say how many drives per server. Under load, you will saturate your network with just 4 drives. (Give or
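A rough back-of-the-envelope check of the network point above, as a sketch only: it converts a 1GbE link to megabytes per second and divides that across 4 drives. The function name is made up for illustration, decimal units are assumed, and protocol overhead is ignored.

```python
def per_drive_mb_per_s_to_saturate(link_gbit_per_s: float, drives: int) -> float:
    """Per-drive throughput (MB/s) at which the drives together fill the link.

    Assumes decimal units (1 Gbit = 1000 Mbit) and ignores protocol overhead.
    """
    link_mb_per_s = link_gbit_per_s * 1000 / 8  # 1 Gbit/s is 125 MB/s
    return link_mb_per_s / drives


# 1GbE shared by 4 drives
print(per_drive_mb_per_s_to_saturate(1.0, 4))  # 31.25
```

In other words, on 1GbE four drives each sustaining roughly 31 MB/s are already enough to fill the link.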

Re: One-table w/ multi-CF or multi-table w/ one-CF?

2014-09-09 Thread Michael Segel
Locality? Then the data should be in the same column family. That’s as local as you can get. I would suggest that you think of the following: What’s the predominant use case? How are you querying the data? If you’re always hitting multiple CFs to get the data… then you should have it in

Re: Nested data structures examples for HBase

2014-09-09 Thread Stephen Boesch
Thanks Michael, yes cells are byte[]; therefore, storing JSON or other document structures is always possible. Our use cases include querying individual elements in the structure - so that would require reconstituting the documents and then parsing them for every row. We probably are not headed
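To illustrate the trade-off discussed here, a minimal sketch with no HBase client involved: a nested document is serialized to JSON bytes (standing in for a cell value), and reading a single element requires decoding and parsing the whole blob. The helper names `encode_cell` and `read_field` are hypothetical.

```python
import json


def encode_cell(document: dict) -> bytes:
    """Serialize a nested document into the byte[] that would be stored in a cell."""
    return json.dumps(document, sort_keys=True).encode("utf-8")


def read_field(cell: bytes, path: list):
    """Fetch one nested element; note that the entire cell is parsed first."""
    node = json.loads(cell.decode("utf-8"))
    for key in path:
        node = node[key]  # dict key or list index
    return node


cell = encode_cell({"user": {"name": "a", "tags": ["x", "y"]}})
print(read_field(cell, ["user", "tags", 1]))  # y
```

Every lookup pays the full decode-and-parse cost per row, which is the overhead described above for querying individual elements.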

SKIP_FLUSH

2014-09-09 Thread Guangle Fan
Hi, does anybody know why I can't skip the flush when taking a snapshot? snapshot 'aaa', 'aaa_snapshot', {SKIP_FLUSH => true} NameError: uninitialized constant SKIP_FLUSH Without {SKIP_FLUSH => true}, the command works fine. Regards, Guangle

Re: SKIP_FLUSH

2014-09-09 Thread Matteo Bertozzi
which version are you using? Matteo On Tue, Sep 9, 2014 at 5:34 PM, Guangle Fan fanguan...@gmail.com wrote: Hi, does anybody know why I can't skip the flush when taking a snapshot? snapshot 'aaa', 'aaa_snapshot', {SKIP_FLUSH => true} NameError: uninitialized constant SKIP_FLUSH without

Re: SKIP_FLUSH

2014-09-09 Thread Ted Yu
Matteo is so fast :-) HBASE-10935 went into 0.98.4 FYI On Tue, Sep 9, 2014 at 5:35 PM, Matteo Bertozzi theo.berto...@gmail.com wrote: which version are you using? Matteo On Tue, Sep 9, 2014 at 5:34 PM, Guangle Fan fanguan...@gmail.com wrote: Hi, does anybody know why I can't skip the flush

Re: SKIP_FLUSH

2014-09-09 Thread Guangle Fan
That explains it. I'm on 0.96. On Tue, Sep 9, 2014 at 5:37 PM, Ted Yu yuzhih...@gmail.com wrote: Matteo is so fast :-) HBASE-10935 went into 0.98.4 FYI On Tue, Sep 9, 2014 at 5:35 PM, Matteo Bertozzi theo.berto...@gmail.com wrote: which version are you using? Matteo On Tue, Sep
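The version reasoning in this thread can be sketched as a small check. This is an illustration only: the function name is hypothetical, and it assumes that any release at or above 0.98.4 carries HBASE-10935, which the thread states only for 0.98.4 itself.

```python
import re


def supports_skip_flush(version: str) -> bool:
    """Return True if the version is at least 0.98.4 (where HBASE-10935 landed).

    Assumption: later release lines also include the change.
    """
    if version.startswith("."):
        version = "0" + version  # treat ".96" as "0.96"
    parts = tuple(int(p) for p in re.findall(r"\d+", version)[:3])
    parts = parts + (0,) * (3 - len(parts))  # pad "0.96" to (0, 96, 0)
    return parts >= (0, 98, 4)


print(supports_skip_flush("0.96"))    # False
print(supports_skip_flush("0.98.4"))  # True
```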

Re: need help understand log output

2014-09-09 Thread Qiang Tian
Out of curiosity, did you see the messages below in the RS log? LOG.warn("Snapshot called again without clearing previous. " + "Doing nothing. Another ongoing flush or did we fail last attempt?"); Thanks. On Tue, Sep 9, 2014 at 2:15 AM, Brian Jeltema brian.jelt...@digitalenvoy.net wrote:

Re: Nested data structures examples for HBase

2014-09-09 Thread Michael Segel
Are you just kicking the tires or do you want to roll up your sleeves and do some work? You have options. Secondary indexes. I don’t mean an inverted table but things like Solr, Lucene, Elasticsearch… The only downside is that depending on what you index, you can see an explosion in the