Endpoint coprocessor performance issue

2016-10-20 Thread Arulanand
.679495.n3.nabble.com/Endpoint-coprocessor-performance-issue-tp4083614.html Sent from the HBase User mailing list archive at Nabble.com.

Re: Not able to Load EndPoint Coprocessor

2015-05-08 Thread Ted Yu
/TestAggregateProtocol.java Cheers On Thu, May 7, 2015 at 11:27 PM, Gupta, Kunal IN BLR STS < kunal.gu...@siemens.com> wrote: > Hi, > > I want to implement EndPoint Coprocessor. From the link ( > http://www.3pillarglobal.com/insights/hbase-coprocessors) I created my > own files as

Not able to Load EndPoint Coprocessor

2015-05-07 Thread Gupta, Kunal IN BLR STS
Hi, I want to implement EndPoint Coprocessor. From the link (http://www.3pillarglobal.com/insights/hbase-coprocessors) I created my own files as specified in the link. In the link they worked on Sum but I want to have Average along with Filters so I did according to what they have specified

RE: endpoint coprocessor

2014-04-15 Thread Bogala, Chandra Reddy
t; 'cf' Thanks, Chandra -Original Message- From: Ted Yu [mailto:yuzhih...@gmail.com] Sent: Thursday, April 10, 2014 5:36 PM To: user@hbase.apache.org Cc: user@hbase.apache.org Subject: Re: endpoint coprocessor Here is a reference implementation for aggregation : h

Re: endpoint coprocessor

2014-04-13 Thread anil gupta
an think of Endpoint Coprocessor as stored procedures in SQL DBs. > In order to use an Endpoint Coprocessor you need to build and install it on > the server side and then it invoke through HBase RPC. > > You can make use of > https://blogs.apache.org/hbase/entry/coprocessor_introduction

Re: endpoint coprocessor

2014-04-13 Thread Rabbit's Foot
You can think of Endpoint Coprocessor as stored procedures in SQL DBs. In order to use an Endpoint Coprocessor you need to build and install it on the server side and then it invoke through HBase RPC. You can make use of https://blogs.apache.org/hbase/entry/coprocessor_introduction. 2014-04-14

Re: endpoint coprocessor

2014-04-13 Thread Ted Yu
se classes and calling mechanism not available from hbase > shell. Then planning to use java client code to invoke coprocessor. > > Any pointers to java client to invoke aggregation< > http://search-hadoop.com/c/HBase:hbase-server/src/main/java/org/apache/hadoop/hbase/coprocessor/Aggr

RE: endpoint coprocessor

2014-04-11 Thread Bogala, Chandra Reddy
from hbase shell. Then planning to use java client code to invoke coprocessor. Any pointers to java client to invoke aggregation<http://search-hadoop.com/c/HBase:hbase-server/src/main/java/org/apache/hadoop/hbase/coprocessor/AggregateImplementation.java||Hbase+aggregation+endpoint> coprocessor will

Re: endpoint coprocessor

2014-04-11 Thread Ted Yu
12:10 PM > To: user@hbase.apache.org > Subject: Re: endpoint coprocessor > > Bear in mind each region will return its top n, then you will have to run > another top n in your client code. This introduce a numerical error : top > on top. > > On Thursday, April 10, 2014, Bogala,

RE: endpoint coprocessor

2014-04-11 Thread Bogala, Chandra Reddy
, April 11, 2014 12:10 PM To: user@hbase.apache.org Subject: Re: endpoint coprocessor Bear in mind each region will return its top n, then you will have to run another top n in your client code. This introduce a numerical error : top on top. On Thursday, April 10, 2014, Bogala, Chandra Reddy

Re: endpoint coprocessor

2014-04-10 Thread Asaf Mesika
Bear in mind each region will return its top n, then you will have to run another top n in your client code. This introduce a numerical error : top on top. On Thursday, April 10, 2014, Bogala, Chandra Reddy wrote: > Hi, > I am planning to write endpoint coprocessor to calculate TOP N r

Re: endpoint coprocessor

2014-04-10 Thread Ted Yu
ogala, Chandra Reddy" wrote: > Hi, > I am planning to write endpoint coprocessor to calculate TOP N results for my > usecase. I got confused with old apis and new apis. > I followed below links and try to implement. But looks like api's changed a > lot. I don't see many

endpoint coprocessor

2014-04-10 Thread Bogala, Chandra Reddy
Hi, I am planning to write endpoint coprocessor to calculate TOP N results for my usecase. I got confused with old apis and new apis. I followed below links and try to implement. But looks like api's changed a lot. I don't see many of these classes in hbase jars. We are using Hbase

Re: Endpoint Coprocessor Protocol and implementation versioning

2013-10-17 Thread Ted Yu
On Thu, Oct 17, 2013 at 12:18 PM, Asaf Mesika wrote: > Hi, > > I'm writing an Endpoint Coprocessor for HBase v0.94.6 (cdh4.3.1). > > I'm trying to understand how the CoprocessorProtocol and > CoprocessorEndPoint implementation versioning works, both backwards and > fo

Endpoint Coprocessor Protocol and implementation versioning

2013-10-17 Thread Asaf Mesika
Hi, I'm writing an Endpoint Coprocessor for HBase v0.94.6 (cdh4.3.1). I'm trying to understand how the CoprocessorProtocol and CoprocessorEndPoint implementation versioning works, both backwards and forward. For instance, I have my protocol as: public interface MyQueryProtoc

Re: endpoint coprocessor performance

2013-03-07 Thread Gary Helmling
> So should we close HBASE-5492 as a dup? > > Yes, that would make sense. Done.

Re: endpoint coprocessor performance

2013-03-07 Thread Andrew Purtell
> In any case, it looks like the same issue described in HBASE-6870, so if we can get that in, it should solve your problem. So should we close HBASE-5492 as a dup? On Fri, Mar 8, 2013 at 9:34 AM, Gary Helmling wrote: > > I profiled it and getStartKeysInRange is taking all the time. Recall I'm

Re: endpoint coprocessor performance

2013-03-07 Thread Gary Helmling
> I profiled it and getStartKeysInRange is taking all the time. Recall I'm > running 0.92.1. I think these factors are consistent with > https://issues.apache.org/jira/browse/HBASE-5492, which was fixed in > 0.92.3. > > We'll be upgrading soon, so I'll be able to verify the perf issue is gone. > U

Re: endpoint coprocessor performance

2013-03-07 Thread Andrew Purtell
with more testing and will > > report back. I'm definitely seeing significant load on .META. but want to > > see what I can determine about the root cause > > > > > > Sent from my Samsung smartphone on AT&T > > > > > > Original message --

Re: endpoint coprocessor performance

2013-03-07 Thread Kim Hamilton
t cause > > > Sent from my Samsung smartphone on AT&T > > > ---- Original message > Subject: RE: endpoint coprocessor performance > From: Anoop Sam John > To: "user@hbase.apache.org" > CC: > > > Yes agree with Andrew here... I checked

RE: endpoint coprocessor performance

2013-03-05 Thread Kimdhamilton
ng up time in your case? HBASE-6877 also I am seeing. -Anoop- From: Andrew Purtell [apurt...@apache.org] Sent: Wednesday, March 06, 2013 7:28 AM To: user@hbase.apache.org Subject: Re: endpoint coprocessor performance > In current logic, HTable#coprocesso

RE: endpoint coprocessor performance

2013-03-05 Thread Anoop Sam John
- From: Andrew Purtell [apurt...@apache.org] Sent: Wednesday, March 06, 2013 7:28 AM To: user@hbase.apache.org Subject: Re: endpoint coprocessor performance > In current logic, HTable#coprocessorExec always scan the whole table, its efficiency is low No, I don't think that is correct. In its

Re: endpoint coprocessor performance

2013-03-05 Thread Andrew Purtell
> In current logic, HTable#coprocessorExec always scan the whole table, its efficiency is low No, I don't think that is correct. In its current logic, coprocessorExec always scans the META table for all regions of the target table, to find the up to date locations, and then dispatches the exec in

Re: endpoint coprocessor performance

2013-03-05 Thread Kim Hamilton
Thanks so much! This describes exactly what I'm seeing. I did notice extremely heavy load on the region server carrying .META., as described in HBASE-6870: In current logic, HTable#coprocessorExec always scan the whole table, its efficiency is low and will affect the Regionserver carrying .META. u

Re: endpoint coprocessor performance

2013-03-04 Thread Stephen Boesch
great question from Kim and follow-up/answers. 2013/3/4 Gary Helmling > I see this is HBASE-6870. I thought that sounded familiar. > > > On Mon, Mar 4, 2013 at 6:23 PM, Gary Helmling wrote: > > > > > Check your logs for whether your end-point coprocessor is hitting > >> zookeeper on every inv

Re: endpoint coprocessor performance

2013-03-04 Thread Gary Helmling
I see this is HBASE-6870. I thought that sounded familiar. On Mon, Mar 4, 2013 at 6:23 PM, Gary Helmling wrote: > > Check your logs for whether your end-point coprocessor is hitting >> zookeeper on every invocation to figure out the region start key. >> Unfortunately (at least last time I chec

Re: endpoint coprocessor performance

2013-03-04 Thread Gary Helmling
> Check your logs for whether your end-point coprocessor is hitting > zookeeper on every invocation to figure out the region start key. > Unfortunately (at least last time I checked), the default way of invoking > an end point coprocessor doesn't use the meta cache. You can go through a > combinati

Re: endpoint coprocessor performance

2013-03-04 Thread Andrew Purtell
ct values. I ran performance tests >> with two distinct value implementations: one using endpoint coprocessors, >> and one using just scans (computing distinct values client side only). I >> noticed that the endpoint coprocessor implementation averaged 80 ms slower >> than

Re: endpoint coprocessor performance

2013-03-04 Thread James Taylor
to use coprocessors. One interesting scenario is computing distinct values. I ran performance tests with two distinct value implementations: one using endpoint coprocessors, and one using just scans (computing distinct values client side only). I noticed that the endpoint coprocessor implementation avera

Re: endpoint coprocessor performance

2013-03-04 Thread Andrew Purtell
de only). I > noticed that the endpoint coprocessor implementation averaged 80 ms slower > than the scan implementation. Details of that are below for anyone > interested. > > To drill into the performance, I instrumented the code and ultimately > deployed a no-op endpoint coprocessor, to l

Re: endpoint coprocessor performance

2013-03-04 Thread Gary Helmling
t values client side only). I > noticed that the endpoint coprocessor implementation averaged 80 ms slower > than the scan implementation. Details of that are below for anyone > interested. > > To drill into the performance, I instrumented the code and ultimately > deployed a no-o

Re: Loading Endpoint coprocessor from shell

2013-01-16 Thread Amit Sela
Forget about it, my bad :) On Wed, Jan 16, 2013 at 2:48 PM, Amit Sela wrote: > Hi all, > > It seems like I can't load Endpoint coprocessor from shell but I have no > problem loading RegionObserver from shell. > In both cases I pack a jar file, copy it to HDFS and l

Loading Endpoint coprocessor from shell

2013-01-16 Thread Amit Sela
Hi all, It seems like I can't load Endpoint coprocessor from shell but I have no problem loading RegionObserver from shell. In both cases I pack a jar file, copy it to HDFS and load from shell using table_att but only the RegionObserver is loaded (I can see it in the webapp). Is it suppos

Re: EndPoint Coprocessor could be dealocked?

2012-05-18 Thread Michael Segel
Fei DIng, I think you're making the solution harder than it should be. To start with, the only think you need to do is use co-processors to keep the indexes in sync with the underlying table. The code called from the co-processor will depend on the type of action and the type of index you ar

Re: EndPoint Coprocessor could be dealocked?

2012-05-17 Thread fding hbase
Hi Michel, On Fri, May 18, 2012 at 1:39 AM, Michael Segel wrote: > > You should not let just any user run coprocessors on the server. That's > madness. > > > > Best regards, > > > >- Andy > > Fei Ding, > > I'm a little confused. > Are you trying to solve the problem of querying data efficient

Re: EndPoint Coprocessor could be dealocked?

2012-05-17 Thread Michael Segel
> You should not let just any user run coprocessors on the server. That's > madness. > > Best regards, > >- Andy Fei Ding, I'm a little confused. Are you trying to solve the problem of querying data efficiently from a table, or are you trying to find an example of where and when to us

Re: EndPoint Coprocessor could be dealocked?

2012-05-16 Thread Andrew Purtell
On Wed, May 16, 2012 at 6:43 PM, fding hbase wrote: >> Not coprocessors in general. The client side support for Endpoints >> (Exec, etc.) gives the developer the fiction of addressing the cluster >> as a range of rows, and will parallelize per-region Endpoint >> invocations, and collect the respon

Re: EndPoint Coprocessor could be dealocked?

2012-05-16 Thread fding hbase
On Thu, May 17, 2012 at 6:28 AM, Andrew Purtell wrote: > Are HBase coprocessors explicitly wrong for this use case if the app > logic > needs to access multiple regions in a single call? Not coprocessors in general. The client side support for Endpoints > (Exec, etc.) gives the developer the

Re: EndPoint Coprocessor could be dealocked?

2012-05-16 Thread Andrew Purtell
On Wed, May 16, 2012 at 2:40 PM, Dave Revell wrote: > Many people will probably try to use coprocessors as a way of implementing > app logic on top of HBase without the headaches of writing a daemon. > Sometimes client-side approaches are inadvisable; for example, there may be > several client lan

Re: EndPoint Coprocessor could be dealocked?

2012-05-16 Thread Michael Segel
David, Its not a question of a daemon, its a question of the problem you are trying to solve. Using this as an example.. you are not always going to select data from a given table always using the same query. So you will not always want to use the index on column A and then the index on column

Re: EndPoint Coprocessor could be dealocked?

2012-05-16 Thread Dave Revell
Many people will probably try to use coprocessors as a way of implementing app logic on top of HBase without the headaches of writing a daemon. Sometimes client-side approaches are inadvisable; for example, there may be several client languages/runtimes and the app logic should not be reimplemented

Re: EndPoint Coprocessor could be dealocked?

2012-05-16 Thread Michael Segel
I think we need to look at the base problem that is trying to be solved. I mean the discussion on the RPC mechanism. but the problem that the OP is trying to solve is how to use multiple indexes in a 'query'. Note: I put ' ' around query because its a m/r job or a single thread where the us

Re: EndPoint Coprocessor could be dealocked?

2012-05-16 Thread Andrew Purtell
> On May 16, 2012, at 1:12 AM, fding hbase wrote: >> But sadly, HBase ipc doesn't allow coprocessor chaining mechanism... >> Someone mentioned on >> http://grokbase.com/t/hbase/user/116hrhhf8m/coprocessor-failure-question-and-examples >> : >> >> If a RegionObserver issues RPC to another table from

Re: EndPoint Coprocessor could be dealocked?

2012-05-16 Thread Michael Segel
ean you wanted to do multi-key indexes where the key is a >>>> multi-key part. >>>> >>>> Or did you mean that you really wanted to use multiple indexes at the >> same >>>> time on a single query? >>>> >>>> If its the latter, no

Re: EndPoint Coprocessor could be dealocked?

2012-05-15 Thread fding hbase
ort order? > >> > >> What happens when the results from the indexes exceed the amount of > >> allocated memory? > >> > >> What I am suggesting you to do is to set aside the underpinnings of > HBase > >> and look at the problem you are trying to sol

Re: EndPoint Coprocessor could be dealocked?

2012-05-15 Thread Michel Segel
; easy one... >> >> >> >> Sent from a remote device. Please excuse any typos... >> >> Mike Segel >> >> On May 14, 2012, at 4:35 AM, fding hbase wrote: >> >>> Hi all, >>> >>> Is it possible to use table scanner (diff

Re: EndPoint Coprocessor could be dealocked?

2012-05-14 Thread fding hbase
t; > > > Is it possible to use table scanner (different from the host table > region) > > or > > execute coprocessor of another table, in the endpoint coprocessor? > > It looks like chaining coprocessors. But I found a possible deadlock! > > Can anyone help me with this? &

Re: EndPoint Coprocessor could be dealocked?

2012-05-14 Thread Michel Segel
> Is it possible to use table scanner (different from the host table region) > or > execute coprocessor of another table, in the endpoint coprocessor? > It looks like chaining coprocessors. But I found a possible deadlock! > Can anyone help me with this? > > In my testing

EndPoint Coprocessor could be dealocked?

2012-05-14 Thread fding hbase
Hi all, Is it possible to use table scanner (different from the host table region) or execute coprocessor of another table, in the endpoint coprocessor? It looks like chaining coprocessors. But I found a possible deadlock! Can anyone help me with this? In my testing environment I deployed the