If you have a large table, then using hbase shell is probably not a good idea 
because it will take too long to run the job.  Instead, you need to start a 
mapreduce job to do the processing for you.

Since pig already exists as a shell that can create mapreduce jobs (like 
sawzall), I think it is probably best to work on pig until/unless you find 
something in their approach you fundamentally disagree with.

-Michael

On 12/10/07 6:22 PM, "edward yoon" <[EMAIL PROTECTED]> wrote:



Let's assume some data set of Hbase relations can be undergoing many changes by 
advance of human culture.

We need better relation for these changes.
Therefore, I think formal relational algebra operators will be good 
administrative tool in hbase shell.
We can use relational algebra operators like a administrative tool.

Also, it is helpful for temporary simulation situation.
Let's assume the tables.

1. huge-webTable ( URL  , title, content, image, language, metatag, color, ... 
, etc ).
2. huge-clickLogTable ( URL , userIP, search_Keyword, ... , etc)

And we wnat to categorize web document by language and userIP factors.

Then we can make the temporary table using relational algebra operators.
π language, userIP (webTable ▷◁ row = row clickLogTable)

Users can easily develop the numeric analysis application program for 
categorize document sets using new Relation table.
Formal relational algebra operators can be most useful feature in Hbase.

What do you think?

------------------------------

B. Regards,

Edward yoon @ NHN, corp.
Home : http://www.udanax.org
_________________________________________________________________
Put your friends on the big screen with Windows Vista® + Windows Live™.
http://www.microsoft.com/windows/shop/specialoffers.mspx?ocid=TXT_TAGLM_CPC_MediaCtr_bigscreen_102007


Reply via email to