If you have a large table, then using the hbase shell is probably not a good idea because the job will take too long to run. Instead, you need to start a mapreduce job to do the processing for you.
Since pig already exists as a shell that can create mapreduce jobs (like sawzall), I think it is probably best to work on pig until/unless you find something in their approach you fundamentally disagree with.

-Michael

On 12/10/07 6:22 PM, "edward yoon" <[EMAIL PROTECTED]> wrote:

Let's assume that the data sets behind some Hbase relations undergo many changes as human culture advances. We need better relations to keep up with these changes. Therefore, I think formal relational algebra operators would be a good administrative tool in the hbase shell. They would also be helpful for temporary simulations.

Let's assume the following tables:

1. huge-webTable ( URL , title, content, image, language, metatag, color, ... , etc ).
2. huge-clickLogTable ( URL , userIP, search_Keyword, ... , etc)

We want to categorize web documents by the language and userIP factors. We can then make a temporary table using relational algebra operators:

π language, userIP (webTable ⋈ row = row clickLogTable)

Users can easily develop a numeric analysis application for categorizing document sets using the new relation table. Formal relational algebra operators could be a most useful feature in Hbase.

What do you think?

------------------------------
B. Regards,
Edward yoon @ NHN, corp.
Home : http://www.udanax.org
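
For readers unfamiliar with the notation, here is a rough sketch of what the projection-over-join expression in the quoted message computes. It is plain Python over small in-memory dictionaries, not hbase shell or pig syntax. The sample rows, the helper names join_on_row and project, and the one-click-per-URL simplification are all assumptions made for illustration.

```python
# Illustrative in-memory stand-ins for the two tables, keyed by row (URL).
# All rows and values below are made up for this sketch.
web_table = {
    "http://a.example/": {"title": "A", "language": "en"},
    "http://b.example/": {"title": "B", "language": "ko"},
}

# Simplification: one click per URL. A real click log would hold many.
click_log_table = {
    "http://a.example/": {"userIP": "10.0.0.1", "search_Keyword": "hbase"},
    "http://b.example/": {"userIP": "10.0.0.2", "search_Keyword": "pig"},
    "http://c.example/": {"userIP": "10.0.0.3", "search_Keyword": "hadoop"},
}


def join_on_row(left, right):
    """Join on row = row: keep only row keys present in both tables."""
    shared_rows = left.keys() & right.keys()
    return {row: {**left[row], **right[row]} for row in shared_rows}


def project(table, columns):
    """Projection: keep only the named columns and drop duplicate tuples."""
    return {tuple(cols[c] for c in columns) for cols in table.values()}


joined = join_on_row(web_table, click_log_table)
result = project(joined, ["language", "userIP"])
print(sorted(result))
```

On tables of the size discussed in this thread, the same join and projection would have to run as a distributed job rather than in a single process, which is the point of the reply above.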
