Re: About HBase Integration

2010-02-24 Thread xiao yang
Hi, Dogacan, I'm quite confused with the avro design nutchbase is using. The hbase schema is defined both in /org/apache/nutch/storage/NutchFields.java (http://github.com/dogacan/nutchbase/blob/master/src/java/org/apache/nutch/storage/NutchFields.java) and /webtable.json

Re: About HBase Integration

2010-02-09 Thread Andrzej Bialecki
On 2010-02-09 03:08, Hua Su wrote: Thanks. But heritrix is another project, right? Please see this Git repository, it contains the latest work in progress on Nutch+HBase: git://github.com/dogacan/nutchbase.git -- Best regards, Andrzej Bialecki ___. ___ ___ ___ _ _

Re: About HBase Integration

2010-02-09 Thread Hua Su
Hi, I notice the repository has not been updated since last Christmas. Is that work still in progress? Best, Hua On Tue, Feb 9, 2010 at 4:23 PM, Andrzej Bialecki a...@getopt.org wrote: On 2010-02-09 03:08, Hua Su wrote: Thanks. But heritrix is another project, right? Please see this Git

About HBase Integration

2010-02-08 Thread Hua Su
Hi all, Any recent progress on HBase integration? There is a filed issue NUTCH-650http://issues.apache.org/jira/browse/NUTCH-650 . I really love the idea of using HBase as nutch storage backend. It not only simplifies nutch storage, but also makes much url/page processing work more efficient due

Re: About HBase Integration

2010-02-08 Thread Ryan Smith
FWIW, there is a plugin for heritrix to write to hbase as a back end store. Maybe it will help for making a nutch plugin? http://code.google.com/p/hbase-writer -Ryan On Mon, Feb 8, 2010 at 4:32 AM, Hua Su huas...@gmail.com wrote: Hi all, Any recent progress on HBase integration? There is a

Re: About HBase Integration

2010-02-08 Thread Hua Su
Thanks. But heritrix is another project, right? Is there any plan about nutch hbase? On Mon, Feb 8, 2010 at 5:45 PM, Ryan Smith ryan.justin.sm...@gmail.comwrote: FWIW, there is a plugin for heritrix to write to hbase as a back end store. Maybe it will help for making a nutch plugin?