Hi, Dogacan,
I'm quite confused with the avro design nutchbase is using. The hbase
schema is defined both in /org/apache/nutch/storage/NutchFields.java
(http://github.com/dogacan/nutchbase/blob/master/src/java/org/apache/nutch/storage/NutchFields.java)
and /webtable.json
On 2010-02-09 03:08, Hua Su wrote:
Thanks. But heritrix is another project, right?
Please see this Git repository, it contains the latest work in progress
on Nutch+HBase:
git://github.com/dogacan/nutchbase.git
--
Best regards,
Andrzej Bialecki
___. ___ ___ ___ _ _
Hi,
I notice the repository has not been updated since last Christmas. Is that
work still in progress?
Best,
Hua
On Tue, Feb 9, 2010 at 4:23 PM, Andrzej Bialecki a...@getopt.org wrote:
On 2010-02-09 03:08, Hua Su wrote:
Thanks. But heritrix is another project, right?
Please see this Git
Hi all,
Any recent progress on HBase integration? There is a filed issue
NUTCH-650http://issues.apache.org/jira/browse/NUTCH-650
.
I really love the idea of using HBase as nutch storage backend. It not only
simplifies nutch storage, but also makes much url/page processing work more
efficient due
FWIW, there is a plugin for heritrix to write to hbase as a back end store.
Maybe it will help for making a nutch plugin?
http://code.google.com/p/hbase-writer
-Ryan
On Mon, Feb 8, 2010 at 4:32 AM, Hua Su huas...@gmail.com wrote:
Hi all,
Any recent progress on HBase integration? There is a
Thanks. But heritrix is another project, right?
Is there any plan about nutch hbase?
On Mon, Feb 8, 2010 at 5:45 PM, Ryan Smith ryan.justin.sm...@gmail.comwrote:
FWIW, there is a plugin for heritrix to write to hbase as a back end store.
Maybe it will help for making a nutch plugin?