[ https://issues.apache.org/jira/browse/NUTCH-650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12743919#action_12743919 ]
Doğacan Güney commented on NUTCH-650: ------------------------------------- I just committed code to branch nutchbase. The scoring API did not turn out as clean as I expected but I decided to put in what I have. Also, I made some changes so that web UI also works. I am leaving this issue open because I will add documentation tomorrow. Meanwhile, To download: svn co http://svn.apache.org/repos/asf/lucene/nutch/branches/nutchbase Usage: After starting hbase 0.20 (checkout rev. 804408 from hbase branch 0.20), create a webtable with bin/nutch createtable webtable After that, usage is similar. bin/nutch inject webtable url_dir # inject urls for as many cycles as you want; bin/nutch generate webtable #-topN N works bin/nutch fetch webtable # -threads N works bin/nutch parse webtable bin/nutch updatetable webtable bin/nutch index <index> webtable or bin/nutch solrindex <solr url> webtable To use solr, use this schema file http://www.ceng.metu.edu.tr/~e1345172/schema.xml Again, a note of warning: This is extremely new code. I hope people will test and use it but there is no guarantee that it will work :) > Hbase Integration > ----------------- > > Key: NUTCH-650 > URL: https://issues.apache.org/jira/browse/NUTCH-650 > Project: Nutch > Issue Type: New Feature > Affects Versions: 1.0.0 > Reporter: Doğacan Güney > Assignee: Doğacan Güney > Fix For: 1.1 > > Attachments: hbase-integration_v1.patch, hbase_v2.patch, > malformedurl.patch, meta.patch, meta2.patch, nofollow-hbase.patch, > nutch-habase.patch, searching.diff, slash.patch > > > This issue will track nutch/hbase integration -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.