I would recommend reading this... http://hbase.apache.org/book.html
... and per what JD already suggested downloading and trying HBase.

On 12/13/11 12:40 PM, "shashwat shriparv" <[email protected]> wrote:

>Till now i was trying to use nutch to crawl from http addresses and index
>into solr, and i got requirement to crawl from hbase, means the data will
>be stored in hbase and i need to crawl it and index into solr, up to now as
>i have researched internet dint find anything to crawl or collect data from
>hbase through nutch, so i like to know any thing available on apache which
>connect to hbase and crawl data. that much i have tried.
>so may get some information about crawling or fetching information from
>hbase and index.
>
>Thanks
>Shashwat
>
>On Tue, Dec 13, 2011 at 11:05 PM, Jean-Daniel Cryans
><[email protected]> wrote:
>
>> Do you have a more specific question? Have you tried anything yet?
>>
>> Thanks for helping us helping you,
>>
>> J-D
>>
>> On Tue, Dec 13, 2011 at 5:24 AM, shashwat shriparv
>> <[email protected]> wrote:
>> > We are putting data into HBase in specific format, since the data in
>> > HBase will be very large hence we need to crawl data from HBase and index
>> > it into Solr, so which is the tool available for this requirement, or how
>> > to approach on this requirement.
>> >
>> > Regards
>> >
>> > Shashwat
>
>--
>Shashwat Shriparv
>09900059620
>09663531241
