When the webpage write the column "mtdt:_csh_" in the hbase?

2013-06-23 Thread RS
Hi: After I run the injectorJob (urls -crawlId my_id), there are 6 columns in hbase. 1)column=f:fi 2)column=f:ts 3)column=mk:_injmrk_ 4)column=mk:dist 5)column=mtdt:_csh_ 6)column=s:s Except the "column=mtdt:_csh_", other columns I have found where they are created in the source code. Where is t

Re: Nutch 2.x with HBase backend errors

2013-06-23 Thread Tejas Patil
Thanks Prashant. The link is modified in [0] [0] : https://wiki.apache.org/nutch/Nutch2Tutorial On Sun, Jun 23, 2013 at 8:56 PM, Prashant Ladha wrote: > I do know that this topic is closed but I found a better link so thought of > sharing it. > I found the Apache link [0] for downloading the hb

Re: Nutch 2.x with HBase backend errors

2013-06-23 Thread Prashant Ladha
I do know that this topic is closed but I found a better link so thought of sharing it. I found the Apache link [0] for downloading the hbase 0.90.4 plus it has previous versions as well. I recommend of adding this link in the installation documentation itself. [0] http://archive.apache.org/dist/h

Re: confusion over fetch schedule

2013-06-23 Thread Joe Zhang
Yes, i've done that. Thanks. On Sun, Jun 23, 2013 at 9:53 AM, Sebastian Nagel wrote: > Hi Joe, > > > Ideally, it should take higher priority than the default interval. This > is > > particularly important for sites such as cnn.com, whether the leaf page > > doesn't really change, but the portal

Re: confusion over fetch schedule

2013-06-23 Thread Sebastian Nagel
Hi Joe, > Ideally, it should take higher priority than the default interval. This is > particularly important for sites such as cnn.com, whether the leaf page > doesn't really change, but the portal page is updated all the time. AdaptiveFetchSchedule does exactly this: if a page is found modified

Re: Nutch 2.x with HBase backend errors

2013-06-23 Thread Tony Mullins
Hi, Now I have tried HBase 0.90.4 & 0.90.6 but still getting "Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/hadoop/hbase/HBaseConfiguration" I have also tried copying my HBase lib directory to nutch2.x/build/lib , but still getting this error. My ivy.xml is And I can see