Hi:
After I run the InjectorJob (urls -crawlId my_id), there are 6 columns in
HBase.
1)column=f:fi
2)column=f:ts
3)column=mk:_injmrk_
4)column=mk:dist
5)column=mtdt:_csh_
6)column=s:s
Except for "column=mtdt:_csh_", I have found where the other columns are
created in the source code. Where is t
Thanks, Prashant. The link has been updated in [0].
[0] : https://wiki.apache.org/nutch/Nutch2Tutorial
On Sun, Jun 23, 2013 at 8:56 PM, Prashant Ladha wrote:
> I do know that this topic is closed but I found a better link so thought of
> sharing it.
> I found the Apache link [0] for downloading the hb
I do know that this topic is closed but I found a better link so thought of
sharing it.
I found the Apache link [0] for downloading HBase 0.90.4; it has
previous versions as well.
I recommend adding this link to the installation documentation itself.
[0] http://archive.apache.org/dist/h
Yes, I've done that. Thanks.
On Sun, Jun 23, 2013 at 9:53 AM, Sebastian Nagel wrote:
> Hi Joe,
>
> > Ideally, it should take higher priority than the default interval. This
> is
> > particularly important for sites such as cnn.com, where the leaf page
> > doesn't really change, but the portal
Hi Joe,
> Ideally, it should take higher priority than the default interval. This is
> particularly important for sites such as cnn.com, where the leaf page
> doesn't really change, but the portal page is updated all the time.
AdaptiveFetchSchedule does exactly this: if a page is found modified
Hi,
I have now tried HBase 0.90.4 and 0.90.6 but am still getting
"Exception in thread "main" java.lang.NoClassDefFoundError:
org/apache/hadoop/hbase/HBaseConfiguration"
I have also tried copying my HBase lib directory to nutch2.x/build/lib,
but I still get this error.
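
A NoClassDefFoundError like the one above usually means the class is not visible on the classpath of the JVM that is actually started, regardless of which directory the jars were copied to. A minimal sketch for checking this follows. The class name ClasspathCheck and the helper isOnClasspath are hypothetical names chosen for illustration and are not part of Nutch or HBase; it would need to be run with the same classpath that the failing command uses.

```java
// Hypothetical diagnostic helper, not part of Nutch or HBase.
public class ClasspathCheck {

    /** Returns true if the named class can be located by this class's class loader. */
    static boolean isOnClasspath(String className) {
        try {
            // initialize=false: only locate and load the class, do not run its static initializers.
            Class.forName(className, false, ClasspathCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            // Covers a missing class as well as NoClassDefFoundError from missing dependencies.
            return false;
        }
    }

    public static void main(String[] args) {
        // Defaults to the class named in the error message above.
        String name = args.length > 0
                ? args[0]
                : "org.apache.hadoop.hbase.HBaseConfiguration";
        System.out.println(name + " on classpath: " + isOnClasspath(name));
        // Print the effective classpath so it can be compared with the expected lib directory.
        System.out.println("java.class.path = " + System.getProperty("java.class.path"));
    }
}
```

If the check reports false, the printed java.class.path shows which directories and jars the JVM really sees, which may differ from the directory the jars were copied into.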
My ivy.xml is
And I can see