Hi All,
I think I am just about finished my plugin (nutch 1.0) which adds extra metadata
to during parsing the problem I am having is it doesn't seem to be adding the
data to the system (via luke or readseg). I looked at in the wiki but it seems
to be for 0.9 and the syntax looks different.
david.stu...@progressivealliance.co.uk wrote:
Hi All,
I think I am just about finished my plugin (nutch 1.0) which adds extra
metadata to during parsing the problem I am having is it doesn't seem to
be adding the data to the system (via luke or readseg). I looked at in
the wiki but it
I thought I did but I thought before I did a bin/nutch index (or solrindex) it
would be stored somewhere it does seems to be getting to the doc.add bit which
makes me think the variable is empty
{code}
public void addIndexBackendOptions(Configuration conf) {
LOG.warn(+_+_You called me
Sorry I meant doesn't get to doc.add
David
On 24 Nov 2009, at 11:27, david.stu...@progressivealliance.co.uk david.stu...@progressivealliance.co.uk
wrote:
I thought I did but I thought before I did a bin/nutch index (or
solrindex) it would be stored somewhere it does seems to be getting
Sorry its suppose to say would be stored somewhere it DOESN'T seem to be
getting to the doc.add bit which
On 24 November 2009 at 12:27 david.stu...@progressivealliance.co.uk
david.stu...@progressivealliance.co.uk wrote:
I thought I did but I thought before I did a bin/nutch index (or solrindex)
Dear Wiki user,
You have subscribed to a wiki page or wiki category on Nutch Wiki for change
notification.
The OptimizingCrawls page has been changed by DennisKubes.
The comment on this change is: Page about optimizing crawling speed.
http://wiki.apache.org/nutch/OptimizingCrawls
Dear Wiki user,
You have subscribed to a wiki page or wiki category on Nutch Wiki for change
notification.
The FrontPage page has been changed by DennisKubes.
http://wiki.apache.org/nutch/FrontPage?action=diffrev1=122rev2=123
--
*
Hello everybody,
I don't know if it is a known issue, but it's been like that since at least
a couple of days so I figured I should tell someone. The root url for the
nutch wiki http://wiki.apache.org/nutch/ doesn't redirect to
http://wiki.apache.org/nutch/FrontPage ! It's annoying because that's
Add WebGraph classes to the bin/nutch script
Key: NUTCH-771
URL: https://issues.apache.org/jira/browse/NUTCH-771
Project: Nutch
Issue Type: Improvement
Affects Versions: 1.1
[
https://issues.apache.org/jira/browse/NUTCH-768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12782172#action_12782172
]
Dennis Kubes commented on NUTCH-768:
I have tested the upgrade with Hadoop 0.20. To
[
https://issues.apache.org/jira/browse/NUTCH-771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12782177#action_12782177
]
Andrzej Bialecki commented on NUTCH-771:
-
+1 to adding these to the script. The
[
https://issues.apache.org/jira/browse/NUTCH-768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12782179#action_12782179
]
Andrzej Bialecki commented on NUTCH-768:
-
Are there any source code changes
12 matches
Mail list logo