The reason is that they should not use the same HTML code :
1. OpenSearch should only use b around highlights
2. search.jsp should use some more complicated HTML code (span ... )
Add 3. Clustering would benefit from a plain text version.
D.
Add 3. Clustering would benefit from a plain text version.
Yes Dawid, but it is already committed = the clustering now uses the plain
text version returned by the toString() method.
Dawid, I have a question about clustering.
Actually, the clustering uses the summaries as input. I assumes it
Hi,
I'm planning to work on adding support in 0.8 for interleaved fetch cycles.
What this means is that (within some limits) you can generate multiple
fetchlists, fetch them at different times, and then update the crawldb
not necessarily in the original sequence as they were generated. You
[
http://issues.apache.org/jira/browse/NUTCH-267?page=comments#action_12379072 ]
Andrzej Bialecki commented on NUTCH-267:
-
Hmm, resetting the score to 0 is also dubious - it's as if we didn't want it to
be re-crawled if we can't find any inlinks
Jérôme Charron wrote:
(but if the nutch-site.xml overrides the plugin.include property and
doen't
include it it will not be activated, like any other plugin)
yes, that's what I ment, I quess that's the default case for people
hacking plugins.
--
Sami Siren
Hi,
the easiest way is to download one of the binary distributions.
However as far I know the patches still work and need to be applied
to both projects.
Stefan
Am 11.05.2006 um 08:38 schrieb TDLN:
Hi Stephan.
I am about to get started with the Admin GUI and was wondering if
these
I have my local changes, so I can't use the binary distribution.
Anyway, I will have a go at it and let you know.
Rgrds, Thomas
On 5/11/06, Stefan Groschupf [EMAIL PROTECTED] wrote:
Hi,
the easiest way is to download one of the binary distributions.
However as far I know the patches still
[
http://issues.apache.org/jira/browse/NUTCH-267?page=comments#action_12379116 ]
Doug Cutting commented on NUTCH-267:
re: it's as if we didn't want it to be re-crawled if we can't find any inlinks
to it
We prioritize crawling based on the number of
(but if the nutch-site.xml overrides the plugin.include property and
doen't
include it it will not be activated, like any other plugin)
yes, that's what I ment, I quess that's the default case for people
hacking plugins.
Oh, yes Sami, I understand what you mean...
Sorry, I just forgot to
Bob Carpenter of alias-i had this to say when I brought up this very
idea:
http://article.gmane.org/gmane.comp.jakarta.lucene.devel/12599
Thanks for you response Marvin.
But finally my question is : shouldn't the nutch clustering uses some
fixed size snippets instead of the configurable
Hi there,
since there is such a big interest in the nutch user meeting,
we decide to move to a other location.
We will now meet:
Rite-Spot Cafe
(415) 552-6066
2099 Folsom St
San Francisco, CA 94110
Its in a good location too for parking and its even reachable by
public transport -- 2 blocks
11 matches
Mail list logo