Re: svn commit: r405565 - in /lucene/nutch/trunk/src: java/org/apache/nutch/searcher/ test/org/apache/nutch/searcher/ web/jsp/

2006-05-11 Thread Dawid Weiss
The reason is that they should not use the same HTML code: 1. OpenSearch should only use <b> tags around highlights. 2. search.jsp should use some more complicated HTML code (span ...). Add 3. Clustering would benefit from a plain text version. D.
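The three output variants discussed in this message can be sketched as follows. This is a minimal illustration, not the code committed in r405565: the class `SummarySketch`, its `Fragment` type, the method names, and the `highlight` CSS class are all hypothetical and were chosen only to show one summary rendered three ways.

```java
import java.util.List;

public class SummarySketch {

    /** One piece of a summary; highlighted pieces are the matched query terms. */
    static final class Fragment {
        final String text;
        final boolean highlighted;

        Fragment(String text, boolean highlighted) {
            this.text = text;
            this.highlighted = highlighted;
        }
    }

    /** Escapes the characters that are significant in HTML text content. */
    static String escape(String s) {
        return s.replace("&", "&amp;").replace("<", "&lt;").replace(">", "&gt;");
    }

    /** Shared HTML rendering: wraps each highlighted fragment in the given tags. */
    private static String toHtml(List<Fragment> fragments, String open, String close) {
        StringBuilder sb = new StringBuilder();
        for (Fragment f : fragments) {
            if (f.highlighted) {
                sb.append(open).append(escape(f.text)).append(close);
            } else {
                sb.append(escape(f.text));
            }
        }
        return sb.toString();
    }

    /** Variant 1 (OpenSearch-style): only <b> around highlights. */
    static String toBoldHtml(List<Fragment> fragments) {
        return toHtml(fragments, "<b>", "</b>");
    }

    /** Variant 2 (search.jsp-style): a styled span; the class name is an assumption. */
    static String toSpanHtml(List<Fragment> fragments) {
        return toHtml(fragments, "<span class=\"highlight\">", "</span>");
    }

    /** Variant 3 (clustering input): plain text, no markup and no escaping. */
    static String toPlainText(List<Fragment> fragments) {
        StringBuilder sb = new StringBuilder();
        for (Fragment f : fragments) {
            sb.append(f.text);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        List<Fragment> summary = List.of(
                new Fragment("open source ", false),
                new Fragment("search", true),
                new Fragment(" engine", false));
        System.out.println(toBoldHtml(summary));
        System.out.println(toSpanHtml(summary));
        System.out.println(toPlainText(summary));
    }
}
```

Keeping the fragments separate from their rendering means each consumer picks its own markup, and the plain-text form never has to be recovered by stripping tags.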

Re: svn commit: r405565 - in /lucene/nutch/trunk/src: java/org/apache/nutch/searcher/ test/org/apache/nutch/searcher/ web/jsp/

2006-05-11 Thread Jérôme Charron
Add 3. Clustering would benefit from a plain text version. Yes Dawid, but it is already committed: the clustering now uses the plain text version returned by the toString() method. Dawid, I have a question about clustering. Actually, the clustering uses the summaries as input. I assume it

Interleaved (parallel) fetch cycles

2006-05-11 Thread Andrzej Bialecki
Hi, I'm planning to work on adding support in 0.8 for interleaved fetch cycles. What this means is that (within some limits) you can generate multiple fetchlists, fetch them at different times, and then update the crawldb not necessarily in the original sequence as they were generated. You

[jira] Commented: (NUTCH-267) Indexer doesn't consider linkdb when calculating boost value

2006-05-11 Thread Andrzej Bialecki (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-267?page=comments#action_12379072 ] Andrzej Bialecki commented on NUTCH-267: Hmm, resetting the score to 0 is also dubious - it's as if we didn't want it to be re-crawled if we can't find any inlinks

Re: svn commit: r405565 - in /lucene/nutch/trunk/src: java/org/apache/nutch/searcher/ test/org/apache/nutch/searcher/ web/jsp/

2006-05-11 Thread Sami Siren
Jérôme Charron wrote: (but if the nutch-site.xml overrides the plugin.include property and doesn't include it, it will not be activated, like any other plugin) Yes, that's what I meant; I guess that's the default case for people hacking plugins. -- Sami Siren

Re: [jira] Updated: (NUTCH-251) Administration GUI

2006-05-11 Thread Stefan Groschupf
Hi, the easiest way is to download one of the binary distributions. However, as far as I know, the patches still work and need to be applied to both projects. Stefan On 11.05.2006 at 08:38, TDLN wrote: Hi Stephan. I am about to get started with the Admin GUI and was wondering if these

Re: [jira] Updated: (NUTCH-251) Administration GUI

2006-05-11 Thread TDLN
I have my local changes, so I can't use the binary distribution. Anyway, I will have a go at it and let you know. Rgrds, Thomas On 5/11/06, Stefan Groschupf [EMAIL PROTECTED] wrote: Hi, the easiest way is to download one of the binary distributions. However, as far as I know, the patches still

[jira] Commented: (NUTCH-267) Indexer doesn't consider linkdb when calculating boost value

2006-05-11 Thread Doug Cutting (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-267?page=comments#action_12379116 ] Doug Cutting commented on NUTCH-267: re: it's as if we didn't want it to be re-crawled if we can't find any inlinks to it We prioritize crawling based on the number of

Re: svn commit: r405565 - in /lucene/nutch/trunk/src: java/org/apache/nutch/searcher/ test/org/apache/nutch/searcher/ web/jsp/

2006-05-11 Thread Jérôme Charron
(but if the nutch-site.xml overrides the plugin.include property and doesn't include it, it will not be activated, like any other plugin) Yes, that's what I meant; I guess that's the default case for people hacking plugins. Oh, yes Sami, I understand what you mean... Sorry, I just forgot to

Re: [Nutch-dev] Re: svn commit: r405565 - in /lucene/nutch/trunk/src: java/org/apache/nutch/searcher/ test/org/apache/nutch/searcher/ web/jsp/

2006-05-11 Thread Jérôme Charron
Bob Carpenter of alias-i had this to say when I brought up this very idea: http://article.gmane.org/gmane.comp.jakarta.lucene.devel/12599 Thanks for your response, Marvin. But finally, my question is: shouldn't the Nutch clustering use some fixed-size snippets instead of the configurable

new location! nutch user meeting San Francisco

2006-05-11 Thread Stefan Groschupf
Hi there, since there is such big interest in the Nutch user meeting, we decided to move to another location. We will now meet at: Rite-Spot Cafe, (415) 552-6066, 2099 Folsom St, San Francisco, CA 94110. It's in a good location for parking too, and it's even reachable by public transport -- 2 blocks