> The top 10 voted issues are currently:
>
> NUTCH-61 Adaptive re-fetch interval. Detecting umodified content
>
Well ... I'm of a split mind on this. I can bring this patch up to date
and apply it before 0.9.0, if we understand that this is a "0" release
... ;) Otherwise I'd prefer to wait with it right after the release.
+1 for putting it in after 0.9.0
I would like also to proceed with NUTCH-339 (Fetcher2 patches + plus
some changes I made in the meantime), since I'd like to expose the new
fetcher to a broader audience, and it doesn't affect the existing
implementation.
+1 for putting it in before 0.9.0
NUTCH-48 "Did you mean" query enhancement/refignment feature
> NUTCH-251 Administration GUI
> NUTCH-289 CrawlDatum should store IP address
>
I'm still not entirely convinced about this - and there is already a
mechanism in place to support it if someone really wishes to keep this
particular info (CrawlDatum.metaData).
> NUTCH-36 Chinese in Nutch
> NUTCH-185 XMLParser is configurable xml parser
plugin. NUTCH-59 meta
> data support in webdb
> NUTCH-92 DistributedSearch incorrectly scores
results NUTCH-68
This is too intrusive to fix just before the release - and needs
additional discussion.
+1
NUTCH-68 A
> tool to generate arbitrary fetchlists
Easy to port this to 0.9.0 - I can do this.
cool.
I'll start working on the headers and stuff to get the blocking issue away.
--
Sami Siren