Hi there,
great, I notice something is going on regarding heritrix and nutch integration.
http://crawler.archive.org/cgi-bin/wiki.pl?NutchSearchingArcs
I'm a bit confused by this page since it looks like the actually work is focused to make arcs (heritrix file format) search able but not to use heritrix as crawler.
Wouldn't make it sense to make to have a crawler extension point and have the nutch crawler and a heritrix crawler plugin?
I haven't that much experience with heritrix, but looks like this crawler is very powerful.
Can someone involved in the integration project give any status comment and tell us the long term goal of the integration?
Thank you very much!
Stefan
-------------------------------------------------------
SF email is sponsored by - The IT Product Guide
Read honest & candid reviews on hundreds of IT Products from real users.
Discover which products truly live up to the hype. Start reading now. http://productguide.itmanagersjournal.com/
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers
