Nutch is designed to be international, but it has not been tested extensively, so there may be some issues, e.g., with character sets.
Cheers,
Doug
Stas wrote:
Hi.
Are there any known issues with indexing non-English pages? There are only question marks in the cached copies of the sites. Lucene, as far as I know, can work with any language.
Thanks,
Stas Oskin.
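
For illustration only, here is a minimal Python sketch of one common way such question marks can arise: text is encoded into a character set that cannot represent it, and every unmappable character is replaced with "?". This is not Nutch's code, and the thread does not establish that this is the actual cause. The function name `to_charset_and_back` and the sample string are hypothetical.

```python
def to_charset_and_back(text: str, charset: str) -> str:
    """Encode text into the given charset, replacing unmappable
    characters, then decode it again with the same charset.

    Hypothetical helper: it mimics a pipeline that stores page
    content in a fixed charset regardless of the page's language.
    """
    # errors="replace" substitutes "?" for each character the
    # target charset cannot represent.
    raw = text.encode(charset, errors="replace")
    return raw.decode(charset)


sample = "Привет"  # Cyrillic text, not representable in Latin-1

print(to_charset_and_back(sample, "latin-1"))  # every letter is lost
print(to_charset_and_back(sample, "utf-8"))    # round-trips intact
```

If the cached copies are stored or served in a single-byte Western charset, non-Latin pages would degrade in this way, while a Unicode encoding such as UTF-8 would preserve them.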
Nutch-general mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-general
