Stas,

Nutch is designed to be international, but it has not been tested extensively, so there may be some issues, e.g., with character sets, etc.

Cheers,

Doug

Stas wrote:
Hi.
Are there any known issues with indexing non-English pages? There only a question marks in the cached copies of the sites. The Lucene, as far as I know, can work with any language.
Thanks,
Stas Oskin.


-------------------------------------------------------
This SF.Net email is sponsored by: IBM Linux Tutorials
Free Linux tutorial presented by Daniel Robbins, President and CEO of
GenToo technologies. Learn everything from fundamentals to system
administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click
_______________________________________________
Nutch-general mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to