Hi,

Have you tried changing this property?

parser.character.encoding.default
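
In case it helps, here is a minimal sketch of how that property could be overridden in conf/nutch-site.xml. The value utf-8 is only an assumption for illustration; use whatever encoding your pages are really served in. As far as I know this property is only the fallback used when no encoding can be detected from the HTTP headers or the page itself, and the shipped default in nutch-default.xml is windows-1252, so please check it against your own copy.

```xml
<?xml version="1.0"?>
<!-- conf/nutch-site.xml: local overrides of nutch-default.xml -->
<configuration>
  <property>
    <name>parser.character.encoding.default</name>
    <!-- Assumed value for illustration; set it to the encoding of your pages -->
    <value>utf-8</value>
    <description>Fallback character encoding used by the parser when no
    other encoding information is available.</description>
  </property>
</configuration>
```

Pages fetched before the change would need to be parsed again for the new setting to take effect.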



>
> Hej,
>
> I am a newbie in Nutch and I need some help with a problem because I
> cannot find clear documentation.
>
> During the crawling process, when each FetcherThread gets the content,
> it is formatted in a way that deletes the newline characters ("\n")
> and transforms useful Spanish characters such as á, é, í, ó, ú, ñ, ü in the
> default
> encoding like: �¡, �³, �­, �³, �º, �±,
> �¼.
>
> I would like to know if it is possible to change this default encoding
> (is it UTF-8?) to the one that I need (ASCII, I guess).
>
> Thanks in advance ;)
> --
> View this message in context:
> http://old.nabble.com/Encoding-the-content-got-from-Fetcher-tp26528468p26528468.html
> Sent from the Nutch - User mailing list archive at Nabble.com.
>
>

