Re: Encoding the content got from Fetcher

2009-11-27 Thread Santiago Pérez
Yes, I tried setting the Latin encoding Windows-1250 in that configuration file, but the value of this property does not affect the encoding of the content (I also tried a nonexistent encoding and the result is the same...) <property> <name>parser.character.encoding.default</name>

Re: Encoding the content got from Fetcher

2009-11-27 Thread Andrzej Bialecki
Santiago Pérez wrote: Yes, I tried setting the Latin encoding Windows-1250 in that configuration file, but the value of this property does not affect the encoding of the content (I also tried a nonexistent encoding and the result is the same...) <property>

Re: Encoding the content got from Fetcher

2009-11-27 Thread Santiago Pérez
I had already tried with:
  <property>
    <name>parser.character.encoding.default</name>
    <value>UTF-8</value>
    <description>The character encoding to fall back to when no other information is available</description>
  </property>
and the output of System.out.println(content.toString()); is still the HTML code with the
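
For illustration, the effect described above (the printed text not changing whatever the fallback property is set to) is what one would expect if the fetched bytes are turned into a string without naming a charset. The following is a minimal sketch in plain Java, not Nutch code: it assumes the raw page bytes are available as a byte[], and the class name DecodeSketch and the method decode are hypothetical names introduced only for this example.

```java
import java.nio.charset.Charset;

public class DecodeSketch {
    // Hypothetical helper: decode raw page bytes with an explicitly named
    // charset instead of relying on the platform default.
    static String decode(byte[] raw, String charsetName) {
        return new String(raw, Charset.forName(charsetName));
    }

    public static void main(String[] args) {
        // The single byte 0xB3 maps to different characters depending on
        // the charset used for decoding (windows-1250 vs ISO-8859-1).
        byte[] raw = {(byte) 0xB3};
        String asCp1250 = decode(raw, "windows-1250");
        String asLatin1 = decode(raw, "ISO-8859-1");
        // Print the code points rather than the characters so the result
        // does not depend on the console encoding.
        System.out.println(Integer.toHexString(asCp1250.codePointAt(0)));
        System.out.println(Integer.toHexString(asLatin1.codePointAt(0)));
    }
}
```

The same bytes give different text under different charsets, so the charset has to be chosen at the point where the bytes are decoded. Whether the Fetcher content in this thread is decoded this way is an assumption, not something the messages confirm.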

Re: Encoding the content got from Fetcher

2009-11-26 Thread fadzi
Hi, have you tried changing this property: parser.character.encoding.default? Hi, I am a newbie to Nutch and I need some help with a problem because I cannot find clear documentation. In the crawling process, when each FetcherThread gets the content, it is formatted in a