Yes, I tried in that configuration file setting with the latin encoding
Windows-1250, but the value of this property does not affect to the encoding
of the content (I also tried with unexistent encoding and the result is the
same...)
property
nameparser.character.encoding.default/name
Santiago Pérez wrote:
Yes, I tried in that configuration file setting with the latin encoding
Windows-1250, but the value of this property does not affect to the encoding
of the content (I also tried with unexistent encoding and the result is the
same...)
property
I had already tried with:
property
nameparser.character.encoding.default/name
valueUTF-8/value
descriptionThe character encoding to fall back to when no other
information
is available/description
/property
and System.out.println(content.toString());
is still the HTML code with the
hi
have you tried to change this property:
parser.character.encoding.default
Hej,
I am a newbie in Nutch and I need some help with a problem because I do
not
find clear documentation.
In crawling proccess when the each of the FetcherThread get the content,
this is in formatted in a