> as you can see, parse-html is included.

 ok.

i did not made any changes in parse-plugins.xml
> i cannot even find it.

That's normal. It was introduced in nutch-0.8

In fact, there is some content-type management issues in 0.7.1
Mainly, the content-type returned by server was not cleaned and some
extra-parameters (like charset)
disturb the mapping to the good parser ...
(it is solved in 0.8)

Jérôme

--
http://motrech.free.fr/
http://www.frutch.org/

Reply via email to