same thing when there is
charset=ISO-8859-2

2009/9/16 MilleBii <mille...@gmail.com>

> Not sure where to look for explanations:
>
> I have a problem with some Polish pages which I can not index properly on
> the specific polish characters such as :
> &#321;
>
> They are havin the following  charset=windows-1252
>
> Does the HTML parser convert them into their Unicode equivalent ....
>
> --
> -MilleBii-
>



-- 
-MilleBii-

Reply via email to