According to Konrad Bajor - Magellan Net:
> Hello,
> I've problem with my htDig.
> The engine htsearch change some characters (for example: ³ -> ³ )
> I don't know while htsearch change characters like � to ³
> Is it possible to set same variable in the system?

I assume you're running one of the 3.2 beta releases or snapshots,
rather than the 3.1.6 stable release.

The problem is that htsearch 3.2 converts all characters between 160 and
255 back to SGML entities for ISO-8859-1 characters, which is obviously
wrong when your documents are encoded in a different character set.
The fix will be to add a translate_latin1 attribute to disable these
translations in the htcommon/HtSGMLCodec.cc constructor.  For now, the
only quick fix is to modify this constructor not to do these translations.
The HtSGMLCodec class is new to 3.2, which is why 3.1.6 doesn't have
this problem.  3.1.6 doesn't translate the accented characters back to
SGML entities.  So, if you don't need the fancy new features of 3.2.0b4,
then I recommend sticking to the 3.1.6 stable release.  If you need 3.2,
then you need to hack the HtSGMLCodec constructor.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/
Dept. Physiology, U. of Manitoba  Winnipeg, MB  R3E 3J7  (Canada)


-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to