On Mon, 6 Jan 2003, Mike Holderness wrote:

> Looks as though I'm going to have to read the code to be 
> sure, but so far it seems from the FAQ and Attrs.html that 
> there may still be a (very minor) inconsistency. 
> 
> Some HTML 4.1 entities (∥, £...) will be included 
> in excerpts as singly characters and therefore won't display 
> in Opera or Mozilla. 

At the moment, the code can deal with some HTML entities, but as no one
has stepped forward to deal with some of the newer standards, esp. in
regards to entities, there are undoubtedly a great deal of problems like
this.

> Will others (“ etc) continue to be included in 
> excerpts as “ (& a m p ; l d q u o ;)?

Any HTML entity that is not part of the recognized list will show this
bug. If you have a suggestion as to which entities should be transformed
into the appropriate localized character set (i.e. accents) and which
should be ignored, please let us know or point us to an appropriate URL.

I (for one) didn't know that there even was an HTML 4.1 standard. Have a
URL?

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/



-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to