On Fri, 23 Mar 2001, Gilles Detillieux wrote:
> According to Jonathan Gardner:
> > Okay, I put all the results in XML format with the templates and such, but
> > occasionally it picks up a bad character in the excerpt, and so a good XML
> > parser dies when it sees it.
> > 
> > How do you filter the results so that only "good" characters are stored?
> 
> What do you consider "good" vs. "bad" characters?  Remember that most of
> us have little or no XML experience, so you need to define your terms
> if we're to understand each other.  htsearch 3.1.5 converts <, >, &,
> '"' and non-breaking space to their SGML entities to prevent problems
> in the HTML output.  Are there others that should get similar treatment,
> or is this another matter altogether?
> 
I think XML parsers choke on characters above 127 decimal, at least when the
document doesn't declare an encoding that covers them. I'm pretty sure I read
that somewhere, but I can't remember where. Isn't ASCII only defined for 0-127,
with 128-255 being OS/computer/platform dependent?
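
For illustration, here is a minimal sketch of the kind of filter in question,
written in Python rather than taken from ht://Dig itself. It assumes the
excerpt arrives as raw bytes in ISO-8859-1 (an assumption; substitute whatever
character set the indexed pages really use). The function name xml_safe is
hypothetical. It drops characters that XML 1.0 forbids outright, escapes the
markup characters, and writes everything above 127 as a numeric character
reference so the output is plain ASCII.

```python
import re
from xml.sax.saxutils import escape

# Anything outside the XML 1.0 "Char" production: most control characters,
# surrogates, U+FFFE and U+FFFF. Tab, LF and CR are allowed and kept.
_INVALID_XML_CHARS = re.compile(
    "[^\u0009\u000A\u000D\u0020-\uD7FF\uE000-\uFFFD\U00010000-\U0010FFFF]"
)


def xml_safe(raw: bytes, encoding: str = "iso-8859-1") -> str:
    """Return an ASCII-only string that is safe to place in XML text."""
    # Assumption: the input encoding is known; undecodable bytes become U+FFFD.
    text = raw.decode(encoding, errors="replace")
    # Remove characters that no XML 1.0 document may contain.
    text = _INVALID_XML_CHARS.sub("", text)
    # Escape &, < and > (plus the double quote, for use in attribute values).
    text = escape(text, {'"': "&quot;"})
    # Replace every character above 127 with a reference such as &#233;.
    return text.encode("ascii", errors="xmlcharrefreplace").decode("ascii")


print(xml_safe(b"caf\xe9 <b> \x00\x07ok"))
```

Because the result is pure ASCII, it parses the same way whether or not the
surrounding XML document declares an encoding.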

-- 
Jonathan Gardner
[EMAIL PROTECTED]
(425)820-2244 x123
