On Wed, 19 Nov 2003, Gilles Detillieux wrote:
> According to Neal Richter:
> > >   This error is happening in the DISPLAY of the excerpts... so it
> > > seems like looking for &#XXX; patterns and NOT encoding them before
> > > display is a reasonable strategy... the browser will decide how to display it.
>
> That would be a reasonable compromise, but note that it is a compromise.
> For example, if an HTML document has something like "use ™ in
> your HTML to encode a ™ character", this will end up in db.excerpts
> as "use ™ in your HTML to encode a ™ character".  At that point,
> htsearch has no way of knowing that the first occurrence was originally
> different than the second.  It comes down to a decision between encoding
> both or leaving both as-is.

  Eh... why not explicitly look for patterns like '™' and leave
them as-is?
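
A minimal sketch of that idea, in Python rather than ht://Dig's own C++ and not taken from the htsearch source: escape the excerpt for display, but pass anything that already looks like an entity reference through untouched. The names `ENTITY_RE` and `encode_excerpt` are hypothetical, and the pattern for what counts as an entity is an assumption.

```python
import html
import re

# Assumed definition of "looks like an entity": numeric (decimal or hex)
# or named references, terminated by a semicolon.
ENTITY_RE = re.compile(r"&(?:#[0-9]+|#[xX][0-9a-fA-F]+|[A-Za-z][A-Za-z0-9]*);")

def encode_excerpt(text):
    """Escape text for HTML display, leaving entity-like patterns as-is."""
    pieces = []
    pos = 0
    for match in ENTITY_RE.finditer(text):
        # Escape the plain text that precedes this entity-like pattern.
        pieces.append(html.escape(text[pos:match.start()], quote=False))
        # Copy the entity-like pattern through unchanged.
        pieces.append(match.group(0))
        pos = match.end()
    # Escape whatever follows the last match.
    pieces.append(html.escape(text[pos:], quote=False))
    return "".join(pieces)
```

A bare ampersand such as the one in "AT&T" is still escaped, while an entity-like pattern reaches the browser unchanged. The compromise Gilles describes remains: text that was a literal entity name in the original document is passed through too, so the browser renders it as the character.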

  For the purpose of excerpts... I think we may not need to do encoding at
all... so that there is no conflict between store and display.

  Encoding SGML entities is beneficial for searchability via the
db.words.db, but I don't see how it is a benefit for db.excerpts.

  I don't want to go tearing up code that is there for a reason.... please
elaborate.

> The other option would be for htdig to replace the & lead-in character
> for undecoded entities into some other, non-ambiguous lead-in character in
> the database, so that htsearch could always distinguish between the two.
> But what character could we use, that wouldn't conflict with anything
> else?
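
The alternate lead-in idea can be sketched roughly as follows, again in Python and purely as an illustration. The sentinel `"\x01"` is an assumed answer to the "what character" question, not something ht://Dig uses, and `mark_undecoded`, `clean_text` and `render_excerpt` are hypothetical names.

```python
import html

# Assumption: a control character that should never occur in indexed text.
SENTINEL = "\x01"

def clean_text(text):
    """Indexing side: drop any stray sentinel so stored text stays unambiguous."""
    return text.replace(SENTINEL, "")

def mark_undecoded(entity):
    """Indexing side: swap the '&' of an undecoded entity for the sentinel."""
    return SENTINEL + entity[1:]

def render_excerpt(stored):
    """Search side: escape everything, then restore marked entities."""
    return html.escape(stored, quote=False).replace(SENTINEL, "&")
```

With this scheme an ordinary ampersand in the stored text is always encoded on display, and only the entities that were marked at indexing time come back with a real `&`, so the two cases stay distinguishable.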

  For that matter we could be storing excerpts marked up via XML and
process this XML as appropriate during display.

  A bigger project would be to make the entire search-query process
produce an XML document that we could render to HTML via XSLT.  This would
allow pretty magnificent user customization of the search results.
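
As a rough illustration of the XML side only, here is a sketch using Python's standard `xml.etree.ElementTree`. The element names (`searchresults`, `match`, `url`, `excerpt`) and the function `build_results_xml` are hypothetical; applying an XSLT stylesheet would need a separate XSLT processor, which is not shown.

```python
import xml.etree.ElementTree as ET

def build_results_xml(matches):
    """Build a results document from (url, excerpt) pairs.

    The serializer escapes '&', '<' and '>' in text nodes, so the
    stored excerpt needs no display-specific encoding of its own.
    """
    root = ET.Element("searchresults")
    for url, excerpt in matches:
        match = ET.SubElement(root, "match")
        ET.SubElement(match, "url").text = url
        ET.SubElement(match, "excerpt").text = excerpt
    return ET.tostring(root, encoding="unicode")
```

The point of the design is that escaping happens once, at serialization, and the rendering step decides how the excerpt is finally displayed.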

Neal Richter
Knowledgebase Developer
RightNow Technologies, Inc.
Customer Service for Every Web Site
Office: 406-522-1485



