Re: feasible smart quote solution

Andrew Dunbar Wed, 14 Aug 2002 21:03:59 -0700

 --- Tomas Frydrych <[EMAIL PROTECTED]>
wrote: > 
> > The solution sounds feasible for actually
> performing the character
> > remapping to screen. A larger and probably more
> interesting problem
> > would be how to identify these smart-glyphs and
> differentiate between
> > these and normal ones (the " inch marker,
> mismatched start/end quote
> > pairs, ...
> > 
> > Do you have any idea of how to feasibly do this?
> Might the unicode
> > website help us?
> 
> There is no way to really differentiate between the
> two kinds of 
> glyphs, in terms of the information presented by the
> glyph codes 
> that problem is not deterministic, i.e., you cannot
> tell if  "21" is a 21 
> in quotation marks, or 21 inches openning a quote,
> or is it?" -- you 
> can only tell that when you know not just the wider
> context but also 
> _meaning_ of the text in that context. Matching
> start and end quotes 
> is equally impossible for the very same reason, as
> long as you 
> cannot tell whether quote is really a quote, you
> cannot tell which 
> quotes should be paired.


This is why I call them "amgiuous quotes".

> Under the circumstances I see only two realistic
> options; (1) no 
> smart quotes at all. This is pretty much the case at
> the moment, and 
> I find it rather unsatisfactory. (2) Assume that all
> straight quotes are 
> quotes or mid-word apostorphies and let the user
> handle the rest 
> manually. If he does not like it, let him turn it
> off.

The main reason we have smart quotes turned off right
now is not because we weren't getting the context
right or anything like that.  The problem was that we
have smart quote code sprinkled all over the place
trying to do the same kind of guessing under various
conditions.  In some cases this proved so difficult
that we had crashes or endless loops.  Undo was the
main problem.  Various attempts were made to fix it
but it was so hairy it kept cropping up.

So what we need is a single point of failure.  We
need a "smart quote engine" which the various bits of
code can call instead of trying to figure it out for
themselves.
We need this engine to be really really simple but to
be language-dependent.
We need to call this engine from as few places as
possible.

> I think as soon as you start striving for something
> "better" than this, 
> i.e., you start assuming that you can work out when
> the quote is not 
> a quote (which you cannot), you will end up with
> behaviour which is 
> less predictable, and consquently more irritating
> for the user.

I agree.  The engine should know what the previous
word
or punctuation is, what word it's in the middle of,
and
what what word or punctuation comes next.  If that's
enough too bad.  At least it won't crash.

Andrew.

> Anyway, that's what I think
> 
> Tomas 

=====
http://linguaphile.sourceforge.net/cgi-bin/translator.pl http://www.abisource.com

__________________________________________________
Do You Yahoo!?
Everything you'll ever need on one web page
from News and Sport to Email and Music Charts
http://uk.my.yahoo.com

Re: feasible smart quote solution

Reply via email to