Hi Dave,
On Tuesday, January 7, 2003 at 08:24:38 [GMT -0800], you wrote:

DC> My own preference is NOT to compress space. People vary in how they
DC> do spacing -- espcially end-of-sentence spacing. Users do,
DC> sometimes, accidentally add spaces they do not want, but it is a
DC> rather uncommon error.

DC> So, the change to the algorithm would simply be to preserve all
DC> spacing.

Sorry, but -- although this may be possible -- it would really lead to
some weird results. Just consider the following 2 paragraphs and tell me
(roughly) an algorithm to reformat them, preserving the spacing:

This is a rather long and pretty senseless sentence just to get a line longer than 80 
characters.
And this is a shorter sentence.

Now we have a second paragraph in a message about the volcanic activity at Mount St.
Helens, which goes a lot more into detail than this one. :-)


The problem is that a programme can't distinguish between the different
kinds of periods used in texts. That is why any algorithm can only
insert *one* word seperator after every period it encounters, because it
doesn't know anything about how the period is used there.


-- 
Regards,
Lars

The Bat! 1.63 Beta/2 on Windows XP 5.1 Build 2600


________________________________________________
Current version is 1.62 | "Using TBUDL" information:
http://www.silverstones.com/thebat/TBUDLInfo.html

Reply via email to