Joseph Boyle <Boyle at siebel dot com> wrote:

> Newline problems are a good analogy. They still require bookkeeping of
> different formats and attention in any new coding and cause new bugs,
> even though the problem has been around for decades. Nobody is holding
> their breath for any of the platforms to change their newline
> convention to match the others or even update all their tools to deal
> with the differences - bare LF still doesn't work in Notepad.

Of the hundreds of little utility programs I've written over the past 10
years or so, one of the ones I still use most often is FIXCRLF, which
(as you might expect) converts files between different CR/LF
conventions.  I have to; most text files downloaded from the Internet
are LF, but most DOS/Windows tools demand CRLF.  It's a shame, but
hardly a surprise, that the industry could never standardize on one or
the other.

The invention of U+2028 LINE SEPARATOR was supposed to relieve us of all
this misery -- but ironically, the success of UTF-8 has probably killed
LS for good.  Not only do people now expect Unicode text files to be
backward-compatible with ASCII, which favors CR and/or LF instead of LS,
but the single character LS requires more bytes in UTF-8 than the two
characters CR and LF.

-Doug Ewell
 Fullerton, California


Reply via email to