> > Hello, >> >> Recently confronted with another end of line mess (or wealth of >> expressions surrounding it) , I was wondering if any popular programming >> language (like Java?) would recognize U+2028 as an end of line. >> >> For instance, would a [BufferedReader] readLine() interpret U+2028 > > as an end of line ? > > > > Patrick Andries > >
BufferedReader only interprets line feed (U+000A), carriage return (U+000D) or carriage return followed immediately by a linefeed as signalling the end of a line (http://java.sun.com/j2se/1.3/docs/api/java/io/BufferedReader.html#readLine()). The text annotations to U+000A and U+000D in Unicode 3.0 do not refer to U+2028 and do not recommend the use of U+2028 as the preferred character for for text processing in this context. Does the UTC have a recommendation about using U+2028 in preference to the other characters? If not, the problem won't go away as far as I can see. -- Christian Cooke * http://www.fishamble.net

