A typical way to refer to a particular Unicode character by code point hex value
is U+xxxx (with more x's if necessary). For example,
2602 * Returns a string whose value is this string, with all leading
2603 * and trailing space removed, where space is defined
2604 * as any character whose codepoint is less than or equal to
2605 * U+0020 (the space character).
It doesn't even need to be the code font.
Oh well, you pushed already. Maybe fix this up in your next change to
String.java.
s'marks
On 5/8/18 6:43 AM, Roger Riggs wrote:
Hi Jim,
I would agree about code points in methods that refer to code points and need a
more
precise notation. However, trim() is not one of them and the alternative 0x
format is quite acceptable.
Would the syntax for raw string literals (not there yet) make the source more
readable?
Roger
On 5/8/2018 9:36 AM, Jim Laskey wrote:
Roger,
You withdrew the comment from the CSR so I assumed that you had changed your
mind.
Stuart, Sherman and Joe have be pushing the use of codepoints versus char (or
ASCII) in new character related comments hence the choice of ‘\unnnn'
notation. Unfortunately, unicode preprocessing vs backslash processing vs
Javadoc does not allow the '\\u0020' in comments (it ends up being '\\u0020’
in the Javadoc) and '\u0020’ just ends up being ‘ ‘.
Cheers,
— Jim
On May 8, 2018, at 10:04 AM, Roger Riggs <roger.ri...@oracle.com> wrote:
Hi Jim,
The use of \u005c in the source makes the source code unreadable.
The more conventional use of the 0x prefix (i.e. 0x0130) is preferred.
Though \u is necessary in some cases, it should be avoided where a more
readable alternative is available.
Thanks, Roger
On 5/8/2018 8:19 AM, Jim Laskey wrote:
Comment change approved in CSR
webrev: http://cr.openjdk.java.net/~jlaskey/8200372/webrev/index.html
JBS: https://bugs.openjdk.java.net/browse/JDK-8200372
CSR: https://bugs.openjdk.java.net/browse/JDK-8196005