[ https://issues.apache.org/jira/browse/NETBEANS-6233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Nicolas Richard closed NETBEANS-6233. ------------------------------------- Fix Version/s: 14 Resolution: Fixed I can not reproduce the bug with Netbeans 14. Closing this bug report. > Netbeans console input has unknown encoding > ------------------------------------------- > > Key: NETBEANS-6233 > URL: https://issues.apache.org/jira/browse/NETBEANS-6233 > Project: NetBeans > Issue Type: Bug > Affects Versions: 11.0, 12.5 > Environment: I'm using Netbeans 12.5 on a Linux (Ubuntu 20.04) > platform > Maven is Maven 3.6.3 (bundled) – I also tried with Maven 3.8.4 > and Java is Open JDK 11 – I also tried running with Open JDK 15.0.2 (by > adding that as a Java platform from within Netbeans) > > Reporter: Nicolas Richard > Priority: Major > Fix For: 14 > > > I'm trying to understand what charset is being used when typing into the > Netbeans console. I expected it to obey my current locale (which is UTF8 on > my system), but it obviously isn't doing that. I then set on trying to > understand what charset it is. And the answer seems to be : none. Here are my > findings : > Using Netbeans 12.5 + Maven and the following code : > {code:java} > final InputStream IN = System.in; > do { > System.out.println("Byte: " + IN.read()); > } while (IN.available() > 0); > {code} > When I enter "€" in the Netbeans console, I get the following unexpected > output ("10" is just the newline char): > {noformat} > Byte: 172 > Byte: 10{noformat} > It is unexpected because € is never encoded as 172 (0xac) alone. In UTF8 it > is three bytes (0xe2 0x82 0xac), and in UTF16 it is two (0x20 0xac) > Similarly, entering 𐐷 (DESERET SMALL LETTER YEE), I get something unexpected: > {noformat} > Byte: 1 > Byte: 55 > Byte: 10{noformat} > IOW these are 0x01 0x37. In UTF8, it should be 0xf0 0x90 0x90 0xb7 – in UTF16 > it should be 0xd8 0x01 0xdc 0x37 > I'm on Linux, my locale (as reported by the command {_}locale{_}) is UTF8, > but these results look like the encoding is "half UTF16" : it's like UTF16 > but every other byte is missing. > If I run the same code within an Ant project or a Gradle project it works > fine for the symbol €, and the bytes reported are consistent with my UTF8 > locale ; > but if I enter 𐐷 (DESERET SMALL LETTER YEE), then : > * with the Gradle project, it outputs "Byte: -1" (no other bytes reported), > and > * with the Ant project, it outputs nothing and the program does not seem to > stop, I have to manually abort the Run. > -- This message was sent by Atlassian Jira (v8.20.7#820007) --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@netbeans.apache.org For additional commands, e-mail: commits-h...@netbeans.apache.org For further information about the NetBeans mailing lists, visit: https://cwiki.apache.org/confluence/display/NETBEANS/Mailing+lists