[ 
https://issues.apache.org/jira/browse/NETBEANS-6233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nicolas Richard closed NETBEANS-6233.
-------------------------------------
    Fix Version/s: 14
       Resolution: Fixed

I cannot reproduce the bug with NetBeans 14. Closing this bug report.

> Netbeans console input has unknown encoding
> -------------------------------------------
>
>                 Key: NETBEANS-6233
>                 URL: https://issues.apache.org/jira/browse/NETBEANS-6233
>             Project: NetBeans
>          Issue Type: Bug
>    Affects Versions: 11.0, 12.5
>         Environment: NetBeans 12.5 on Linux (Ubuntu 20.04).
> Maven 3.6.3 (bundled); I also tried Maven 3.8.4.
> Java is OpenJDK 11; I also tried OpenJDK 15.0.2 (added as a Java 
> platform from within NetBeans).
>  
>            Reporter: Nicolas Richard
>            Priority: Major
>             Fix For: 14
>
>
> I'm trying to understand which charset is used when typing into the 
> NetBeans console. I expected it to follow my current locale (UTF-8 on 
> my system), but it clearly does not. I then tried to work out which 
> charset it actually is, and the answer seems to be: none. Here are my 
> findings.
> Using NetBeans 12.5 + Maven and the following code:
> {code:java}
>         final InputStream IN = System.in;
>         do {
>             System.out.println("Byte: " + IN.read());
>         } while (IN.available() > 0);
> {code}
> When I enter "€" in the NetBeans console, I get the following unexpected 
> output ("10" is just the newline character):
> {noformat}
> Byte: 172
> Byte: 10{noformat}
> This is unexpected because € is never encoded as 172 (0xac) alone. In UTF-8 it 
> is three bytes (0xe2 0x82 0xac), and in UTF-16 it is two (0x20 0xac).
> Similarly, entering 𐐷 (DESERET SMALL LETTER YEE), I get something unexpected:
> {noformat}
> Byte: 1
> Byte: 55
> Byte: 10{noformat}
> In other words, these are 0x01 0x37. In UTF-8 it should be 0xf0 0x90 0x90 0xb7; 
> in UTF-16 it should be 0xd8 0x01 0xdc 0x37.
> I'm on Linux and my locale (as reported by the command {_}locale{_}) is UTF-8, 
> but these results look like "half UTF-16": it is like UTF-16, but with the 
> high byte of each 16-bit code unit missing.
> If I run the same code in an Ant project or a Gradle project, it works 
> fine for the symbol €, and the bytes reported are consistent with my UTF-8 
> locale.
> However, if I enter 𐐷 (DESERET SMALL LETTER YEE), then:
>  * with the Gradle project, it outputs "Byte: -1" (no other bytes reported), 
> and
>  * with the Ant project, it outputs nothing and the program does not seem to 
> stop; I have to abort the Run manually.
>  
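
The byte values quoted in the description can be cross-checked with a short standalone program. This is only a sketch: the class name EncodingCheck and the helper lowBytesOfUtf16 are hypothetical, and the "low byte" line merely models the "half UTF-16" guess from the report (keep the low 8 bits of each UTF-16 code unit). It is not taken from the NetBeans sources and does not claim to show what the console actually did.

```java
import java.nio.charset.StandardCharsets;

public class EncodingCheck {
    // Format bytes as space-separated hex, for example "0xe2 0x82 0xac".
    static String hex(byte[] bytes) {
        StringBuilder sb = new StringBuilder();
        for (byte b : bytes) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(String.format("0x%02x", b & 0xff));
        }
        return sb.toString();
    }

    // Hypothetical model of the observed output (an assumption, not NetBeans code):
    // keep only the low 8 bits of each UTF-16 code unit of the string.
    static byte[] lowBytesOfUtf16(String s) {
        byte[] out = new byte[s.length()];
        for (int i = 0; i < s.length(); i++) {
            out[i] = (byte) s.charAt(i); // narrowing cast drops the high byte
        }
        return out;
    }

    public static void main(String[] args) {
        // EURO SIGN (U+20AC) and DESERET SMALL LETTER YEE (U+10437, a surrogate pair)
        String[] samples = { "\u20ac", "\ud801\udc37" };
        for (String s : samples) {
            System.out.println("UTF-8:    " + hex(s.getBytes(StandardCharsets.UTF_8)));
            System.out.println("UTF-16BE: " + hex(s.getBytes(StandardCharsets.UTF_16BE)));
            System.out.println("low byte: " + hex(lowBytesOfUtf16(s)));
        }
    }
}
```

Under that assumption, the "low byte" lines come out as 0xac for the euro sign and 0x01 0x37 for the Deseret letter, which matches the decimal values 172 and 1, 55 reported from the Maven run.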



--
This message was sent by Atlassian Jira
(v8.20.7#820007)
