paulirwin opened a new issue, #1027:
URL: https://github.com/apache/lucenenet/issues/1027

   ### Is there an existing issue for this?
   
   - [X] I have searched the existing issues
   
   ### Task description
   
   Java's `StandardCharsets.UTF_8` does not write a Byte-Order Mark (BOM), 
while .NET's `System.Text.Encoding.UTF8` does include a BOM by default. We have 
ensured that the `IOUtils.CHARSET_UTF_8` does not include a BOM to match Java, 
and as part of #1018 we've added an internal Support class to allow for using 
`StandardCharsets.UTF_8`, but we need to review all usage of 
`System.Text.Encoding.UTF8` to determine if it should be replaced with 
`StandardCharsets.UTF_8` or `IOUtils.CHARSET_UTF_8` (whatever best matches the 
corresponding Java Lucene code) to avoid BOM issues.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@lucenenet.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to