Re: RFR: 8369564: Provide a MemorySegment API to read strings with known lengths [v7]

Liam Miller-Cushon Thu, 20 Nov 2025 01:00:47 -0800

On Wed, 19 Nov 2025 14:45:52 GMT, Maurizio Cimadamore <[email protected]> 
wrote:


>> src/java.base/share/classes/java/lang/String.java line 2030:
>> 
>>> 2028:     }
>>> 2029: 
>>> 2030:     void copyToSegmentRaw(MemorySegment segment, long offset, int 
>>> srcIndex, int numChars) {
>> 
>> This method takes an index, expressed in chars, and uses that as a byte 
>> offset in a bulk copy operation. I don't think this is correct. E.g. if the 
>> string is UTF16 (and not LATIN1), there is a scaling factor to be applied?
>
> In other words, it seems to me that here we have hardwired the knowledge that 
> we can only get here is the string is latin1. I don't think this was the 
> original intent of this method -- however, if that's the case, we should also 
> add an assertion to avoid misuse.

Thanks for catching this. For `copyToSegmentRaw`, I have updated the parameter 
names to not refer to chars.

I have also tentatively added an assertion to `copyToSegmentRaw` to only 
support latin1 strings, which could be relaxed if `bytesCompatible` is updated 
to handle UTF-16

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/28043#discussion_r2544854187

Re: RFR: 8369564: Provide a MemorySegment API to read strings with known lengths [v7]

Reply via email to