zeroshade commented on issue #37976:
URL: https://github.com/apache/arrow/issues/37976#issuecomment-1745331157

   So, for types like string and binary, it's generally pretty easy to figure 
out. In both cases you have your null bitmaps, for String/Binary you're going 
to have 32bit offsets (length + 1 values), LargeString/LargeBinary would have 
64bit offsets (again, length + 1 values). Then you can determine the actual 
character data by simply taking the difference between the offset value for the 
first row and the last row + 1 (which would be the total length of the 
character data for those rows).


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to