zeroshade commented on issue #37976: URL: https://github.com/apache/arrow/issues/37976#issuecomment-1745331157
So, for types like string and binary, it's generally pretty easy to figure out. In both cases you have a null (validity) bitmap. String/Binary use 32-bit offsets (length + 1 values), while LargeString/LargeBinary use 64-bit offsets (again, length + 1 values). You can then determine the size of the actual character data for a range of rows by taking the difference between the offset for the first row and the offset at index (last row + 1); that difference is the total length in bytes of the character data for those rows.
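To make the offset arithmetic concrete, here is a minimal sketch in plain Python. It does not use any Arrow library; the helper names (`data_byte_length`, `validity_bitmap_bytes`, `offsets_bytes`) and the sample column are hypothetical, and the bitmap size ignores any buffer padding or alignment an implementation might add.

```python
def data_byte_length(offsets, first_row, last_row):
    """Bytes of character data covering rows first_row..last_row (inclusive)."""
    # The data for row i lives in [offsets[i], offsets[i + 1]), so a run of rows
    # spans from the first row's offset to the offset just past the last row.
    return offsets[last_row + 1] - offsets[first_row]


def validity_bitmap_bytes(num_rows):
    """One validity bit per row, rounded up to whole bytes (padding ignored)."""
    return (num_rows + 7) // 8


def offsets_bytes(num_rows, large=False):
    """length + 1 offsets: 4 bytes each for String/Binary, 8 for the Large variants."""
    return (num_rows + 1) * (8 if large else 4)


# Hypothetical 4-row string column: ["foo", "", "barbaz", "hi"]
values = [b"foo", b"", b"barbaz", b"hi"]

# Build the offsets buffer: length + 1 entries, starting at 0.
offsets = [0]
for v in values:
    offsets.append(offsets[-1] + len(v))

# Character data needed for rows 1..2 ("" and "barbaz").
slice_bytes = data_byte_length(offsets, 1, 2)
print(offsets, slice_bytes)
```

The same subtraction works on a sliced array whose first offset is not zero, which is why the difference is used rather than the last offset value alone.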
