svilupp commented on PR #399: URL: https://github.com/apache/arrow-julia/pull/399#issuecomment-1465238006
Two additional thoughts on InlineStrings: - We could add support for native serialization/deserialization of InlineStrings (but it wouldn't be readable in other languages / would look like Integers) - At the moment I find the smallest suitable type by searches the _whole_vector of offsets which is expensive (c. 1ms for 10M elements). We could try sampling only a subset, but we would have to introduce some safety checks if suddenly a bigger String shows up. I decided to pay the cost upfront and make sure no strings are larger. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
