mapleFU commented on issue #5530:
URL: https://github.com/apache/arrow-rs/issues/5530#issuecomment-2051818298

   In parquet, `ARROW:SCHEMA` is used to identify the schema and extended info 
in parquet file. I think the trick point is that, without key-value metadata, 
both view and string are stored as same thing. So I think we can add 
optimization for "read string as stringview", but I also think maybe storing a 
stringview as string can make some legacy reader not confused about it.
   
   For some view, this might work for some encoding( like PLAIN and 
DELTA_LENGTH_BYTE_ARRAY), but might be tricky for DELTA_BYTE_ARRAY.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to