alamb commented on issue #5530: URL: https://github.com/apache/arrow-rs/issues/5530#issuecomment-2052194848
I think the current writer has the very nice property that RecordBatches round trp cleanly -- they can be written to parquet and then re-read and will be equal. Thus, yy personal preference is that the writer should (by default) write the metadata that matches the data it was passed (so in this case specify `StringView`) I think it would be valuable to add an option on the writer to write in "compatiblity" mode (and store string/binary data with StringArray metadata, for example) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
