Rafferty97 opened a new pull request, #9468: URL: https://github.com/apache/arrow-rs/pull/9468
Implement support for reading CSV files with charsets other than UTF-8, via an optional dependency on `encoding_rs` and a corresponding configuration option.

# Which issue does this PR close?

Closes #9465

# What changes are included in this PR?

- Add optional dependency on `encoding_rs`
- Add configuration option to CSV reader called "encoding"
- When an encoding is set, input data is pre-processed before being handed to `csv-core`

# Are these changes tested?

I haven't had the opportunity to create tests yet, and will probably need some guidance on how to do so. I wanted to create a draft PR in the meantime so the community could sanity-check my approach.

# Are there any user-facing changes?

The public API is only changed when the new optional feature is enabled, and even then, it's just a new optional configuration parameter.
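
To illustrate the "decode first, then parse" idea behind the pre-processing step, here is a minimal std-only sketch. It is not the code in this PR: it hard-codes ISO-8859-1 (Latin-1) as a stand-in for whatever `encoding_rs` would decode, and `latin1_to_utf8`, `parse_simple_csv`, and `read_latin1_csv` are hypothetical names. The toy parser stands in for `csv-core` and ignores quoting entirely.

```rust
/// Hypothetical helper: transcode ISO-8859-1 (Latin-1) bytes into a UTF-8 `String`.
/// Every Latin-1 byte maps to the Unicode code point with the same numeric value,
/// so `char::from(u8)` is sufficient. A real implementation would delegate to a
/// general-purpose decoder instead of hard-coding one charset.
fn latin1_to_utf8(input: &[u8]) -> String {
    input.iter().map(|&b| char::from(b)).collect()
}

/// Toy stand-in for a UTF-8-only CSV parser (assumption: no quoting or escaping).
/// Splits records on newlines and fields on commas.
fn parse_simple_csv(utf8: &str) -> Vec<Vec<String>> {
    utf8.lines()
        .map(|line| line.split(',').map(str::to_string).collect())
        .collect()
}

/// Pre-process the raw bytes into UTF-8, then hand the result to the parser.
fn read_latin1_csv(raw: &[u8]) -> Vec<Vec<String>> {
    let decoded = latin1_to_utf8(raw);
    parse_simple_csv(&decoded)
}
```

Calling `read_latin1_csv(b"name,city\nJos\xE9,M\xFCnchen\n")` yields the fields `José` and `München`, even though the raw bytes are not valid UTF-8 on their own. A streaming reader would additionally have to carry decoder state across buffer boundaries for multi-byte encodings, which this single-byte sketch sidesteps.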
