Rafferty97 opened a new pull request, #9468:
URL: https://github.com/apache/arrow-rs/pull/9468

   Implement support for reading CSV files with charsets other than UTF-8, via 
an optional dependency on `encoding_rs` and a corresponding configuration 
option.
   
   # Which issue does this PR close?
   
   Closes #9465
   
   # What changes are included in this PR?
   
   - Add optional dependency on `encoding_rs`
   - Add configuration option to CSV reader called "encoding"
   - When an encoding is set, input data is pre-processed before being handed 
to `csv-core`
   
   # Are these changes tested?
   
   I haven't had the opportunity to create tests yet, and will probably need 
some guidance on how to do so. I wanted to create a draft PR in the meantime so 
the community could sanity-check my approach.
   
   # Are there any user-facing changes?
   
   The public API is only changed when the new optional feature is enabled, and 
even then, it's just a new optional configuration parameter.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to