[
https://issues.apache.org/jira/browse/ARROW-7251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17093465#comment-17093465
]
Antoine Pitrou edited comment on ARROW-7251 at 4/27/20, 12:45 PM:
------------------------------------------------------------------
cc [~saschahofmann] Is there anything that prevents you from recoding the CSV
file before opening it with Arrow?
(what are your constraints? performance? file size?)
With some care, you could even implement a file-like object in Python that
recodes data to UTF-8 on the fly. It should be accepted by {{csv.read_csv}}.
was (Author: pitrou):
cc [~saschahofmann] Is there anything that prevents you from recoding the CSV
file before opening it with Arrow?
(what are you constraints? performance? file size?)
With some care, you could even implement a file-like object in Python that
recodes data to UTF-8 on the fly. It should be accepted by {{csv.read_csv}}.
> [Python] Open CSVs with different encodings
> -------------------------------------------
>
> Key: ARROW-7251
> URL: https://issues.apache.org/jira/browse/ARROW-7251
> Project: Apache Arrow
> Issue Type: Wish
> Components: Python
> Reporter: Sascha Hofmann
> Priority: Major
>
> I would like to open an UTF-16 encoded CSVs (among others) without
> preprocessing in let's say Pandas. Is there maybe a way to do this already ?
--
This message was sent by Atlassian Jira
(v8.3.4#803005)