[
https://issues.apache.org/jira/browse/ARROW-7251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16983531#comment-16983531
]
Antoine Pitrou commented on ARROW-7251:
---------------------------------------
The default is not None:
{code:python}
>>> from pyarrow import csv
>>> opts = csv.ReadOptions()
>>> opts.block_size
1048576
{code}
By changing this value you can control the size of the chunks. Note, however,
that this affects performance, especially in parallel mode.
> [Python] Open CSVs with different encodings
> -------------------------------------------
>
> Key: ARROW-7251
> URL: https://issues.apache.org/jira/browse/ARROW-7251
> Project: Apache Arrow
> Issue Type: Wish
> Components: Python
> Reporter: Sascha Hofmann
> Priority: Major
>
> I would like to open UTF-16 encoded CSVs (among others) without
> preprocessing in, let's say, Pandas. Is there maybe a way to do this already?