[
https://issues.apache.org/jira/browse/ARROW-4823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17661845#comment-17661845
]
Rok Mihevc commented on ARROW-4823:
-----------------------------------
This issue has been migrated to [issue
#21340|https://github.com/apache/arrow/issues/21340] on GitHub. Please see the
[migration documentation|https://github.com/apache/arrow/issues/14542] for
further details.
> [Python] read_csv shouldn't close file handles it doesn't own
> -------------------------------------------------------------
>
> Key: ARROW-4823
> URL: https://issues.apache.org/jira/browse/ARROW-4823
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 0.12.1
> Reporter: Dave Hirschfeld
> Assignee: Wes McKinney
> Priority: Minor
> Labels: csv, pull-request-available
> Fix For: 0.14.0
>
> Time Spent: 20m
> Remaining Estimate: 0h
>
> If a file-handle is passed into `read_csv` it is automatically closed:
>
> {{In [47]: csv =
> io.BytesIO(b'''issue_date_utc,variable_name,station_name,station_id,value_date_utc,value}}
> {{ ...: 2019-02-26 22:00:00,TEMPERATURE,ARCHERFIELD,040211,2019-02-27
> 03:00,29.1}}
> {{ ...: ''')}}
> {{In [48]: pa.csv.read_csv(csv, convert_options=opts)}}
> {{Out[48]: }}
> {{pyarrow.Table}}
> {{issue_date_utc: timestamp[ns]}}
> {{variable_name: string}}
> {{station_name: string}}
> {{station_id: int64}}
> {{value_date_utc: string}}
> {{value: double}}
> {{In [49]: csv.seek(0)}}
> {{Traceback (most recent call last):}}
> {{ File "<ipython-input-50-0644e6e50712>", line 1, in <module>}}
> {{ csv.seek(0)}}
> {{ValueError: I/O operation on closed file.}}
>
> This behaviour is in contrast to pandas which leaves the file handle open.
> Since the function didn't create the file handle I don't think it should
> close it.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)