[ https://issues.apache.org/jira/browse/ARROW-4836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rok Mihevc updated ARROW-4836: ------------------------------ External issue URL: https://github.com/apache/arrow/issues/21350 > [Python] "Cannot tell() a compressed stream" when using > RecordBatchStreamWriter > ------------------------------------------------------------------------------- > > Key: ARROW-4836 > URL: https://issues.apache.org/jira/browse/ARROW-4836 > Project: Apache Arrow > Issue Type: Bug > Components: Python > Affects Versions: 0.12.1 > Reporter: Mike Pedersen > Assignee: Antoine Pitrou > Priority: Major > Labels: pull-request-available > Fix For: 0.15.0 > > Time Spent: 50m > Remaining Estimate: 0h > > It does not seem like RecordBatchStreamWriter works with compressed streams: > {code:python} > >>> import pyarrow as pa > >>> pa.__version__ > '0.12.1' > >>> stream = pa.output_stream('/tmp/a.gz') > >>> batch = pa.RecordBatch.from_arrays([pa.array([1])], ['a']) > >>> writer = pa.RecordBatchStreamWriter(stream, batch.schema) > >>> writer.write(batch) > Traceback (most recent call last): > File "<stdin>", line 1, in <module> > File "pyarrow/ipc.pxi", line 181, in pyarrow.lib._RecordBatchWriter.write > File "pyarrow/ipc.pxi", line 196, in > pyarrow.lib._RecordBatchWriter.write_batch > File "pyarrow/error.pxi", line 89, in pyarrow.lib.check_status > pyarrow.lib.ArrowNotImplementedError: Cannot tell() a compressed stream > {code} > As I understand the documentation, this should be possible, right? -- This message was sent by Atlassian Jira (v8.20.10#820010)