[
https://issues.apache.org/jira/browse/ARROW-18254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Antoine Pitrou updated ARROW-18254:
-----------------------------------
Component/s: Python
(was: C++)
> [Python] ArrowInvalid: Expected to read 578488923 metadata bytes, but only
> read 374478920
> -----------------------------------------------------------------------------------------
>
> Key: ARROW-18254
> URL: https://issues.apache.org/jira/browse/ARROW-18254
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Reporter: jack
> Priority: Major
>
> The following is the piece of code I am trying to run but it fails with the
> following error message
> The version of pyarrow is 5.0.0. How do I fix this?
>
> {code:java}
> //ArrowInvalid Traceback (most recent call last)
> <ipython-input-38-9c279286c928> in <module>
> 1 f = '../data/wikidata-20220926-all-ichunk_0.json'
> 2 stream = pa.memory_map(f)
> ----> 3 opened_stream = pa.ipc.open_stream(stream)
> 4 table =
> opened_stream.read_all()~/anaconda3/lib/python3.8/site-packages/pyarrow/ipc.py
> in open_stream(source)
> 152 reader : RecordBatchStreamReader
> 153 """
> --> 154 return RecordBatchStreamReader(source)
> 155
> 156 ~/anaconda3/lib/python3.8/site-packages/pyarrow/ipc.py in
> __init__(self, source)
> 43
> 44 def __init__(self, source):
> ---> 45 self._open(source)
> 46
> 47 ~/anaconda3/lib/python3.8/site-packages/pyarrow/ipc.pxi in
> pyarrow.lib._RecordBatchStreamReader._open()~/anaconda3/lib/python3.8/site-packages/pyarrow/error.pxi
> in
> pyarrow.lib.pyarrow_internal_check_status()~/anaconda3/lib/python3.8/site-packages/pyarrow/error.pxi
> in pyarrow.lib.check_status()ArrowInvalid: Expected to read 578488923
> metadata bytes, but only read 374478920{code}
>
>
> {code:java}
> f = '../data/wikidata-20220926-all-ichunk_0.json'
> stream = pa.memory_map(f)
> opened_stream = pa.ipc.open_stream(stream)
> table = opened_stream.read_all(){code}
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)