[ 
https://issues.apache.org/jira/browse/ARROW-18254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Antoine Pitrou updated ARROW-18254:
-----------------------------------
    Component/s: Python
                     (was: C++)

> [Python] ArrowInvalid: Expected to read 578488923 metadata bytes, but only 
> read 374478920
> -----------------------------------------------------------------------------------------
>
>                 Key: ARROW-18254
>                 URL: https://issues.apache.org/jira/browse/ARROW-18254
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Python
>            Reporter: jack
>            Priority: Major
>
> The following is the piece of code I am trying to run but it fails with the 
> following error message
> The version of pyarrow is 5.0.0. How do I fix this?
>  
> {code:java}
> //ArrowInvalid                              Traceback (most recent call last)
> <ipython-input-38-9c279286c928> in <module>
>       1 f = '../data/wikidata-20220926-all-ichunk_0.json'
>       2 stream = pa.memory_map(f)
> ----> 3 opened_stream = pa.ipc.open_stream(stream)
>       4 table = 
> opened_stream.read_all()~/anaconda3/lib/python3.8/site-packages/pyarrow/ipc.py
>  in open_stream(source)
>     152     reader : RecordBatchStreamReader
>     153     """
> --> 154     return RecordBatchStreamReader(source)
>     155 
>     156 ~/anaconda3/lib/python3.8/site-packages/pyarrow/ipc.py in 
> __init__(self, source)
>      43 
>      44     def __init__(self, source):
> ---> 45         self._open(source)
>      46 
>      47 ~/anaconda3/lib/python3.8/site-packages/pyarrow/ipc.pxi in 
> pyarrow.lib._RecordBatchStreamReader._open()~/anaconda3/lib/python3.8/site-packages/pyarrow/error.pxi
>  in 
> pyarrow.lib.pyarrow_internal_check_status()~/anaconda3/lib/python3.8/site-packages/pyarrow/error.pxi
>  in pyarrow.lib.check_status()ArrowInvalid: Expected to read 578488923 
> metadata bytes, but only read 374478920{code}
>  
>  
> {code:java}
> f = '../data/wikidata-20220926-all-ichunk_0.json'
> stream = pa.memory_map(f)
> opened_stream = pa.ipc.open_stream(stream)
> table = opened_stream.read_all(){code}
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to