a5creel commented on issue #2468:
URL: https://github.com/apache/arrow/issues/2468#issuecomment-1234826588

   I've been working on this problem for a week now and am still getting this 
error. Tried rearranging the import order, as well as uninstalling numpy and 
pyarrow. 
   
   It worked on a previous machine and now fails on a new ec2 instance. 
   
   numpy 1.21.6
   
   pyarrow 9.0.0     
   
   ```
   
   myPacker_df = dd.read_parquet('s3://XXX/YYY/summary_i/', engine = "pyarrow") 
 
   ---------------------------------------------------------------------------
   ModuleNotFoundError                       Traceback (most recent call last)
   <ipython-input-2-9934e78b02e8> in <module>
   ----> 1 myPacker_df = 
dd.read_parquet('s3://yse-bioecon/meat_packers/summary_i/', engine = "pyarrow")
   
   ~/anaconda3/lib/python3.7/site-packages/dask/dataframe/io/parquet/core.py in 
read_parquet(path, columns, filters, categories, index, storage_options, 
engine, gather_statistics, split_row_groups, chunksize, **kwargs)
       208 
       209     if isinstance(engine, str):
   --> 210         engine = get_engine(engine)
       211 
       212     if hasattr(path, "name"):
   
   ~/anaconda3/lib/python3.7/site-packages/dask/dataframe/io/parquet/core.py in 
get_engine(engine)
       495     elif engine == "pyarrow" or engine == "arrow":
       496         pa = import_required("pyarrow", "`pyarrow` not installed")
   --> 497         from .arrow import ArrowEngine
       498 
       499         if LooseVersion(pa.__version__) < "0.13.1":
   
   ~/anaconda3/lib/python3.7/site-packages/dask/dataframe/io/parquet/arrow.py 
in <module>
         7 import pyarrow as pa
         8 import pyarrow.parquet as pq
   ----> 9 from pyarrow.compat import guid
        10 from ....utils import natural_sort_key, getargspec
        11 from ..utils import _get_pyarrow_dtypes, _meta_from_dtypes
   
   ModuleNotFoundError: No module named 'pyarrow.compat'
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to