In cudf 0.18.0 (https://github.com/rapidsai/cudf/releases/tag/v0.18.0) you are looking at pyarrow 1.0.1 I believe ( https://github.com/rapidsai/cudf/blob/20778e5ddb7470845605acd7f879620eb25ff4ff/conda/environments/cudf_dev_cuda11.0.yml#L21 ) There are many API changes between 1.0.1 --> 6.0.0
On Thu, Mar 3, 2022 at 11:00 AM Cindy McMullen <cmcmul...@twitter.com> wrote: > Hi, Niranda - > > We generally use 'pip', since 'conda' is not configured in our env to > reach external repositories. We need 'pip' to install 'dgl' (Deep Graph > Library). > From pip: > > (base) jupyter@cmcmullen-usersims-gdl-cuda:~$ pip freeze | grep cudf > cudf==0.18.0a0+253.g53ed28e91c > cudf-kafka==0.18.0a0+253.g53ed28e91c > dask-cudf==0.18.0a0+253.g53ed28e91c > > -------- > From conda: > > conda list | grep cudf > cudf 0.18.0a210224 cuda_11.0_py37_g53ed28e91c_253 > rapidsai-nightly > cudf_kafka 0.18.0a210224 py37_g53ed28e91c_253 > rapidsai-nightly > dask-cudf 0.18.0a210224 py37_g53ed28e91c_253 > rapidsai-nightly > libcudf 0.18.0a210224 cuda11.0_g1544474166_254 > rapidsai-nightly > libcudf_kafka 0.18.0a210224 g1544474166_254 > rapidsai-nightly > ------- > > On Wed, Mar 2, 2022 at 8:10 PM Niranda Perera <niranda.per...@gmail.com> > wrote: > >> nvcc is the Nvidia c compiler. Can you try >> >> conda list | grep cudf >> >> On Wed, Mar 2, 2022, 19:55 Cindy McMullen <cmcmul...@twitter.com> wrote: >> >>> /usr/local/cuda/bin/nvcc --version >>> nvcc: NVIDIA (R) Cuda compiler driver >>> Copyright (c) 2005-2020 NVIDIA Corporation >>> Built on Thu_Jun_11_22:26:38_PDT_2020 >>> Cuda compilation tools, release 11.0, V11.0.194 >>> Build cuda_11.0_bu.TC445_37.28540450_0 >>> >>> >>> On Wed, Mar 2, 2022 at 5:54 PM Niranda Perera <niranda.per...@gmail.com> >>> wrote: >>> >>>> What's your cudf version? >>>> >>>> >>>> On Wed, Mar 2, 2022 at 7:44 PM Cindy McMullen <cmcmul...@twitter.com> >>>> wrote: >>>> >>>>> Same issue. >>>>> >>>>> >>>>> Installing collected packages: pyarrow >>>>> Attempting uninstall: pyarrow >>>>> Found existing installation: pyarrow 7.0.0 >>>>> Uninstalling pyarrow-7.0.0: >>>>> Successfully uninstalled pyarrow-7.0.0 >>>>> Successfully installed pyarrow-6.0.1 >>>>> >>>>> .... >>>>> >>>>> cudf/_lib/gpuarrow.pyx in init cudf._lib.gpuarrow() >>>>> AttributeError: module 'pyarrow.lib' has no attribute >>>>> '_CRecordBatchReader' >>>>> >>>>> >>>>> >>>>> On Wed, Mar 2, 2022 at 5:34 PM Niranda Perera < >>>>> niranda.per...@gmail.com> wrote: >>>>> >>>>>> Hi Cindy, >>>>>> I don't think cudf supports arrow 7.0.0. Even the master branch is >>>>>> going with 6.0.1 at the moment. >>>>>> >>>>>> https://github.com/rapidsai/cudf/blob/6bcfc104051f926f46467fc55c456a9b012fc4af/conda/environments/cudf_dev_cuda11.5.yml#L20 >>>>>> >>>>>> On Wed, Mar 2, 2022 at 7:21 PM Cindy McMullen <cmcmul...@twitter.com> >>>>>> wrote: >>>>>> >>>>>>> pyarrow-7.0.0 >>>>>>> >>>>>>> >>>>>>> /usr/local/cuda/bin/nvcc --version >>>>>>> nvcc: NVIDIA (R) Cuda compiler driver >>>>>>> Copyright (c) 2005-2020 NVIDIA Corporation >>>>>>> Built on Thu_Jun_11_22:26:38_PDT_2020 >>>>>>> Cuda compilation tools, release 11.0, V11.0.194 >>>>>>> Build cuda_11.0_bu.TC445_37.28540450_0 >>>>>>> >>>>>>> >>>>>>> >>>>>>> On Wed, Mar 2, 2022 at 9:09 AM Keith Kraus <keith.j.kr...@gmail.com> >>>>>>> wrote: >>>>>>> >>>>>>>> Hey Cindy, >>>>>>>> >>>>>>>> What versions of cuDF and PyArrow do you have installed? I've >>>>>>>> typically seen this pop up when there's a mismatched version. >>>>>>>> >>>>>>>> -Keith >>>>>>>> >>>>>>>> On Tue, Mar 1, 2022 at 8:28 PM Cindy McMullen < >>>>>>>> cmcmul...@twitter.com> wrote: >>>>>>>> >>>>>>>>> Hi - >>>>>>>>> >>>>>>>>> I'm trying to use DGL (Deep Graph Library) DGLDataset API with the >>>>>>>>> RAPIDS cuda DataFrame API. Am getting this error: >>>>>>>>> >>>>>>>>> module 'pyarrow.lib' has no attribute '_CRecordBatchReader' >>>>>>>>> >>>>>>>>> >>>>>>>>> Wonder if you see anything obvious in the stack trace that might help >>>>>>>>> me debug? >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> Here's the full stack trace: >>>>>>>>> >>>>>>>>> >>>>>>>>> <ipython-input-5-81f2b1833437> in __init__(self) 10 """ 11 >>>>>>>>> def __init__(self):---> 12 super(UserSimsSingleFileDataset, >>>>>>>>> self).__init__(name='UserSimsDataset', verbose=False) 13 14 >>>>>>>>> # One quirk of DGLDataset is that process() and __len__ (load, >>>>>>>>> save) are called immediately after super(), >>>>>>>>> /opt/conda/lib/python3.7/site-packages/dgl/data/dgl_dataset.py in >>>>>>>>> __init__(self, name, url, raw_dir, save_dir, hash_key, force_reload, >>>>>>>>> verbose) 91 self._save_dir = save_dir 92 ---> 93 >>>>>>>>> self._load() 94 95 def download(self): >>>>>>>>> /opt/conda/lib/python3.7/site-packages/dgl/data/dgl_dataset.py in >>>>>>>>> _load(self) 176 if not load_flag: 177 >>>>>>>>> self._download()--> 178 self.process() 179 >>>>>>>>> self.save() 180 if self.verbose: >>>>>>>>> <ipython-input-5-81f2b1833437> in process(self) 20 21 >>>>>>>>> import gcsfs---> 22 import cudf 23 self.rows_per_batch = >>>>>>>>> 10000 24 gs = gcsfs.GCSFileSystem() >>>>>>>>> /opt/conda/lib/python3.7/site-packages/cudf/__init__.py in <module> >>>>>>>>> 9 import rmm 10 ---> 11 from cudf import core, datasets, >>>>>>>>> testing 12 from cudf._version import get_versions 13 from >>>>>>>>> cudf.api.extensions import ( >>>>>>>>> /opt/conda/lib/python3.7/site-packages/cudf/core/__init__.py in >>>>>>>>> <module> 1 # Copyright (c) 2018-2020, NVIDIA CORPORATION. 2 >>>>>>>>> ----> 3 from cudf.core import buffer, column, column_accessor, common >>>>>>>>> 4 from cudf.core.buffer import Buffer 5 from >>>>>>>>> cudf.core.dataframe import DataFrame, from_pandas, merge >>>>>>>>> /opt/conda/lib/python3.7/site-packages/cudf/core/column/__init__.py >>>>>>>>> in <module> 1 # Copyright (c) 2020-2021, NVIDIA CORPORATION. >>>>>>>>> 2 ----> 3 from cudf.core.column.categorical import CategoricalColumn >>>>>>>>> 4 from cudf.core.column.column import ( 5 ColumnBase, >>>>>>>>> /opt/conda/lib/python3.7/site-packages/cudf/core/column/categorical.py >>>>>>>>> in <module> 20 21 import cudf---> 22 from cudf import _lib >>>>>>>>> as libcudf 23 from cudf._lib.scalar import as_device_scalar >>>>>>>>> 24 from cudf._lib.transform import bools_to_mask >>>>>>>>> /opt/conda/lib/python3.7/site-packages/cudf/_lib/__init__.py in >>>>>>>>> <module> 2 import numpy as np 3 ----> 4 from . import ( >>>>>>>>> 5 avro, 6 binaryop, >>>>>>>>> cudf/_lib/gpuarrow.pyx in init cudf._lib.gpuarrow() >>>>>>>>> AttributeError: module 'pyarrow.lib' has no attribute >>>>>>>>> '_CRecordBatchReader' >>>>>>>>> >>>>>>>>> >>>>>> >>>>>> -- >>>>>> Niranda Perera >>>>>> https://niranda.dev/ >>>>>> @n1r44 <https://twitter.com/N1R44> >>>>>> >>>>>> >>>> >>>> -- >>>> Niranda Perera >>>> https://niranda.dev/ >>>> @n1r44 <https://twitter.com/N1R44> >>>> >>>> -- Niranda Perera https://niranda.dev/ @n1r44 <https://twitter.com/N1R44>