anemohan opened a new issue, #39172:
URL: https://github.com/apache/arrow/issues/39172

   ### Describe the usage question you have. Please include as many useful 
details as  possible.
   
   
   Hello, I use a schema file defining all the column types. I have a column 
with a list of dicts and when I try to call the dataset with this schema I get 
an error stating:
   
   ```
   The original error is below:
   ------------------------
   ArrowNotImplementedError('Unsupported cast from list<item: struct<count: 
int64, gender: string, name: string, probability: double>> to utf8 using 
function cast_string')
   
   Traceback:
   ---------
     File "/opt/conda/lib/python3.10/site-packages/dask/dataframe/utils.py", 
line 193, in raise_on_meta_error
       yield
     File "/opt/conda/lib/python3.10/site-packages/dask/dataframe/core.py", 
line 6897, in _emulate
       return func(*_extract_meta(args, True), **_extract_meta(kwargs, True))
     File 
"/opt/conda/lib/python3.10/site-packages/intakewrapper/MHParquetSource.py", 
line 67, in __call__
       df = pd.read_parquet(
     File "/opt/conda/lib/python3.10/site-packages/pandas/io/parquet.py", line 
509, in read_parquet
       return impl.read(
     File "/opt/conda/lib/python3.10/site-packages/pandas/io/parquet.py", line 
227, in read
       pa_table = self.api.parquet.read_table(
     File 
"/opt/conda/lib/python3.10/site-packages/pyarrow/parquet/__init__.py", line 
2780, in read_table
       return dataset.read(columns=columns, use_threads=use_threads,
     File 
"/opt/conda/lib/python3.10/site-packages/pyarrow/parquet/__init__.py", line 
2443, in read
       table = self._dataset.to_table(
     File "pyarrow/_dataset.pyx", line 304, in pyarrow._dataset.Dataset.to_table
     File "pyarrow/_dataset.pyx", line 2549, in 
pyarrow._dataset.Scanner.to_table
     File "pyarrow/error.pxi", line 144, in 
pyarrow.lib.pyarrow_internal_check_status
     File "pyarrow/error.pxi", line 121, in pyarrow.lib.check_status
   ```
   
   Can someone help me with this?
   
   ### Component(s)
   
   Python


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to