Jovann Kung created ARROW-1599:
----------------------------------

             Summary: PyArrow unable to read Parquet files with vector as column
                 Key: ARROW-1599
                 URL: https://issues.apache.org/jira/browse/ARROW-1599
             Project: Apache Arrow
          Issue Type: Bug
          Components: Python
    Affects Versions: 0.7.0
         Environment: Ubuntu
            Reporter: Jovann Kung
            Priority: Critical


Is PyArrow currently unable to read in Parquet files with a vector as a column? 
For example, the schema of such a file is below:

{{<pyarrow._parquet.ParquetSchema object at 0x7f2d42493c88>
mbc: FLOAT
deltae: FLOAT
labels: FLOAT
features.type: INT32 INT_8
features.size: INT32
features.indices.list.element: INT32
features.values.list.element: DOUBLE}}

Using either pq.read_table() or pq.ParquetDataset('/path/to/parquet').read() 
yields the following error: ArrowNotImplementedError: Currently only nesting 
with Lists is supported.

>From the error I assume that this may be implemented in further releases?




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to