[
https://issues.apache.org/jira/browse/ARROW-1398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16136902#comment-16136902
]
Wes McKinney commented on ARROW-1398:
-------------------------------------
hi [~LudwikB] indeed support for reading these in parquet-cpp has not been
implemented yet. [~cpcloud] has been working on this (there's a patch up to
make the decimal in-memory representation consistent with Java:
https://github.com/apache/arrow/pull/981); I suspect we can get this fully
working within the next month or so. Any help would be appreciated
> No support reading columns of type decimal(19,4)
> ------------------------------------------------
>
> Key: ARROW-1398
> URL: https://issues.apache.org/jira/browse/ARROW-1398
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 0.6.0
> Environment: Ubuntu 16.04, Python 3.6
> Reporter: Ludwik Bielczynski
> Labels: decimal, python
> Fix For: 0.7.0
>
>
> I have a localy saved parquet database created in spark from querrying an SQL
> database. When I run:
> {code:python}
> import pyarrow.parquet as pq
> path = "path/to/parquet/dataset"
> dataset = pq.ParquetDataset(path)
> dataset.read()
> {code}
> an error indicating that there is no support for reading columns of type
> decimal(19,4). It's quite a common type used in SQL databases and I saw in
> the source code that there is an implementation for decimals. I'm stuck
> trying to figuring out a solution. Is there a walk around (conversion of
> decimals to integers during reading)?
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)