[
https://issues.apache.org/jira/browse/ARROW-17464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17581544#comment-17581544
]
Antoine Pitrou commented on ARROW-17464:
----------------------------------------
There's already an old JIRA open apparently:
https://issues.apache.org/jira/browse/PARQUET-758
Also, AFAIK format additions are discussed on the parquet dev mailing-list (see
e.g. an unrelated proposal I did so in
https://lists.apache.org/thread/l15qq12v38w9jnkd6p9mdd11kr0nq3gr).
> [C++] Support float16 in writing/reading parquet
> ------------------------------------------------
>
> Key: ARROW-17464
> URL: https://issues.apache.org/jira/browse/ARROW-17464
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++, Parquet, Python
> Reporter: Anja Boskovic
> Assignee: Anja Boskovic
> Priority: Major
> Labels: parquet
>
> Half-float values are not supported in Parquet. Here is a previous issue that
> talks about that: https://issues.apache.org/jira/browse/PARQUET-1647
> So, this will not work:
> {code:java}
> import pyarrow as pa
> import pyarrow.parquet as pq
> import numpy as np
> arr = pa.array(np.float16([0.1, 2.2, 3]))
> table = pa.table({'a': arr})
> pq.write_table(table, "test_halffloat.parquet") {code}
> {{This is a proposal to store float16 values in Parquet as FixedSizeBinary,
> and then restore them to float16 when reading them back in.}}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)