Jakub Kukul created PARQUET-660:
-----------------------------------
Summary: Writing Protobuf messages with extensions results in an
error or data corruption.
Key: PARQUET-660
URL: https://issues.apache.org/jira/browse/PARQUET-660
Project: Parquet
Issue Type: Bug
Components: parquet-mr
Affects Versions: 1.8.1
Reporter: Jakub Kukul
Currently, in ProtoParquetWriter there's not any support for extended protobuf
messages. An attempt to write a protobuf message with extension either:
* Results in an uninformative error like this:
{code}
parquet.proto.ProtoWriteSupport: Cannot write message
YOUR_EXTENSION_TYPE cannot be cast to A_BASE_FIELD_TYPE : BASE_FIELD_NAME
{code}
* Results in a data corruption, if an extension type is compatible with a base
field type with a corresponding index.
Imo there are two possible solutions:
1. Ignore extension fields.
2. When an extension field is detected in a message, throw an informative error.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)