[
https://issues.apache.org/jira/browse/PARQUET-660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15369942#comment-15369942
]
Jakub Kukul commented on PARQUET-660:
-------------------------------------
I submitted a patch:
https://github.com/apache/parquet-mr/pull/351
> Writing Protobuf messages with extensions results in an error or data
> corruption.
> ---------------------------------------------------------------------------------
>
> Key: PARQUET-660
> URL: https://issues.apache.org/jira/browse/PARQUET-660
> Project: Parquet
> Issue Type: Bug
> Components: parquet-mr
> Affects Versions: 1.8.1
> Reporter: Jakub Kukul
>
> Currently, in ProtoParquetWriter there's not any support for extended
> protobuf messages. An attempt to write a protobuf message with extension
> either:
> * Results in an uninformative error like this:
> {code}
> parquet.proto.ProtoWriteSupport: Cannot write message
> YOUR_EXTENSION_TYPE cannot be cast to A_BASE_FIELD_TYPE : BASE_FIELD_NAME
> {code}
> * Results in a data corruption, if an extension type is compatible with a
> base field type with a corresponding index.
> Imo there are two possible solutions:
>
> 1. Ignore extension fields.
> 2. When an extension field is detected in a message, throw an informative
> error.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)