Paul Praet created SPARK-21666: ---------------------------------- Summary: Cannot handle Parquet type FIXED_LEN_BYTE_ARRAY Key: SPARK-21666 URL: https://issues.apache.org/jira/browse/SPARK-21666 Project: Spark Issue Type: Bug Components: SQL Affects Versions: 2.2.0 Reporter: Paul Praet
I have a parquet schema that looks like this: {{ optional group connection { required fixed_len_byte_array(6) localMacAddress; required fixed_len_byte_array(6) remoteMacAddress; } }} When I try to load this parquet file in Spark, I get: Caused by: org.apache.spark.sql.AnalysisException: Illegal Parquet type: FIXED_LEN_BYTE_ARRAY; at org.apache.spark.sql.execution.datasources.parquet.ParquetSchemaConverter.illegalType$1(ParquetSchemaConverter.scala:126) at org.apache.spark.sql.execution.datasources.parquet.ParquetSchemaConverter.convertPrimitiveField(ParquetSchemaConverter.scala:193) at org.apache.spark.sql.execution.datasources.parquet.ParquetSchemaConverter.convertField(ParquetSchemaConverter.scala:108) at org.apache.spark.sql.execution.datasources.parquet.ParquetSchemaConverter$$anonfun$2.apply(ParquetSchemaConverter.scala:90) at org.apache.spark.sql.execution.datasources.parquet.ParquetSchemaConverter$$anonfun$2.apply(ParquetSchemaConverter.scala:84) We are not able to change the schema so this issue prevents us from processing the data. Duplicate of https://issues.apache.org/jira/browse/SPARK-2489 -- This message was sent by Atlassian JIRA (v6.4.14#64029) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org