islamismailov commented on PR #957:
URL: https://github.com/apache/parquet-mr/pull/957#issuecomment-1137620395
I debugged this some more and it looks like some of the problem is coming
from conversion between parquet and avro. Especially if you read parquetSchema,
convert it to avro and set projection in avro schema format, it would get
converted back to parquet and it will look different from the original.
`System.out.println("ORIGINAL PARQUET " + fileSchema);
Schema avroSchema = new
AvroSchemaConverter(configuration).convert(fileSchema);
MessageType parquetSchema = new
AvroSchemaConverter(configuration).convert(avroSchema);
System.out.println("RECONSTRUCTED PARQUET " + parquetSchema);`
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]