islamismailov commented on PR #957:
URL: https://github.com/apache/parquet-mr/pull/957#issuecomment-1137620395

   I debugged this some more and it looks like some of the problem is coming 
from conversion between parquet and avro. Especially if you read parquetSchema, 
convert it to avro and set projection in avro schema format, it would get 
converted back to parquet and it will look different from the original.
   
   `System.out.println("ORIGINAL PARQUET " + fileSchema);
   Schema avroSchema = new 
AvroSchemaConverter(configuration).convert(fileSchema);
   MessageType parquetSchema = new 
AvroSchemaConverter(configuration).convert(avroSchema);
   System.out.println("RECONSTRUCTED PARQUET " + parquetSchema);`


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to