wzx140 commented on code in PR #7003:
URL: https://github.com/apache/hudi/pull/7003#discussion_r1025423023
##########
hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/io/storage/HoodieSparkParquetReader.java:
##########
@@ -108,7 +117,12 @@ private ClosableIterator<InternalRow>
getInternalRowIterator(Schema readerSchema
@Override
public Schema getSchema() {
- return parquetUtils.readAvroSchema(conf, path);
+ // Some types in avro are not compatible with parquet
+ // decimal as int32/int64
+ MessageType messageType = ((ParquetUtils) parquetUtils).readSchema(conf,
path);
+ StructType structType = new
ParquetToSparkSchemaConverter(conf).convert(messageType);
Review Comment:
Some types in avro are not compatible with parquet. For ex, decimal as
int32/int64 in parquet will convert to int/long in avro. Because avro do not
has decimal as int/long . We will lose the logic type info if we direct convert
it to avro.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]