xushiyan commented on code in PR #5379:
URL: https://github.com/apache/hudi/pull/5379#discussion_r854734744


##########
hudi-client/hudi-spark-client/src/main/scala/org/apache/hudi/HoodieSparkUtils.scala:
##########
@@ -322,7 +323,9 @@ object HoodieSparkUtils extends SparkAdapterSupport {
       val name2Fields = tableAvroSchema.getFields.asScala.map(f => f.name() -> 
f).toMap
       // Here have to create a new Schema.Field object
       // to prevent throwing exceptions like 
"org.apache.avro.AvroRuntimeException: Field already used".
-      val requiredFields = requiredColumns.map(c => name2Fields(c))
+      // For a nested field, we include the root-level field
+      val requiredFields = requiredColumns.map(c => 
HoodieAvroUtils.getRootLevelFieldName(c))
+        .distinct.map(c => name2Fields(c))

Review Comment:
   this code path affects both MOR and COW, right?  is it for spark later to 
drill down to specific nested cols?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to