[ 
https://issues.apache.org/jira/browse/HUDI-7609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ethan Guo updated HUDI-7609:
----------------------------
    Story Points: 4

> Spark cannot write the hudi table containing array type created by flink
> ------------------------------------------------------------------------
>
>                 Key: HUDI-7609
>                 URL: https://issues.apache.org/jira/browse/HUDI-7609
>             Project: Apache Hudi
>          Issue Type: Bug
>            Reporter: 陈磊
>            Assignee: Jonathan Vexler
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 0.16.0, 1.0.0
>
>
> When flink creates a Hudi table containing an array field, the elements of 
> the default array field cannot be nullable. When using Spark SQL to read data 
> from the Hive table to the Hudi table, a field verification exception will 
> occur.
> {code:java}
> 2024-03-27 12:47:51 INFO 
> org.apache.spark.sql.catalyst.analysis.UnresolvedException: Invalid call to 
> dataType on unresolved object, tree: 'decentral_level1
> 2024-03-27 12:47:51 INFO at 
> org.apache.spark.sql.catalyst.analysis.UnresolvedAttribute.dataType(unresolved.scala:138)
> 2024-03-27 12:47:51 INFO at 
> org.apache.spark.sql.types.StructType$.$anonfun$fromAttributes$1(StructType.scala:549)
> 2024-03-27 12:47:51 INFO at 
> scala.collection.TraversableLike.$anonfun$map$1(TraversableLike.scala:238)
> 2024-03-27 12:47:51 INFO at 
> scala.collection.immutable.List.foreach(List.scala:392)
> 2024-03-27 12:47:51 INFO at 
> scala.collection.TraversableLike.map(TraversableLike.scala:238)
> 2024-03-27 12:47:51 INFO at 
> scala.collection.TraversableLike.map$(TraversableLike.scala:231)
> 2024-03-27 12:47:51 INFO at 
> scala.collection.immutable.List.map(List.scala:298)
> 2024-03-27 12:47:51 INFO at 
> org.apache.spark.sql.types.StructType$.fromAttributes(StructType.scala:549)
> 2024-03-27 12:47:51 INFO at 
> org.apache.spark.sql.catalyst.plans.QueryPlan.schema$lzycompute(QueryPlan.scala:281)
> 2024-03-27 12:47:51 INFO at 
> org.apache.spark.sql.catalyst.plans.QueryPlan.schema(QueryPlan.scala:281)
> 2024-03-27 12:47:51 INFO at 
> org.apache.spark.sql.hudi.command.InsertIntoHoodieTableCommand$.alignQueryOutput(InsertIntoHoodieTableCommand.scala:153)
> 2024-03-27 12:47:51 INFO at 
> org.apache.spark.sql.hudi.command.InsertIntoHoodieTableCommand$.run(InsertIntoHoodieTableCommand.scala:105)
> 2024-03-27 12:47:51 INFO at 
> org.apache.spark.sql.hudi.command.InsertIntoHoodieTableCommand.run(InsertIntoHoodieTableCommand.scala:60)
>  {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to