michael1991 opened a new issue, #11446:
URL: https://github.com/apache/hudi/issues/11446
**Describe the problem you faced**
I'm using Spark3.5 + Hudi0.15.0 for partitioned table, when I choose
req_date and req_hour for partition column name, I will get this error, but
task would be executed successfully finally;
when I choose date and hour for partition column name, error disappeared.
**Expected behavior**
We should get no errors when we just make partition column names a bit
longer.
**Environment Description**
* Hudi version : 0.15.0
* Spark version : 3.5.0
* Hive version : NA
* Hadoop version : 3.3.6
* Storage (HDFS/S3/GCS..) : GCS
* Running on Docker? (yes/no) : no
**Stacktrace**
```
2024-06-13 13:21:13 ERROR PriorityBasedFileSystemView:129 - Got error
running preferred function. Trying secondary
org.apache.hudi.exception.HoodieRemoteException: URI Too Long
at
org.apache.hudi.common.table.view.RemoteHoodieTableFileSystemView.loadPartitions(RemoteHoodieTableFileSystemView.java:447)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.common.table.view.RemoteHoodieTableFileSystemView.loadPartitions(RemoteHoodieTableFileSystemView.java:465)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.common.table.view.PriorityBasedFileSystemView.lambda$loadPartitions$6e5c444d$1(PriorityBasedFileSystemView.java:187)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.common.table.view.PriorityBasedFileSystemView.execute(PriorityBasedFileSystemView.java:69)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.common.table.view.PriorityBasedFileSystemView.loadPartitions(PriorityBasedFileSystemView.java:185)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.table.action.clean.CleanPlanActionExecutor.requestClean(CleanPlanActionExecutor.java:133)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.table.action.clean.CleanPlanActionExecutor.requestClean(CleanPlanActionExecutor.java:174)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.table.action.clean.CleanPlanActionExecutor.execute(CleanPlanActionExecutor.java:200)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.table.HoodieSparkCopyOnWriteTable.scheduleCleaning(HoodieSparkCopyOnWriteTable.java:212)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.client.BaseHoodieTableServiceClient.scheduleTableServiceInternal(BaseHoodieTableServiceClient.java:647)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.client.BaseHoodieTableServiceClient.clean(BaseHoodieTableServiceClient.java:746)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.client.BaseHoodieWriteClient.clean(BaseHoodieWriteClient.java:843)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.client.BaseHoodieWriteClient.clean(BaseHoodieWriteClient.java:816)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.client.BaseHoodieWriteClient.clean(BaseHoodieWriteClient.java:847)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.client.BaseHoodieWriteClient.autoCleanOnCommit(BaseHoodieWriteClient.java:581)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.client.BaseHoodieWriteClient.mayBeCleanAndArchive(BaseHoodieWriteClient.java:560)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.client.BaseHoodieWriteClient.commitStats(BaseHoodieWriteClient.java:251)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.client.SparkRDDWriteClient.commit(SparkRDDWriteClient.java:108)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.HoodieSparkSqlWriterInternal.commitAndPerformPostOperations(HoodieSparkSqlWriter.scala:1082)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.HoodieSparkSqlWriterInternal.writeInternal(HoodieSparkSqlWriter.scala:508)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.HoodieSparkSqlWriterInternal.write(HoodieSparkSqlWriter.scala:187)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.HoodieSparkSqlWriter$.write(HoodieSparkSqlWriter.scala:125)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.DefaultSource.createRelation(DefaultSource.scala:168)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.spark.sql.execution.datasources.SaveIntoDataSourceCommand.run(SaveIntoDataSourceCommand.scala:48)
~[spark-sql_2.12-3.5.0.jar:0.15.0]
at
org.apache.spark.sql.execution.command.ExecutedCommandExec.sideEffectResult$lzycompute(commands.scala:75)
~[spark-sql_2.12-3.5.0.jar:3.5.0]
at
org.apache.spark.sql.execution.command.ExecutedCommandExec.sideEffectResult(commands.scala:73)
~[spark-sql_2.12-3.5.0.jar:3.5.0]
at
org.apache.spark.sql.execution.command.ExecutedCommandExec.executeCollect(commands.scala:84)
~[spark-sql_2.12-3.5.0.jar:3.5.0]
at
org.apache.spark.sql.execution.QueryExecution$$anonfun$eagerlyExecuteCommands$1.$anonfun$applyOrElse$1(QueryExecution.scala:107)
~[spark-sql_2.12-3.5.0.jar:0.15.0]
at
org.apache.spark.sql.execution.SQLExecution$.$anonfun$withNewExecutionId$6(SQLExecution.scala:125)
~[spark-sql_2.12-3.5.0.jar:0.15.0]
at
org.apache.spark.sql.execution.SQLExecution$.withSQLConfPropagated(SQLExecution.scala:201)
~[spark-sql_2.12-3.5.0.jar:0.15.0]
at
org.apache.spark.sql.execution.SQLExecution$.$anonfun$withNewExecutionId$1(SQLExecution.scala:108)
~[spark-sql_2.12-3.5.0.jar:0.15.0]
at org.apache.spark.sql.SparkSession.withActive(SparkSession.scala:900)
~[spark-sql_2.12-3.5.0.jar:3.5.0]
at
org.apache.spark.sql.execution.SQLExecution$.withNewExecutionId(SQLExecution.scala:66)
~[spark-sql_2.12-3.5.0.jar:0.15.0]
at
org.apache.spark.sql.execution.QueryExecution$$anonfun$eagerlyExecuteCommands$1.applyOrElse(QueryExecution.scala:107)
~[spark-sql_2.12-3.5.0.jar:0.15.0]
at
org.apache.spark.sql.execution.QueryExecution$$anonfun$eagerlyExecuteCommands$1.applyOrElse(QueryExecution.scala:98)
~[spark-sql_2.12-3.5.0.jar:0.15.0]
at
org.apache.spark.sql.catalyst.trees.TreeNode.$anonfun$transformDownWithPruning$1(TreeNode.scala:473)
~[spark-catalyst_2.12-3.5.0.jar:3.5.0]
at
org.apache.spark.sql.catalyst.trees.CurrentOrigin$.withOrigin(origin.scala:76)
~[spark-sql-api_2.12-3.5.0.jar:3.5.0]
at
org.apache.spark.sql.catalyst.trees.TreeNode.transformDownWithPruning(TreeNode.scala:473)
~[spark-catalyst_2.12-3.5.0.jar:3.5.0]
at
org.apache.spark.sql.catalyst.plans.logical.LogicalPlan.org$apache$spark$sql$catalyst$plans$logical$AnalysisHelper$$super$transformDownWithPruning(LogicalPlan.scala:32)
~[spark-catalyst_2.12-3.5.0.jar:3.5.0]
at
org.apache.spark.sql.catalyst.plans.logical.AnalysisHelper.transformDownWithPruning(AnalysisHelper.scala:267)
~[spark-catalyst_2.12-3.5.0.jar:3.5.0]
at
org.apache.spark.sql.catalyst.plans.logical.AnalysisHelper.transformDownWithPruning$(AnalysisHelper.scala:263)
~[spark-catalyst_2.12-3.5.0.jar:3.5.0]
at
org.apache.spark.sql.catalyst.plans.logical.LogicalPlan.transformDownWithPruning(LogicalPlan.scala:32)
~[spark-catalyst_2.12-3.5.0.jar:3.5.0]
at
org.apache.spark.sql.catalyst.plans.logical.LogicalPlan.transformDownWithPruning(LogicalPlan.scala:32)
~[spark-catalyst_2.12-3.5.0.jar:3.5.0]
at
org.apache.spark.sql.catalyst.trees.TreeNode.transformDown(TreeNode.scala:449)
~[spark-catalyst_2.12-3.5.0.jar:3.5.0]
at
org.apache.spark.sql.execution.QueryExecution.eagerlyExecuteCommands(QueryExecution.scala:98)
~[spark-sql_2.12-3.5.0.jar:0.15.0]
at
org.apache.spark.sql.execution.QueryExecution.commandExecuted$lzycompute(QueryExecution.scala:85)
~[spark-sql_2.12-3.5.0.jar:0.15.0]
at
org.apache.spark.sql.execution.QueryExecution.commandExecuted(QueryExecution.scala:83)
~[spark-sql_2.12-3.5.0.jar:0.15.0]
at
org.apache.spark.sql.execution.QueryExecution.assertCommandExecuted(QueryExecution.scala:142)
~[spark-sql_2.12-3.5.0.jar:0.15.0]
at
org.apache.spark.sql.DataFrameWriter.runCommand(DataFrameWriter.scala:859)
~[spark-sql_2.12-3.5.0.jar:3.5.0]
at
org.apache.spark.sql.DataFrameWriter.saveToV1Source(DataFrameWriter.scala:388)
~[spark-sql_2.12-3.5.0.jar:3.5.0]
at
org.apache.spark.sql.DataFrameWriter.saveInternal(DataFrameWriter.scala:361)
~[spark-sql_2.12-3.5.0.jar:3.5.0]
at org.apache.spark.sql.DataFrameWriter.save(DataFrameWriter.scala:240)
~[spark-sql_2.12-3.5.0.jar:3.5.0]
at
scala.collection.mutable.ResizableArray.foreach(ResizableArray.scala:62)
[scala-library-2.12.18.jar:?]
at
scala.collection.mutable.ResizableArray.foreach$(ResizableArray.scala:55)
[scala-library-2.12.18.jar:?]
at scala.collection.mutable.ArrayBuffer.foreach(ArrayBuffer.scala:49)
[scala-library-2.12.18.jar:?]
at
java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
~[?:?]
at
java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
~[?:?]
at
java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
~[?:?]
at java.base/java.lang.reflect.Method.invoke(Method.java:566) ~[?:?]
at
org.apache.spark.deploy.JavaMainApplication.start(SparkApplication.scala:52)
[spark-core_2.12-3.5.0.jar:3.5.0]
at
org.apache.spark.deploy.SparkSubmit.org$apache$spark$deploy$SparkSubmit$$runMain(SparkSubmit.scala:1032)
[spark-core_2.12-3.5.0.jar:3.5.0]
at
org.apache.spark.deploy.SparkSubmit.doRunMain$1(SparkSubmit.scala:194)
[spark-core_2.12-3.5.0.jar:3.5.0]
at org.apache.spark.deploy.SparkSubmit.submit(SparkSubmit.scala:217)
[spark-core_2.12-3.5.0.jar:3.5.0]
at org.apache.spark.deploy.SparkSubmit.doSubmit(SparkSubmit.scala:91)
[spark-core_2.12-3.5.0.jar:3.5.0]
at
org.apache.spark.deploy.SparkSubmit$$anon$2.doSubmit(SparkSubmit.scala:1124)
[spark-core_2.12-3.5.0.jar:3.5.0]
at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:1133)
[spark-core_2.12-3.5.0.jar:3.5.0]
at org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala)
[spark-core_2.12-3.5.0.jar:3.5.0]
Caused by: org.apache.hudi.org.apache.http.client.HttpResponseException: URI
Too Long
at
org.apache.hudi.org.apache.http.impl.client.AbstractResponseHandler.handleResponse(AbstractResponseHandler.java:69)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.org.apache.http.client.fluent.Response.handleResponse(Response.java:90)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.org.apache.http.client.fluent.Response.returnContent(Response.java:97)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.common.table.view.RemoteHoodieTableFileSystemView.executeRequest(RemoteHoodieTableFileSystemView.java:189)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
at
org.apache.hudi.common.table.view.RemoteHoodieTableFileSystemView.loadPartitions(RemoteHoodieTableFileSystemView.java:445)
~[hudi-spark3.5-bundle_2.12-0.15.0.jar:0.15.0]
... 71 more
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]