vamshikrishnakyatham commented on code in PR #13793:
URL: https://github.com/apache/hudi/pull/13793#discussion_r2316667736


##########
hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/spark/sql/hudi/command/procedures/ShowCleansProcedure.scala:
##########
@@ -17,53 +17,80 @@
 
 package org.apache.spark.sql.hudi.command.procedures
 
-import org.apache.hudi.{HoodieCLIUtils, SparkAdapterSupport}
-import org.apache.hudi.common.table.timeline.{HoodieInstant, HoodieTimeline, 
TimelineLayout}
+import org.apache.hudi.SparkAdapterSupport
+import org.apache.hudi.common.table.HoodieTableMetaClient
+import org.apache.hudi.common.table.timeline.{HoodieInstant, HoodieTimeline}
 
 import org.apache.spark.internal.Logging
 import org.apache.spark.sql.Row
 import org.apache.spark.sql.types.{DataTypes, Metadata, StructField, 
StructType}
 
-import java.util
-import java.util.Collections
 import java.util.function.Supplier
 
 import scala.collection.JavaConverters._
 
 /**
- * Spark SQL procedure to show completed clean operations for a Hudi table.
+ * Spark SQL procedure to show all clean operations for a Hudi table.
  *
- * This procedure displays information about clean operations that have been 
executed.
- * Clean operations remove old file versions to reclaim storage space and 
maintain table performance.
+ * This procedure provides a comprehensive view of Hudi clean operations.
+ * It displays completed clean operations with full partition metadata for 
both completed and pending operations.
  *
  * == Parameters ==
  * - `table`: Required. The name of the Hudi table to query
- * - `limit`: Optional. Maximum number of clean operations to return (default: 
10)
+ * - `path`: Optional. The path of the Hudi table (anyone of `table` or `path` 
must be provided)
+ * - `limit`: Optional. Maximum number of clean operations to return (default: 
10, ignored if time range specified)
  * - `showArchived`: Optional. Whether to include archived clean operations 
(default: false)
  * - `filter`: Optional. SQL expression to filter results (default: empty 
string)
+ * - `startTime`: Optional. Start time for clean operations (format: 
yyyyMMddHHmmss, default: empty)

Review Comment:
   Sure, @yihua just FYI we are using camrl cased style for the input 
parameters of all the show procedures to be consistent.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to