[ https://issues.apache.org/jira/browse/HUDI-4433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
leesf reassigned HUDI-4433: --------------------------- Assignee: brightwon > Hudi-CLI repair deduplicate not working with non-partitioned dataset > -------------------------------------------------------------------- > > Key: HUDI-4433 > URL: https://issues.apache.org/jira/browse/HUDI-4433 > Project: Apache Hudi > Issue Type: Improvement > Components: cli > Reporter: brightwon > Assignee: brightwon > Priority: Minor > > hudi-cli's *repair deduplicate* command is not working with non-partitioned > dataset. > because can't pass *empty value* for *--duplicatedPartitionPath* parameter. > for example, this command > repair deduplicate --duplicatedPartitionPath "" --repairedOutputPath > "s3://myBucket/table/" --sparkMaster yarn --sparkMemory 4G --dryrun true > --dedupeType "upsert_type" > result is, +_You should specify value for option 'duplicatedPartitionPath' > for this command_+ > > My slack message link in #general channel > [https://apache-hudi.slack.com/archives/C4D716NPQ/p1657854371469139|http://example.com/] -- This message was sent by Atlassian Jira (v8.20.10#820010)