Re: [I] [SUPPORT] Data loss due to incorrect selection of log file during compaction [hudi]

2024-03-23 Thread via GitHub
beyond1920 commented on issue #10803: URL: https://github.com/apache/hudi/issues/10803#issuecomment-2016687994 @nsivabalan @Ytimetravel Another data loss case caused by the whole stage retry. There are 4 cases that the task retry: * Task is slow, another speculation task is

Re: [PR] [HUDI-6758] Fixing deducing spurious log blocks due to spark retries [hudi]

2024-03-23 Thread via GitHub
beyond1920 commented on PR #9611: URL: https://github.com/apache/hudi/pull/9611#issuecomment-2016687160 @nsivabalan Good job. We found a minor drawback. There are 4 cases that the task retry: 1. Task is slow, another speculation task is retried 2. The task failed and retry

Re: [PR] [HUDI-7534] Refactoring of handleUpdate in CommitActionExecutors and HoodieTables [hudi]

2024-03-23 Thread via GitHub
hudi-bot commented on PR #10917: URL: https://github.com/apache/hudi/pull/10917#issuecomment-2016670285 ## CI report: * 84dd6612f0f52236936b70cfcb734eaf33fbe9e7 Azure:

Re: [PR] [HUDI-7534] Refactoring of handleUpdate in CommitActionExecutors and HoodieTables [hudi]

2024-03-23 Thread via GitHub
hudi-bot commented on PR #10917: URL: https://github.com/apache/hudi/pull/10917#issuecomment-2016657794 ## CI report: * f1161ad007a7bb3b4b748d85e200ce29a87f34b2 Azure:

Re: [PR] [HUDI-7534] Refactoring of handleUpdate in CommitActionExecutors and HoodieTables [hudi]

2024-03-23 Thread via GitHub
hudi-bot commented on PR #10917: URL: https://github.com/apache/hudi/pull/10917#issuecomment-2016656151 ## CI report: * f1161ad007a7bb3b4b748d85e200ce29a87f34b2 Azure:

[jira] [Updated] (HUDI-7534) Refactoring of handleUpdate in CommitActionExecutors and HoodieTables

2024-03-23 Thread Vova Kolmakov (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vova Kolmakov updated HUDI-7534: Description: After refactoring HUDI-7530 there still remains many places to be cleaned in

Re: [PR] [HUDI-7534] Refactoring of handleUpdate in CommitActionExecutors and HoodieTables [hudi]

2024-03-23 Thread via GitHub
wombatu-kun commented on code in PR #10917: URL: https://github.com/apache/hudi/pull/10917#discussion_r1536716915 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/HoodieTable.java: ## @@ -1096,11 +1104,37 @@ private Set getDropPartitionColNames() {

[jira] [Closed] (HUDI-7499) Support OverwriteWithGreaterRecordPayload for Hudi

2024-03-23 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen closed HUDI-7499. Resolution: Fixed Fixed via master branch: f98a40bd369ac4085a5da0e4864e39fe10a607a9 > Support

Re: [PR] [HUDI-7499] Support FirstValueAvroPayload for Hudi [hudi]

2024-03-23 Thread via GitHub
danny0405 merged PR #10857: URL: https://github.com/apache/hudi/pull/10857 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[jira] [Updated] (HUDI-7499) Support OverwriteWithGreaterRecordPayload for Hudi

2024-03-23 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-7499: - Fix Version/s: 1.0.0 > Support OverwriteWithGreaterRecordPayload for Hudi >

(hudi) branch master updated (a8e9db446c3 -> f98a40bd369)

2024-03-23 Thread danny0405
This is an automated email from the ASF dual-hosted git repository. danny0405 pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from a8e9db446c3 [HUDI-7530] Refactoring of handleUpdateInternal in CommitActionExecutors and HoodieTables (#10908)

Re: [PR] [HUDI-7534] Refactoring of handleUpdate in CommitActionExecutors and HoodieTables [hudi]

2024-03-23 Thread via GitHub
danny0405 commented on code in PR #10917: URL: https://github.com/apache/hudi/pull/10917#discussion_r1536712262 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/HoodieTable.java: ## @@ -1096,11 +1104,37 @@ private Set getDropPartitionColNames() {

Re: [PR] [HUDI-7525] prevent dag trigger in mappartitions if possible [hudi]

2024-03-23 Thread via GitHub
danny0405 commented on code in PR #10904: URL: https://github.com/apache/hudi/pull/10904#discussion_r1536712080 ## hudi-client/hudi-spark-client/src/main/scala/org/apache/hudi/AvroConversionUtils.scala: ## @@ -98,13 +98,10 @@ object AvroConversionUtils { */ def

Re: [PR] [HUDI-7525] prevent dag trigger in mappartitions if possible [hudi]

2024-03-23 Thread via GitHub
danny0405 commented on code in PR #10904: URL: https://github.com/apache/hudi/pull/10904#discussion_r1536712080 ## hudi-client/hudi-spark-client/src/main/scala/org/apache/hudi/AvroConversionUtils.scala: ## @@ -98,13 +98,10 @@ object AvroConversionUtils { */ def

Re: [PR] [HUDI-7466] Add tests to AWSGlueCatalogSyncClient [hudi]

2024-03-23 Thread via GitHub
parisni commented on PR #10897: URL: https://github.com/apache/hudi/pull/10897#issuecomment-2016639652 From the error logs you have ``` Error: DOCKER> [motoserver/moto:5.0.3] "it-aws": Timeout after 10060 ms while waiting on url http://localhost:5010/moto-api/ Error: DOCKER>

Re: [PR] [HUDI-7466] Add tests to AWSGlueCatalogSyncClient [hudi]

2024-03-23 Thread via GitHub
parisni commented on code in PR #10897: URL: https://github.com/apache/hudi/pull/10897#discussion_r1536698280 ## hudi-aws/src/test/java/org/apache/hudi/aws/sync/ITTestGluePartitionPushdown.java: ## @@ -131,8 +151,40 @@ public void testEmptyPartitionShouldReturnEmpty() {

Re: [PR] [HUDI-7535] Add metrics for sourceParallelism and Refresh profile in S3/GCS [hudi]

2024-03-23 Thread via GitHub
hudi-bot commented on PR #10918: URL: https://github.com/apache/hudi/pull/10918#issuecomment-2016617245 ## CI report: * 95436a55a29960c5bdeb8901f83c90d4712aa40b Azure:

Re: [PR] [HUDI-7466] Add tests to AWSGlueCatalogSyncClient [hudi]

2024-03-23 Thread via GitHub
parisni commented on code in PR #10897: URL: https://github.com/apache/hudi/pull/10897#discussion_r1536698280 ## hudi-aws/src/test/java/org/apache/hudi/aws/sync/ITTestGluePartitionPushdown.java: ## @@ -131,8 +151,40 @@ public void testEmptyPartitionShouldReturnEmpty() {

Re: [PR] [HUDI-7466] Add tests to AWSGlueCatalogSyncClient [hudi]

2024-03-23 Thread via GitHub
parisni commented on code in PR #10897: URL: https://github.com/apache/hudi/pull/10897#discussion_r1536697734 ## hudi-aws/src/test/java/org/apache/hudi/aws/sync/ITTestGluePartitionPushdown.java: ## @@ -131,8 +151,40 @@ public void testEmptyPartitionShouldReturnEmpty() {

Re: [PR] [HUDI-7535] Add metrics for sourceParallelism and Refresh profile in S3/GCS [hudi]

2024-03-23 Thread via GitHub
hudi-bot commented on PR #10918: URL: https://github.com/apache/hudi/pull/10918#issuecomment-2016608544 ## CI report: * 95436a55a29960c5bdeb8901f83c90d4712aa40b Azure:

[jira] [Updated] (HUDI-7535) Add metrics for source parallelism for Kafka and S3/GCS sources

2024-03-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7535: - Labels: pull-request-available (was: ) > Add metrics for source parallelism for Kafka and S3/GCS

Re: [PR] [HUDI-7535] Add metrics for sourceParallelism and Refresh profile in S3/GCS [hudi]

2024-03-23 Thread via GitHub
hudi-bot commented on PR #10918: URL: https://github.com/apache/hudi/pull/10918#issuecomment-2016607059 ## CI report: * 95436a55a29960c5bdeb8901f83c90d4712aa40b UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run

[PR] Add metrics for sourceParallelism and Refresh profile in S3/GCS [hudi]

2024-03-23 Thread via GitHub
vinishjail97 opened a new pull request, #10918: URL: https://github.com/apache/hudi/pull/10918 ### Change Logs Previous PR -> https://github.com/apache/hudi/pull/10861 Publish metrics for source parallelism for Kafka, S3/GCS sources. ### Impact No impact, only

[jira] [Updated] (HUDI-7535) Add metrics for source parallelism for Kafka and S3/GCS sources

2024-03-23 Thread Vinish Reddy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinish Reddy updated HUDI-7535: --- Status: In Progress (was: Open) > Add metrics for source parallelism for Kafka and S3/GCS sources >

[jira] [Created] (HUDI-7535) Add metrics for source parallelism for Kafka and S3/GCS sources

2024-03-23 Thread Vinish Reddy (Jira)
Vinish Reddy created HUDI-7535: -- Summary: Add metrics for source parallelism for Kafka and S3/GCS sources Key: HUDI-7535 URL: https://issues.apache.org/jira/browse/HUDI-7535 Project: Apache Hudi

[jira] [Updated] (HUDI-7508) Avoid converting iterator to list HoodieStreamerUtils.createHoodieRecords

2024-03-23 Thread Vinish Reddy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinish Reddy updated HUDI-7508: --- Priority: Minor (was: Major) > Avoid converting iterator to list

[jira] [Closed] (HUDI-4668) Syntax error in Hudi Quick Start Guide

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-4668. Resolution: Won't Do > Syntax error in Hudi Quick Start Guide > -- > >

[jira] [Commented] (HUDI-4859) Adding a blog on how to run Hudi on Serverless Platforms (AWS Glue)

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830160#comment-17830160 ] Raymond Xu commented on HUDI-4859: -- [~neuw84]any update on this?  > Adding a blog on how to run Hudi on

[jira] [Updated] (HUDI-4859) Adding a blog on how to run Hudi on Serverless Platforms (AWS Glue)

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4859: - Fix Version/s: 0.15.0 > Adding a blog on how to run Hudi on Serverless Platforms (AWS Glue) >

[jira] [Closed] (HUDI-2407) [Website] Support staging site for per pull request

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-2407. Resolution: Abandoned > [Website] Support staging site for per pull request >

[jira] [Closed] (HUDI-4634) update schema provider configuration in MTDS blog

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-4634. Resolution: Fixed > update schema provider configuration in MTDS blog >

[jira] [Closed] (HUDI-4610) Add docs for Hoodie metrics

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-4610. Fix Version/s: 0.13.0 Resolution: Fixed > Add docs for Hoodie metrics > ---

[jira] [Closed] (HUDI-5652) Add Docs for hudi cli bundle usage

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-5652. Fix Version/s: 0.13.0 Resolution: Fixed > Add Docs for hudi cli bundle usage >

[jira] [Closed] (HUDI-4627) [DOCS] Automate generation of basic_configurations page

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4627?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-4627. Resolution: Fixed already done > [DOCS] Automate generation of basic_configurations page >

[jira] [Closed] (HUDI-5668) Separate advanced configs from essential configs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-5668. Resolution: Fixed already done > Separate advanced configs from essential configs >

[jira] [Updated] (HUDI-5677) [DOCS] Update AWS libs version

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5677: - Fix Version/s: 0.15.0 0.14.2 > [DOCS] Update AWS libs version >

[jira] [Assigned] (HUDI-5677) [DOCS] Update AWS libs version

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-5677: Assignee: Raymond Xu > [DOCS] Update AWS libs version > -- > >

[jira] [Updated] (HUDI-3167) Update RFC27 with the design for the new HoodieIndex type based on metadata indices

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3167: - Component/s: (was: docs) > Update RFC27 with the design for the new HoodieIndex type based on

[jira] [Assigned] (HUDI-3309) Integrate quickstart examples into integration tests

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-3309: Assignee: Raymond Xu > Integrate quickstart examples into integration tests >

[jira] [Commented] (HUDI-3390) Update cleaner blog with KEEP_LATEST_BY_HOURS policy

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830155#comment-17830155 ] Raymond Xu commented on HUDI-3390: -- [~Pratyaksh] any update on this? > Update cleaner blog with

[jira] [Updated] (HUDI-3390) Update cleaner blog with KEEP_LATEST_BY_HOURS policy

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3390: - Fix Version/s: 0.15.0 0.14.2 > Update cleaner blog with KEEP_LATEST_BY_HOURS policy >

[jira] [Updated] (HUDI-2369) Blog on bulk insert sort modes

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2369: - Component/s: (was: docs) > Blog on bulk insert sort modes > -- > >

[jira] [Updated] (HUDI-4069) Document usages of diff lock providers in diff clouds

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4069: - Labels: (was: doc) > Document usages of diff lock providers in diff clouds >

[jira] [Updated] (HUDI-4069) Document usages of diff lock providers in diff clouds

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4069: - Fix Version/s: 0.15.0 > Document usages of diff lock providers in diff clouds >

[jira] [Updated] (HUDI-4045) DynamoDB billing_mode property is incorrectly documented

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4045: - Fix Version/s: 0.15.0 0.14.2 > DynamoDB billing_mode property is incorrectly

[jira] [Assigned] (HUDI-4069) Document usages of diff lock providers in diff clouds

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-4069: Assignee: Raymond Xu > Document usages of diff lock providers in diff clouds >

[jira] [Updated] (HUDI-4045) DynamoDB billing_mode property is incorrectly documented

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4045: - Component/s: (was: docs) > DynamoDB billing_mode property is incorrectly documented >

[jira] [Assigned] (HUDI-4004) Website config update utils to handle different bundle versions

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4004?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-4004: Assignee: Raymond Xu > Website config update utils to handle different bundle versions >

[jira] [Updated] (HUDI-4045) DynamoDB billing_mode property is incorrectly documented

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4045: - Component/s: docs > DynamoDB billing_mode property is incorrectly documented >

[jira] [Assigned] (HUDI-4045) DynamoDB billing_mode property is incorrectly documented

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-4045: Assignee: Raymond Xu > DynamoDB billing_mode property is incorrectly documented >

[jira] [Updated] (HUDI-4045) DynamoDB billing_mode property is incorrectly documented

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4045: - Issue Type: Bug (was: Task) > DynamoDB billing_mode property is incorrectly documented >

[jira] [Updated] (HUDI-4004) Website config update utils to handle different bundle versions

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4004?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4004: - Fix Version/s: 0.15.0 > Website config update utils to handle different bundle versions >

[jira] [Closed] (HUDI-3999) Add Update/Delete SQL DML to writing page

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-3999. Fix Version/s: 0.15.0 Resolution: Fixed Fixed [https://hudi.apache.org/docs/next/sql_dml] > Add

[jira] [Assigned] (HUDI-3854) Add doc for using delete partitions api in website

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-3854: Assignee: Raymond Xu (was: Bhavani Sudha) > Add doc for using delete partitions api in website >

[jira] [Updated] (HUDI-3854) Add doc for using delete partitions api in website

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3854: - Fix Version/s: 0.15.0 > Add doc for using delete partitions api in website >

[jira] [Closed] (HUDI-3874) Improve Hudi Quickstart Docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-3874. Resolution: Fixed > Improve Hudi Quickstart Docs > > > Key:

[jira] [Updated] (HUDI-3585) Docs for (consistent) hashing index

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3585: - Fix Version/s: 0.15.0 > Docs for (consistent) hashing index > --- > >

[jira] [Updated] (HUDI-3939) Website Contributing code to the project (newbie JIRAs) links wrong.

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3939: - Fix Version/s: 0.15.0 0.14.2 > Website Contributing code to the project (newbie JIRAs)

[jira] [Assigned] (HUDI-3585) Docs for (consistent) hashing index

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-3585: Assignee: Raymond Xu > Docs for (consistent) hashing index > --- >

[jira] [Assigned] (HUDI-3939) Website Contributing code to the project (newbie JIRAs) links wrong.

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-3939: Assignee: Raymond Xu > Website Contributing code to the project (newbie JIRAs) links wrong. >

[jira] [Closed] (HUDI-5755) Add detailed description of OCC early conflict detection to concurrency control docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-5755. Fix Version/s: 0.14.1 (was: 0.15.0) Resolution: Fixed > Add detailed

[jira] [Commented] (HUDI-5755) Add detailed description of OCC early conflict detection to concurrency control docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830148#comment-17830148 ] Raymond Xu commented on HUDI-5755: -- already done [https://hudi.apache.org/docs/next/concurrency_control]

[jira] [Updated] (HUDI-3874) Improve Hudi Quickstart Docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3874: - Fix Version/s: 0.13.0 > Improve Hudi Quickstart Docs > > >

[jira] [Updated] (HUDI-5756) Add Consistent Hashing Index to Indexing docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5756: - Fix Version/s: 0.14.1 (was: 0.15.0) > Add Consistent Hashing Index to Indexing

[jira] [Assigned] (HUDI-5756) Add Consistent Hashing Index to Indexing docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-5756: Assignee: (was: Raymond Xu) > Add Consistent Hashing Index to Indexing docs >

[jira] [Closed] (HUDI-5756) Add Consistent Hashing Index to Indexing docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-5756. Resolution: Fixed > Add Consistent Hashing Index to Indexing docs >

[jira] [Commented] (HUDI-5756) Add Consistent Hashing Index to Indexing docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830146#comment-17830146 ] Raymond Xu commented on HUDI-5756: -- already done [https://hudi.apache.org/docs/next/indexing] > Add

[jira] [Updated] (HUDI-5756) Add Consistent Hashing Index to Indexing docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5756: - Fix Version/s: 0.15.0 > Add Consistent Hashing Index to Indexing docs >

[jira] [Updated] (HUDI-5756) Add Consistent Hashing Index to Indexing docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5756: - Component/s: docs > Add Consistent Hashing Index to Indexing docs >

[jira] [Updated] (HUDI-5755) Add detailed description of OCC early conflict detection to concurrency control docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5755: - Fix Version/s: 0.15.0 > Add detailed description of OCC early conflict detection to concurrency >

[jira] [Updated] (HUDI-5757) Add Log Compaction to Write Operation docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5757: - Component/s: docs > Add Log Compaction to Write Operation docs >

[jira] [Assigned] (HUDI-5756) Add Consistent Hashing Index to Indexing docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-5756: Assignee: Raymond Xu (was: sivabalan narayanan) > Add Consistent Hashing Index to Indexing docs >

[jira] [Updated] (HUDI-5757) Add Log Compaction to Write Operation docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5757: - Epic Link: (was: HUDI-4978) > Add Log Compaction to Write Operation docs >

[jira] [Assigned] (HUDI-5757) Add Log Compaction to Write Operation docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-5757: Assignee: Raymond Xu (was: sivabalan narayanan) > Add Log Compaction to Write Operation docs >

[jira] [Updated] (HUDI-5757) Add Log Compaction to Write Operation docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5757: - Fix Version/s: 1.0.0 (was: 1.1.0) > Add Log Compaction to Write Operation docs >

[jira] [Commented] (HUDI-5784) Auto-generate configuration table for feature docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830145#comment-17830145 ] Raymond Xu commented on HUDI-5784: -- [~guoyihua]  can you add more details for this task? > Auto-generate

[jira] [Updated] (HUDI-5784) Auto-generate configuration table for feature docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5784: - Component/s: configs > Auto-generate configuration table for feature docs >

[jira] [Closed] (HUDI-5826) Add docs for how to use Hudi CLI on GCP Dataproc

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-5826. Fix Version/s: 0.13.1 Resolution: Fixed > Add docs for how to use Hudi CLI on GCP Dataproc >

[jira] [Updated] (HUDI-5856) [DOCS] Update and add Spark SQL in procedures.md

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5856: - Fix Version/s: 0.14.0 > [DOCS] Update and add Spark SQL in procedures.md >

[jira] [Closed] (HUDI-5856) [DOCS] Update and add Spark SQL in procedures.md

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-5856. Resolution: Fixed > [DOCS] Update and add Spark SQL in procedures.md >

[jira] [Assigned] (HUDI-5856) [DOCS] Update and add Spark SQL in procedures.md

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-5856: Assignee: lvyanquan > [DOCS] Update and add Spark SQL in procedures.md >

[jira] [Closed] (HUDI-5886) Improve File Sizing, Timeline, and Flink docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-5886. Reviewers: Bhavani Sudha, Danny Chen (was: Bhavani Sudha, DannyChan, Ethan Guo) Resolution: Fixed >

[jira] [Commented] (HUDI-5886) Improve File Sizing, Timeline, and Flink docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830142#comment-17830142 ] Raymond Xu commented on HUDI-5886: -- done in [https://github.com/apache/hudi/pull/9516] > Improve File

[jira] [Updated] (HUDI-5886) Improve File Sizing, Timeline, and Flink docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5886: - Fix Version/s: 0.14.1 > Improve File Sizing, Timeline, and Flink docs >

[jira] [Assigned] (HUDI-5886) Improve File Sizing, Timeline, and Flink docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-5886: Assignee: Bhavani Sudha (was: nadine) > Improve File Sizing, Timeline, and Flink docs >

[jira] [Closed] (HUDI-5912) Update snapshot_exporter to reflect the corrent jar name.md

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-5912. Resolution: Fixed > Update snapshot_exporter to reflect the corrent jar name.md >

[jira] [Updated] (HUDI-5946) Add glue sync configs to website

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5946: - Fix Version/s: 0.15.0 > Add glue sync configs to website > > >

[jira] [Updated] (HUDI-5912) Update snapshot_exporter to reflect the corrent jar name.md

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5912: - Fix Version/s: 0.14.0 (was: 1.1.0) > Update snapshot_exporter to reflect the

[jira] [Commented] (HUDI-5946) Add glue sync configs to website

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17830138#comment-17830138 ] Raymond Xu commented on HUDI-5946: -- [~Pratyaksh]  are you planning to do this? > Add glue sync configs

[jira] [Updated] (HUDI-5959) Add docs for clean policy inference

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5959: - Fix Version/s: 0.15.0 (was: 1.1.0) > Add docs for clean policy inference >

[jira] [Updated] (HUDI-5959) Add docs for clean policy inference

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5959: - Component/s: docs > Add docs for clean policy inference > --- > >

[jira] [Updated] (HUDI-6008) Update docs on key generator

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-6008: - Component/s: docs > Update docs on key generator > > > Key:

[jira] [Assigned] (HUDI-5959) Add docs for clean policy inference

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-5959: Assignee: Raymond Xu (was: Ethan Guo) > Add docs for clean policy inference >

[jira] [Updated] (HUDI-6008) Update docs on key generator

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-6008: - Fix Version/s: 0.15.0 (was: 1.1.0) > Update docs on key generator >

[jira] [Assigned] (HUDI-6008) Update docs on key generator

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-6008: Assignee: Raymond Xu (was: Ethan Guo) > Update docs on key generator >

[jira] [Updated] (HUDI-6037) Improve compaction docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-6037: - Fix Version/s: 0.15.0 > Improve compaction docs > --- > > Key:

[jira] [Updated] (HUDI-5974) Docs update for savepoint CALL procedure with table base path

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5974: - Fix Version/s: 0.15.0 > Docs update for savepoint CALL procedure with table base path >

[jira] [Assigned] (HUDI-6037) Improve compaction docs

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-6037: Assignee: Raymond Xu (was: nadine) > Improve compaction docs > --- > >

[jira] [Assigned] (HUDI-5974) Docs update for savepoint CALL procedure with table base path

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-5974: Assignee: Raymond Xu > Docs update for savepoint CALL procedure with table base path >

[jira] [Updated] (HUDI-5974) Docs update for savepoint CALL procedure with table base path

2024-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-5974: - Component/s: docs > Docs update for savepoint CALL procedure with table base path >

  1   2   >