[PR] [DO-NOT-MERGE] Trigger validate-release-candidate-bundles and validate-release-maven-artifacts for 1.0.2 [hudi]

2025-05-02 Thread via GitHub
voonhous opened a new pull request, #13251: URL: https://github.com/apache/hudi/pull/13251 ### Change Logs Creating a PR to trigger `validate-release-candidate-bundles` and `validate-release-maven-artifacts`. ### Impact None ### Risk level (write none, low medium

Re: [PR] [DO-NOT-MERGE] Trigger validate-release-candidate-bundles and validate-release-maven-artifacts for 1.0.2 [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13251: URL: https://github.com/apache/hudi/pull/13251#issuecomment-2847764587 ## CI report: * 095a2ed4bcc0726f07064db1aeb249c1e0b58c10 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run th

Re: [PR] [DO-NOT-MERGE] Trigger validate-release-candidate-bundles and validate-release-maven-artifacts for 1.0.2 [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13251: URL: https://github.com/apache/hudi/pull/13251#issuecomment-2847767102 ## CI report: * 095a2ed4bcc0726f07064db1aeb249c1e0b58c10 Azure: [PENDING](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=5270)

Re: [PR] [HUDI-9177] Add Indexer abstraction and refactor metadata table initialization logic [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #12983: URL: https://github.com/apache/hudi/pull/12983#issuecomment-2847799690 ## CI report: * 3690cb8aa9d7bc0d8857402e4dc286cce5eb994e UNKNOWN * a662d4a10abfa1e83d587f26684a59fa49e51cc5 UNKNOWN * e7b7f4c87cf6714d92a919cf0b701ac8021f96c9 UNKNOWN *

Re: [PR] [DO-NOT-MERGE] Trigger validate-release-candidate-bundles and validate-release-maven-artifacts for 1.0.2 [hudi]

2025-05-02 Thread via GitHub
voonhous commented on PR #13251: URL: https://github.com/apache/hudi/pull/13251#issuecomment-2847848754 Ignore the `Release Candidate Validation` failures; we only need to focus on the `Maven Release Artifact Validation`. I suppose it's failing because the staging repo `1151` is not av

Re: [PR] [DO-NOT-MERGE] Trigger validate-release-candidate-bundles for 1.0.2 [hudi]

2025-05-02 Thread via GitHub
voonhous closed pull request #13246: [DO-NOT-MERGE] Trigger validate-release-candidate-bundles for 1.0.2 URL: https://github.com/apache/hudi/pull/13246 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go t

Re: [PR] [HUDI-9177] Add Indexer abstraction and refactor metadata table initialization logic [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #12983: URL: https://github.com/apache/hudi/pull/12983#issuecomment-2847774060 ## CI report: * 3690cb8aa9d7bc0d8857402e4dc286cce5eb994e UNKNOWN * a662d4a10abfa1e83d587f26684a59fa49e51cc5 UNKNOWN * e7b7f4c87cf6714d92a919cf0b701ac8021f96c9 UNKNOWN *

Re: [PR] [DO-NOT-MERGE] Trigger validate-release-candidate-bundles and validate-release-maven-artifacts for 1.0.2 [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13251: URL: https://github.com/apache/hudi/pull/13251#issuecomment-2847881435 ## CI report: * 095a2ed4bcc0726f07064db1aeb249c1e0b58c10 Azure: [SUCCESS](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=5270)

Re: [PR] [HUDI-9177] Add Indexer abstraction and refactor metadata table initialization logic [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #12983: URL: https://github.com/apache/hudi/pull/12983#issuecomment-2847938108 ## CI report: * 3690cb8aa9d7bc0d8857402e4dc286cce5eb994e UNKNOWN * a662d4a10abfa1e83d587f26684a59fa49e51cc5 UNKNOWN * e7b7f4c87cf6714d92a919cf0b701ac8021f96c9 UNKNOWN *

Re: [I] [SUPPORT] docker demo not working: Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/parquet/format/TypeDefinedOrder [hudi]

2025-05-02 Thread via GitHub
uptycs-Sushrut commented on issue #12946: URL: https://github.com/apache/hudi/issues/12946#issuecomment-2847549167 @rangareddy This is caused by a missing include in [this file](https://github.com/apache/hudi/blob/master/packaging/hudi-hive-sync-bundle/pom.xml). I checked the fil

Re: [PR] [HUDI-9281][DNM] Streaming writes to Metadata table [hudi]

2025-05-02 Thread via GitHub
nsivabalan commented on code in PR #13236: URL: https://github.com/apache/hudi/pull/13236#discussion_r2071222107 ## hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/client/SparkRDDWriteClient.java: ## @@ -310,6 +468,27 @@ private void initializeMetadataTable(Option i

Re: [PR] [HUDI-7915] Spark 4 support [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #12772: URL: https://github.com/apache/hudi/pull/12772#issuecomment-2846566475 ## CI report: * c404e6d16eca90d173c6987e92503d84f344f626 UNKNOWN * 4c72fd9e1afdb76cbbaddc9b2fecf9597626f439 UNKNOWN * 7860a3d4abfaab77f7f717baf19454b32149ca41 UNKNOWN *

Re: [PR] [HUDI-9281][DNM] Streaming writes to Metadata table [hudi]

2025-05-02 Thread via GitHub
nsivabalan commented on code in PR #13236: URL: https://github.com/apache/hudi/pull/13236#discussion_r2071223543 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/metadata/HoodieTableMetadataWriter.java: ## @@ -34,7 +38,36 @@ /** * Interface that supports updati

Re: [PR] [HUDI-9281][DNM] Streaming writes to Metadata table [hudi]

2025-05-02 Thread via GitHub
nsivabalan commented on code in PR #13236: URL: https://github.com/apache/hudi/pull/13236#discussion_r2071225779 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/metadata/HoodieBackedTableMetadataWriter.java: ## @@ -1093,6 +1117,139 @@ public void buildMetadataPar

Re: [I] [SUPPORT] In the scene of using Flink to write (upsert) hudi, if index.bootstrap.enabled=false(default), does it mean that the primary key within the partition cannot be guaranteed to be uniqu

2025-05-02 Thread via GitHub
shanzhongkai commented on issue #13244: URL: https://github.com/apache/hudi/issues/13244#issuecomment-2847161032 Understood, thanks for the answer. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

Re: [I] [SUPPORT] In the scene of using Flink to write (upsert) hudi, if index.bootstrap.enabled=false(default), does it mean that the primary key within the partition cannot be guaranteed to be uniqu

2025-05-02 Thread via GitHub
shanzhongkai closed issue #13244: [SUPPORT] In the scene of using Flink to write (upsert) hudi, if index.bootstrap.enabled=false(default), does it mean that the primary key within the partition cannot be guaranteed to be unique? URL: https://github.com/apache/hudi/issues/13244 -- This is an

Re: [PR] [HUDI-9177] Add Indexer abstraction and refactor metadata table initialization logic [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #12983: URL: https://github.com/apache/hudi/pull/12983#issuecomment-2848091838 ## CI report: * 3690cb8aa9d7bc0d8857402e4dc286cce5eb994e UNKNOWN * a662d4a10abfa1e83d587f26684a59fa49e51cc5 UNKNOWN * e7b7f4c87cf6714d92a919cf0b701ac8021f96c9 UNKNOWN *

[jira] [Commented] (HUDI-8881) Potential dataloss in Flink hudi sink

2025-05-02 Thread Zhenqiu Huang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17949019#comment-17949019 ] Zhenqiu Huang commented on HUDI-8881: - [~danny0405] The data loss is identified intern

Re: [PR] [HUDI-9177] Add Indexer abstraction and refactor metadata table initialization logic [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #12983: URL: https://github.com/apache/hudi/pull/12983#issuecomment-2848102280 ## CI report: * 3690cb8aa9d7bc0d8857402e4dc286cce5eb994e UNKNOWN * a662d4a10abfa1e83d587f26684a59fa49e51cc5 UNKNOWN * e7b7f4c87cf6714d92a919cf0b701ac8021f96c9 UNKNOWN *

[jira] [Created] (HUDI-9367) Revisit caching in SparkMetadataTableRecordIndex

2025-05-02 Thread Y Ethan Guo (Jira)
Y Ethan Guo created HUDI-9367: - Summary: Revisit caching in SparkMetadataTableRecordIndex Key: HUDI-9367 URL: https://issues.apache.org/jira/browse/HUDI-9367 Project: Apache Hudi Issue Type: Sub-

Re: [PR] [HUDI-9177] Add Indexer abstraction and refactor metadata table initialization logic [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #12983: URL: https://github.com/apache/hudi/pull/12983#issuecomment-2848119849 ## CI report: * 3690cb8aa9d7bc0d8857402e4dc286cce5eb994e UNKNOWN * a662d4a10abfa1e83d587f26684a59fa49e51cc5 UNKNOWN * e7b7f4c87cf6714d92a919cf0b701ac8021f96c9 UNKNOWN *

Re: [PR] [HUDI-9177] Add Indexer abstraction and refactor metadata table initialization logic [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #12983: URL: https://github.com/apache/hudi/pull/12983#issuecomment-2848151886 ## CI report: * 3690cb8aa9d7bc0d8857402e4dc286cce5eb994e UNKNOWN * a662d4a10abfa1e83d587f26684a59fa49e51cc5 UNKNOWN * e7b7f4c87cf6714d92a919cf0b701ac8021f96c9 UNKNOWN *

Re: [PR] [HUDI-9177] Add Indexer abstraction and refactor metadata table initialization logic [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #12983: URL: https://github.com/apache/hudi/pull/12983#issuecomment-2848153691 ## CI report: * 3690cb8aa9d7bc0d8857402e4dc286cce5eb994e UNKNOWN * a662d4a10abfa1e83d587f26684a59fa49e51cc5 UNKNOWN * e7b7f4c87cf6714d92a919cf0b701ac8021f96c9 UNKNOWN *

Re: [PR] [HUDI-9177] Add Indexer abstraction and refactor metadata table initialization logic [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #12983: URL: https://github.com/apache/hudi/pull/12983#issuecomment-2848165043 ## CI report: * 3690cb8aa9d7bc0d8857402e4dc286cce5eb994e UNKNOWN * a662d4a10abfa1e83d587f26684a59fa49e51cc5 UNKNOWN * e7b7f4c87cf6714d92a919cf0b701ac8021f96c9 UNKNOWN *

Re: [PR] [HUDI-9281][DNM] Streaming writes to Metadata table [hudi]

2025-05-02 Thread via GitHub
nsivabalan commented on code in PR #13236: URL: https://github.com/apache/hudi/pull/13236#discussion_r2071196025 ## hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/client/SparkRDDWriteClient.java: ## @@ -224,45 +327,100 @@ public JavaRDD bulkInsertPreppedRecords(Jav

(hudi) branch asf-site__1.0.2 created (now de647982f99)

2025-05-02 Thread vhs
This is an automated email from the ASF dual-hosted git repository. vhs pushed a change to branch asf-site__1.0.2 in repository https://gitbox.apache.org/repos/asf/hudi.git at de647982f99 [HUDI-8997] Cutting docs for release 1.0.2 This branch includes the following new commits: new

[PR] [HUDI-9366] Cutting docs for release 1.0.2 [hudi]

2025-05-02 Thread via GitHub
voonhous opened a new pull request, #13250: URL: https://github.com/apache/hudi/pull/13250 ### Change Logs Cutting docs for release 1.0.2. Configurations page has been updated in current and 1.0.2. **Important**: release page for 1.0.2 has not been added. ### Imp

Re: [PR] [HUDI-9281][DNM] Streaming writes to Metadata table [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13236: URL: https://github.com/apache/hudi/pull/13236#issuecomment-2846801387 ## CI report: * 6e45bafbc1c6f9564fdf181e6e1798e7f46c8bbc Azure: [FAILURE](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=5255)

Re: [PR] [HUDI-8470] Remove auto commit support in WriteClient [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13229: URL: https://github.com/apache/hudi/pull/13229#issuecomment-2846861522 ## CI report: * ad94c783e5844bd5265efd17ce7958d88a0f69e3 UNKNOWN * a3cce750e58a224581e52218047e61f708891228 UNKNOWN * 28fe02d41a2de6d5c8f7748af57e5f5d84f3ea49 Azure: [SUCC

(hudi) branch asf-site__1.0.2 updated (de647982f99 -> 8f7ff7be2b5)

2025-05-02 Thread vhs
This is an automated email from the ASF dual-hosted git repository. vhs pushed a change to branch asf-site__1.0.2 in repository https://gitbox.apache.org/repos/asf/hudi.git discard de647982f99 [HUDI-8997] Cutting docs for release 1.0.2 add 8f7ff7be2b5 [HUDI-9366] Cutting docs for release 1

Re: [PR] [HUDI-7915] Spark 4 support [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #12772: URL: https://github.com/apache/hudi/pull/12772#issuecomment-2846958467 ## CI report: * c404e6d16eca90d173c6987e92503d84f344f626 UNKNOWN * 4c72fd9e1afdb76cbbaddc9b2fecf9597626f439 UNKNOWN * 7860a3d4abfaab77f7f717baf19454b32149ca41 UNKNOWN *

[jira] [Updated] (HUDI-9366) Cut docs for release 1.0.2

2025-05-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9366: - Labels: pull-request-available (was: ) > Cut docs for release 1.0.2 > --

[jira] [Created] (HUDI-9366) Cut docs for release 1.0.2

2025-05-02 Thread voon (Jira)
voon created HUDI-9366: -- Summary: Cut docs for release 1.0.2 Key: HUDI-9366 URL: https://issues.apache.org/jira/browse/HUDI-9366 Project: Apache Hudi Issue Type: Improvement Reporter: voon

Re: [PR] [HUDI-9281][DNM] Streaming writes to Metadata table [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13236: URL: https://github.com/apache/hudi/pull/13236#issuecomment-2846804039 ## CI report: * 6e45bafbc1c6f9564fdf181e6e1798e7f46c8bbc Azure: [FAILURE](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=5255)

Re: [PR] [HUDI-9366] Cutting docs for release 1.0.2 [hudi]

2025-05-02 Thread via GitHub
voonhous commented on PR #13250: URL: https://github.com/apache/hudi/pull/13250#issuecomment-2846845971 Local validation: Current version refers to 1.0.2. https://github.com/user-attachments/assets/7f4615cc-970f-48e4-99e9-c808f0e7aa81 -- This is an automated

Re: [PR] [HUDI-8470] Remove auto commit support in WriteClient [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13229: URL: https://github.com/apache/hudi/pull/13229#issuecomment-2846859012 ## CI report: * ad94c783e5844bd5265efd17ce7958d88a0f69e3 UNKNOWN * a3cce750e58a224581e52218047e61f708891228 UNKNOWN * 28fe02d41a2de6d5c8f7748af57e5f5d84f3ea49 Azure: [SUCC

Re: [PR] [HUDI-9281][DNM] Streaming writes to Metadata table [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13236: URL: https://github.com/apache/hudi/pull/13236#issuecomment-2846890256 ## CI report: * 1c5de4ae6e060e8bf5f7508b5cbb7bfa2d93ee3d Azure: [FAILURE](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=5267)

Re: [PR] [HUDI-8470] Remove auto commit support in WriteClient [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13229: URL: https://github.com/apache/hudi/pull/13229#issuecomment-2846903980 ## CI report: * ad94c783e5844bd5265efd17ce7958d88a0f69e3 UNKNOWN * a3cce750e58a224581e52218047e61f708891228 UNKNOWN * 28fe02d41a2de6d5c8f7748af57e5f5d84f3ea49 Azure: [SUCC

(hudi) branch asf-site__1.0.2 updated (8f7ff7be2b5 -> 604faa1f719)

2025-05-02 Thread vhs
This is an automated email from the ASF dual-hosted git repository. vhs pushed a change to branch asf-site__1.0.2 in repository https://gitbox.apache.org/repos/asf/hudi.git discard 8f7ff7be2b5 [HUDI-9366] Cutting docs for release 1.0.2 add 604faa1f719 [HUDI-9366] Cutting docs for release 1

Re: [PR] [HUDI-7915] Spark 4 support [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #12772: URL: https://github.com/apache/hudi/pull/12772#issuecomment-2846984619 ## CI report: * c404e6d16eca90d173c6987e92503d84f344f626 UNKNOWN * 4c72fd9e1afdb76cbbaddc9b2fecf9597626f439 UNKNOWN * 7860a3d4abfaab77f7f717baf19454b32149ca41 UNKNOWN *

Re: [PR] [HUDI-7915] Spark 4 support [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #12772: URL: https://github.com/apache/hudi/pull/12772#issuecomment-2846991409 ## CI report: * c404e6d16eca90d173c6987e92503d84f344f626 UNKNOWN * 4c72fd9e1afdb76cbbaddc9b2fecf9597626f439 UNKNOWN * 7860a3d4abfaab77f7f717baf19454b32149ca41 UNKNOWN *

Re: [PR] [HUDI-8470] Remove auto commit support in WriteClient [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13229: URL: https://github.com/apache/hudi/pull/13229#issuecomment-2847030068 ## CI report: * ad94c783e5844bd5265efd17ce7958d88a0f69e3 UNKNOWN * a3cce750e58a224581e52218047e61f708891228 UNKNOWN * f31147cc18073e83f8094e453d792000d83d4768 UNKNOWN *

[jira] [Updated] (HUDI-9368) Use data table instant time for bloom filter records in expression index

2025-05-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9368: - Labels: pull-request-available (was: ) > Use data table instant time for bloom filter records in

[PR] [HUDI-9368] Use data table instant time for bloom filter records in expression index [hudi]

2025-05-02 Thread via GitHub
yihua opened a new pull request, #13252: URL: https://github.com/apache/hudi/pull/13252 ### Change Logs This PR fixes the timestamp stored in the bloom filter records in the expression index to use the data table instant time on which the expression index partition is initialized, to

Re: [PR] [HUDI-9368] Use data table instant time for bloom filter records in expression index [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13252: URL: https://github.com/apache/hudi/pull/13252#issuecomment-2848340437 ## CI report: * 5dea0ef9ee8f8799cd3956ad7e5c84a005637220 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run th

Re: [PR] [HUDI-9368] Use data table instant time for bloom filter records in expression index [hudi]

2025-05-02 Thread via GitHub
yihua merged PR #13252: URL: https://github.com/apache/hudi/pull/13252 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.o

(hudi) branch master updated: [HUDI-9369] Simplify bloom filter config passing in metadata table writer (#13253)

2025-05-02 Thread yihua
This is an automated email from the ASF dual-hosted git repository. yihua pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 30b4e8f3f31 [HUDI-9369] Simplify bloom filter confi

(hudi) branch master updated (e4d01dd791e -> 39b124af219)

2025-05-02 Thread yihua
This is an automated email from the ASF dual-hosted git repository. yihua pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from e4d01dd791e [HUDI-9336] Extract common logic of getting reader for secondary index (#13212) add 39b124af219 [HUDI-9

hudi.git: Error while running github feature from master:.asf.yaml

2025-05-02 Thread Apache Infrastructure
An error occurred while processing the github feature in .asf.yaml: GitHub discussions can only be enabled if a mailing list target exists for it. --- With regards, ASF Infra. For further information, please see the .asf.yaml documentation at: https://github.com/apache/infrastructure-asfyaml/b

Re: [PR] [HUDI-9369] Simplify bloom filter config passing in metadata table writer [hudi]

2025-05-02 Thread via GitHub
yihua merged PR #13253: URL: https://github.com/apache/hudi/pull/13253 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.o

hudi.git: Error while running github feature from master:.asf.yaml

2025-05-02 Thread Apache Infrastructure
An error occurred while processing the github feature in .asf.yaml: GitHub discussions can only be enabled if a mailing list target exists for it. --- With regards, ASF Infra. For further information, please see the .asf.yaml documentation at: https://github.com/apache/infrastructure-asfyaml/b

Re: [PR] [HUDI-9370] Unify logic of fetching files and file slices in the metadata table writer [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13254: URL: https://github.com/apache/hudi/pull/13254#issuecomment-2848443779 ## CI report: * 2930fe0cb12edda519b6e0e2ba48446b6966a53b Azure: [SUCCESS](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=5279)

Re: [PR] [HUDI-9370] Unify logic of fetching files and file slices in the metadata table writer [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13254: URL: https://github.com/apache/hudi/pull/13254#issuecomment-2848444390 ## CI report: * 2930fe0cb12edda519b6e0e2ba48446b6966a53b Azure: [SUCCESS](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=5279)

Re: [PR] [HUDI-9177] Add Indexer abstraction and refactor metadata table initialization logic [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #12983: URL: https://github.com/apache/hudi/pull/12983#issuecomment-2848202326 ## CI report: * 3690cb8aa9d7bc0d8857402e4dc286cce5eb994e UNKNOWN * a662d4a10abfa1e83d587f26684a59fa49e51cc5 UNKNOWN * e7b7f4c87cf6714d92a919cf0b701ac8021f96c9 UNKNOWN *

Re: [PR] [HUDI-9177] Add Indexer abstraction and refactor metadata table initialization logic [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #12983: URL: https://github.com/apache/hudi/pull/12983#issuecomment-2848224447 ## CI report: * 3690cb8aa9d7bc0d8857402e4dc286cce5eb994e UNKNOWN * a662d4a10abfa1e83d587f26684a59fa49e51cc5 UNKNOWN * e7b7f4c87cf6714d92a919cf0b701ac8021f96c9 UNKNOWN *

Re: [PR] [HUDI-8470] Remove auto commit support in WriteClient [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13229: URL: https://github.com/apache/hudi/pull/13229#issuecomment-2848275921 ## CI report: * ad94c783e5844bd5265efd17ce7958d88a0f69e3 UNKNOWN * a3cce750e58a224581e52218047e61f708891228 UNKNOWN * f31147cc18073e83f8094e453d792000d83d4768 UNKNOWN *

Re: [PR] [HUDI-8470] Remove auto commit support in WriteClient [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13229: URL: https://github.com/apache/hudi/pull/13229#issuecomment-2848334457 ## CI report: * ad94c783e5844bd5265efd17ce7958d88a0f69e3 UNKNOWN * a3cce750e58a224581e52218047e61f708891228 UNKNOWN * f31147cc18073e83f8094e453d792000d83d4768 UNKNOWN *

Re: [PR] [HUDI-9177] Add Indexer abstraction and refactor metadata table initialization logic [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #12983: URL: https://github.com/apache/hudi/pull/12983#issuecomment-2848196213 ## CI report: * 3690cb8aa9d7bc0d8857402e4dc286cce5eb994e UNKNOWN * a662d4a10abfa1e83d587f26684a59fa49e51cc5 UNKNOWN * e7b7f4c87cf6714d92a919cf0b701ac8021f96c9 UNKNOWN *

Re: [PR] [HUDI-9177] Add Indexer abstraction and refactor metadata table initialization logic [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #12983: URL: https://github.com/apache/hudi/pull/12983#issuecomment-2848311265 ## CI report: * 3690cb8aa9d7bc0d8857402e4dc286cce5eb994e UNKNOWN * a662d4a10abfa1e83d587f26684a59fa49e51cc5 UNKNOWN * e7b7f4c87cf6714d92a919cf0b701ac8021f96c9 UNKNOWN *

Re: [PR] [HUDI-9368] Use data table instant time for bloom filter records in expression index [hudi]

2025-05-02 Thread via GitHub
danny0405 commented on code in PR #13252: URL: https://github.com/apache/hudi/pull/13252#discussion_r2072307610 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/metadata/HoodieBackedTableMetadataWriter.java: ## @@ -453,7 +454,7 @@ private boolean initializeFromFil

Re: [PR] [HUDI-9369] Simplify bloom filter config passing in metadata table writer [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13253: URL: https://github.com/apache/hudi/pull/13253#issuecomment-2848410217 ## CI report: * feed3655c67ba19a3993c0b47a167e18f6624ab0 Azure: [FAILURE](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=5277)

Re: [PR] [HUDI-9368] Use data table instant time for bloom filter records in expression index [hudi]

2025-05-02 Thread via GitHub
yihua commented on code in PR #13252: URL: https://github.com/apache/hudi/pull/13252#discussion_r2072307869 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/metadata/HoodieBackedTableMetadataWriter.java: ## @@ -453,7 +454,7 @@ private boolean initializeFromFilesys

Re: [PR] [HUDI-9369] Simplify bloom filter config passing in metadata table writer [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13253: URL: https://github.com/apache/hudi/pull/13253#issuecomment-2848410824 ## CI report: * feed3655c67ba19a3993c0b47a167e18f6624ab0 Azure: [FAILURE](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=5277)

Re: [PR] [HUDI-9370] Unify logic of fetching files and file slices in the metadata table writer [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13254: URL: https://github.com/apache/hudi/pull/13254#issuecomment-2848385005 ## CI report: * 2930fe0cb12edda519b6e0e2ba48446b6966a53b UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run th

[PR] [HUDI-9370] Unify logic of fetching files and file slices in the metadata table writer [hudi]

2025-05-02 Thread via GitHub
yihua opened a new pull request, #13254: URL: https://github.com/apache/hudi/pull/13254 ### Change Logs This PR unifies logic of fetching files and file slices in the metadata table writer so the index initialization is only based on two types of information of the file system view:

[jira] [Updated] (HUDI-9370) Unify logic of fetching files and file slices in the metadata table writer

2025-05-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9370: - Labels: pull-request-available (was: ) > Unify logic of fetching files and file slices in the met

Re: [PR] [HUDI-8470] Remove auto commit support in WriteClient [hudi]

2025-05-02 Thread via GitHub
danny0405 commented on code in PR #13229: URL: https://github.com/apache/hudi/pull/13229#discussion_r2072297564 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/metadata/HoodieBackedTableMetadataWriter.java: ## @@ -1406,10 +1407,10 @@ protected void commitInternal

Re: [PR] [HUDI-8470] Remove auto commit support in WriteClient [hudi]

2025-05-02 Thread via GitHub
danny0405 commented on code in PR #13229: URL: https://github.com/apache/hudi/pull/13229#discussion_r2072297046 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/BaseHoodieWriteClient.java: ## @@ -1139,7 +1131,7 @@ public boolean scheduleLogCompactionAtInsta

Re: [PR] [HUDI-8470] Remove auto commit support in WriteClient [hudi]

2025-05-02 Thread via GitHub
danny0405 commented on code in PR #13229: URL: https://github.com/apache/hudi/pull/13229#discussion_r2072298752 ## hudi-common/src/main/java/org/apache/hudi/common/data/HoodieListPairData.java: ## @@ -204,9 +204,9 @@ public HoodiePairData>> leftOuterJoin(HoodiePairData union(H

Re: [PR] [HUDI-8470] Remove auto commit support in WriteClient [hudi]

2025-05-02 Thread via GitHub
danny0405 commented on code in PR #13229: URL: https://github.com/apache/hudi/pull/13229#discussion_r2072298810 ## hudi-common/src/main/java/org/apache/hudi/common/data/HoodiePairData.java: ## @@ -157,4 +157,5 @@ HoodiePairData mapToPair( * @return the deduce number of shuf

Re: [PR] [HUDI-8470] Remove auto commit support in WriteClient [hudi]

2025-05-02 Thread via GitHub
danny0405 commented on code in PR #13229: URL: https://github.com/apache/hudi/pull/13229#discussion_r2072296273 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/BaseHoodieTableServiceClient.java: ## @@ -514,23 +555,72 @@ public boolean purgePendingClusterin

Re: [PR] [HUDI-9370] Unify logic of fetching files and file slices in the metadata table writer [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13254: URL: https://github.com/apache/hudi/pull/13254#issuecomment-2848385648 ## CI report: * 2930fe0cb12edda519b6e0e2ba48446b6966a53b Azure: [PENDING](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=5279)

Re: [PR] [HUDI-8470] Remove auto commit support in WriteClient [hudi]

2025-05-02 Thread via GitHub
danny0405 commented on code in PR #13229: URL: https://github.com/apache/hudi/pull/13229#discussion_r2072298218 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/metadata/HoodieBackedTableMetadataWriter.java: ## @@ -1600,15 +1605,16 @@ void compactIfNecessary(BaseH

Re: [PR] [HUDI-8470] Remove auto commit support in WriteClient [hudi]

2025-05-02 Thread via GitHub
danny0405 commented on code in PR #13229: URL: https://github.com/apache/hudi/pull/13229#discussion_r2072298467 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/metadata/HoodieBackedTableMetadataWriter.java: ## @@ -1418,6 +1419,10 @@ protected void commitInternal(

Re: [PR] [DO-NOT-MERGE] Trigger validate-release-candidate-bundles and validate-release-maven-artifacts for 1.0.2 [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13251: URL: https://github.com/apache/hudi/pull/13251#issuecomment-2848400219 ## CI report: * cc442f31be456430af3026df17f184c2bc21ec86 Azure: [SUCCESS](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=5278)

Re: [PR] [HUDI-8470] Remove auto commit support in WriteClient [hudi]

2025-05-02 Thread via GitHub
danny0405 commented on code in PR #13229: URL: https://github.com/apache/hudi/pull/13229#discussion_r2072303448 ## hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/client/SparkRDDTableServiceClient.java: ## @@ -33,18 +35,35 @@ import org.apache.spark.api.java.JavaR

Re: [PR] [HUDI-8470] Remove auto commit support in WriteClient [hudi]

2025-05-02 Thread via GitHub
danny0405 commented on code in PR #13229: URL: https://github.com/apache/hudi/pull/13229#discussion_r2072305629 ## hudi-utilities/src/main/java/org/apache/hudi/utilities/HoodieCompactor.java: ## @@ -270,6 +270,7 @@ private int doCompact(JavaSparkContext jsc) throws Exception {

Re: [PR] [HUDI-9177] Add Indexer abstraction and refactor metadata table initialization logic [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #12983: URL: https://github.com/apache/hudi/pull/12983#issuecomment-2848200832 ## CI report: * 3690cb8aa9d7bc0d8857402e4dc286cce5eb994e UNKNOWN * a662d4a10abfa1e83d587f26684a59fa49e51cc5 UNKNOWN * e7b7f4c87cf6714d92a919cf0b701ac8021f96c9 UNKNOWN *

Re: [PR] [HUDI-9368] Use data table instant time for bloom filter records in expression index [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13252: URL: https://github.com/apache/hudi/pull/13252#issuecomment-2848341162 ## CI report: * 5dea0ef9ee8f8799cd3956ad7e5c84a005637220 Azure: [PENDING](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=5276)

Re: [PR] [HUDI-8470] Remove auto commit support in WriteClient [hudi]

2025-05-02 Thread via GitHub
danny0405 commented on code in PR #13229: URL: https://github.com/apache/hudi/pull/13229#discussion_r2072273617 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/BaseHoodieTableServiceClient.java: ## @@ -369,6 +393,31 @@ protected void completeCompaction(Hoo

Re: [PR] [HUDI-9368] Use data table instant time for bloom filter records in expression index [hudi]

2025-05-02 Thread via GitHub
yihua commented on code in PR #13252: URL: https://github.com/apache/hudi/pull/13252#discussion_r2072276847 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/metadata/HoodieBackedTableMetadataWriter.java: ## @@ -453,7 +454,7 @@ private boolean initializeFromFilesys

Re: [PR] [HUDI-9369] Simplify bloom filter config passing in metadata table writer [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13253: URL: https://github.com/apache/hudi/pull/13253#issuecomment-2848437280 ## CI report: * a9ba67bd9fcb79d51955b01860b605f4f91e100b Azure: [SUCCESS](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=5280)

Re: [PR] [HUDI-9177] Add Indexer abstraction and refactor metadata table initialization logic [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #12983: URL: https://github.com/apache/hudi/pull/12983#issuecomment-2848245640 ## CI report: * 3690cb8aa9d7bc0d8857402e4dc286cce5eb994e UNKNOWN * a662d4a10abfa1e83d587f26684a59fa49e51cc5 UNKNOWN * e7b7f4c87cf6714d92a919cf0b701ac8021f96c9 UNKNOWN *

Re: [PR] [HUDI-9177] Add Indexer abstraction and refactor metadata table initialization logic [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #12983: URL: https://github.com/apache/hudi/pull/12983#issuecomment-2848251869 ## CI report: * 3690cb8aa9d7bc0d8857402e4dc286cce5eb994e UNKNOWN * a662d4a10abfa1e83d587f26684a59fa49e51cc5 UNKNOWN * e7b7f4c87cf6714d92a919cf0b701ac8021f96c9 UNKNOWN *

Re: [PR] [HUDI-9177] Add Indexer abstraction and refactor metadata table initialization logic [hudi]

2025-05-02 Thread via GitHub
nsivabalan commented on code in PR #12983: URL: https://github.com/apache/hudi/pull/12983#discussion_r2072241431 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/metadata/index/IndexerFactory.java: ## @@ -0,0 +1,107 @@ +/* + * Licensed to the Apache Software Found

Re: [PR] [HUDI-8470] Remove auto commit support in WriteClient [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13229: URL: https://github.com/apache/hudi/pull/13229#issuecomment-2848286104 ## CI report: * ad94c783e5844bd5265efd17ce7958d88a0f69e3 UNKNOWN * a3cce750e58a224581e52218047e61f708891228 UNKNOWN * f31147cc18073e83f8094e453d792000d83d4768 UNKNOWN *

[PR] [HUDI-9369] Simplify bloom filter config passing in metadata table writer [hudi]

2025-05-02 Thread via GitHub
yihua opened a new pull request, #13253: URL: https://github.com/apache/hudi/pull/13253 ### Change Logs This PR simplifies bloom filter config passing in metadata table writer by avoiding passing `metadataWriteConfig` to `SparkMetadataWriterUtils#getExprIndexRecords` and instead usin

Re: [PR] [HUDI-9369] Simplify bloom filter config passing in metadata table writer [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13253: URL: https://github.com/apache/hudi/pull/13253#issuecomment-2848365377 ## CI report: * feed3655c67ba19a3993c0b47a167e18f6624ab0 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run th

[jira] [Updated] (HUDI-9369) Simplify bloom filter config passing in metadate table writer

2025-05-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-9369: - Labels: pull-request-available (was: ) > Simplify bloom filter config passing in metadate table w

Re: [PR] [DO-NOT-MERGE] Trigger validate-release-candidate-bundles and validate-release-maven-artifacts for 1.0.2 [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13251: URL: https://github.com/apache/hudi/pull/13251#issuecomment-2848366505 ## CI report: * 095a2ed4bcc0726f07064db1aeb249c1e0b58c10 Azure: [SUCCESS](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=5270)

Re: [PR] [HUDI-9369] Simplify bloom filter config passing in metadata table writer [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13253: URL: https://github.com/apache/hudi/pull/13253#issuecomment-2848366523 ## CI report: * feed3655c67ba19a3993c0b47a167e18f6624ab0 Azure: [PENDING](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=5277)

Re: [PR] [DO-NOT-MERGE] Trigger validate-release-candidate-bundles and validate-release-maven-artifacts for 1.0.2 [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13251: URL: https://github.com/apache/hudi/pull/13251#issuecomment-2848367789 ## CI report: * 095a2ed4bcc0726f07064db1aeb249c1e0b58c10 Azure: [SUCCESS](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=5270)

Re: [PR] [HUDI-9369] Simplify bloom filter config passing in metadata table writer [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13253: URL: https://github.com/apache/hudi/pull/13253#issuecomment-2848370007 ## CI report: * feed3655c67ba19a3993c0b47a167e18f6624ab0 Azure: [FAILURE](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=5277)

[jira] [Closed] (HUDI-9366) Cut docs for release 1.0.2

2025-05-02 Thread Voon Hou (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Voon Hou closed HUDI-9366. -- Resolution: Fixed > Cut docs for release 1.0.2 > -- > > Key: HUDI-9366 >

Re: [PR] [HUDI-8470] Remove auto commit support in WriteClient [hudi]

2025-05-02 Thread via GitHub
danny0405 commented on code in PR #13229: URL: https://github.com/apache/hudi/pull/13229#discussion_r2072292629 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/action/commit/BaseCommitActionExecutor.java: ## @@ -183,14 +184,26 @@ protected void runPrecommi

[jira] [Updated] (HUDI-9370) Unify logic of fetching files and file slices in the metadata table writer

2025-05-02 Thread Y Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Y Ethan Guo updated HUDI-9370: -- Summary: Unify logic of fetching files and file slices in the metadata table writer (was: Unify logic o

Re: [PR] [HUDI-9368] Use data table instant time for bloom filter records in expression index [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13252: URL: https://github.com/apache/hudi/pull/13252#issuecomment-2848381067 ## CI report: * 5dea0ef9ee8f8799cd3956ad7e5c84a005637220 Azure: [SUCCESS](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=5276)

Re: [PR] [HUDI-8470] Remove auto commit support in WriteClient [hudi]

2025-05-02 Thread via GitHub
danny0405 commented on code in PR #13229: URL: https://github.com/apache/hudi/pull/13229#discussion_r2072294624 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/BaseHoodieTableServiceClient.java: ## @@ -318,25 +326,41 @@ protected HoodieWriteMetadata compac

Re: [PR] [HUDI-9370] Unify logic of fetching files and file slices in the metadata table writer [hudi]

2025-05-02 Thread via GitHub
hudi-bot commented on PR #13254: URL: https://github.com/apache/hudi/pull/13254#issuecomment-2848472397 ## CI report: * a2a0dacbb27ecd8384b0c2537231f47cbd113d51 Azure: [SUCCESS](https://dev.azure.com/apachehudi/a1a51da7-8592-47d4-88dc-fd67bed336bb/_build/results?buildId=5282)
