[GitHub] [hudi] alexeykudinkin commented on pull request #7898: [HUDI-5731] Add guava dependency to Spark and MR bundle

2023-02-08 Thread via GitHub
alexeykudinkin commented on PR #7898: URL: https://github.com/apache/hudi/pull/7898#issuecomment-1423117320 After some discussions we agreed on following - This PR will be landed to unblock 0.13 - On master we revert back to remove guava as well as any excessive shading of

[GitHub] [hudi] hudi-bot commented on pull request #7159: [HUDI-5173]Skip if there is only one file in clusteringGroup

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7159: URL: https://github.com/apache/hudi/pull/7159#issuecomment-1423121210 ## CI report: * 15ecd91180d32c7fa1905c11408f4bc23347e682 UNKNOWN * 605ed5b76709927bb5c3440c627600a0c76f8b21 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7860: [HUDI-5673] Support multi writer for bucket index with guarded lock

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7860: URL: https://github.com/apache/hudi/pull/7860#issuecomment-1423172872 ## CI report: * e72f988f68e3021f857b43b14b8721be3f988df5 Azure:

[GitHub] [hudi] alexeykudinkin commented on pull request #7678: [HUDI-5562] Add maven wrapper

2023-02-08 Thread via GitHub
alexeykudinkin commented on PR #7678: URL: https://github.com/apache/hudi/pull/7678#issuecomment-1423210047 @wuzhenhua01 please update the Jira and this PR description to elaborate on what are the benefits of having the wrapper -- This is an automated message from the Apache Git Service.

[GitHub] [hudi] hudi-bot commented on pull request #7847: [HUDI-5697] Revisiting refreshing of Hudi relations after write operations on the tables

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7847: URL: https://github.com/apache/hudi/pull/7847#issuecomment-1423367248 ## CI report: * 28ce832318206166f9d72f58510d46b50ba652d2 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7752: [MINOR] De-duplicating Iterator implementations

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7752: URL: https://github.com/apache/hudi/pull/7752#issuecomment-1423367110 ## CI report: * 3ebe28ff5e180f1322cbd7621e57daa1234eb1dd Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7886: [HUDI-5726]Fix timestamp field is 8 hours longer than the time

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7886: URL: https://github.com/apache/hudi/pull/7886#issuecomment-1423402514 ## CI report: * 69c39e941d6ee3cc21512b9d41b6fe048a91cc56 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7633: Fix Deletes issued without any prior commits

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7633: URL: https://github.com/apache/hudi/pull/7633#issuecomment-1423518841 ## CI report: * a3e25d91fe89abb52b2019c5f5a68f28a321a1f8 Azure:

[jira] [Created] (HUDI-5734) Fix data lost because skip clustering when incremental batch read in flink

2023-02-08 Thread HBG (Jira)
HBG created HUDI-5734: - Summary: Fix data lost because skip clustering when incremental batch read in flink Key: HUDI-5734 URL: https://issues.apache.org/jira/browse/HUDI-5734 Project: Apache Hudi

[jira] [Updated] (HUDI-5665) Re-use table configs for subsequent writes

2023-02-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-5665: - Labels: pull-request-available (was: ) > Re-use table configs for subsequent writes >

[GitHub] [hudi] nsivabalan opened a new pull request, #7901: [HUDI-5665] Adding support to re-use table configs

2023-02-08 Thread via GitHub
nsivabalan opened a new pull request, #7901: URL: https://github.com/apache/hudi/pull/7901 ### Change Logs - As of now, we expect users to set some of the mandatory fields in every write. For eg, record keys, partition path etc. These cannot change for a given table and gets

[GitHub] [hudi] hudi-bot commented on pull request #7633: Fix Deletes issued without any prior commits

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7633: URL: https://github.com/apache/hudi/pull/7633#issuecomment-1423523385 ## CI report: * a3e25d91fe89abb52b2019c5f5a68f28a321a1f8 Azure:

[GitHub] [hudi] alexeykudinkin commented on pull request #7898: [HUDI-5731] Add guava dependency to Spark and MR bundle

2023-02-08 Thread via GitHub
alexeykudinkin commented on PR #7898: URL: https://github.com/apache/hudi/pull/7898#issuecomment-1423397463 #7900 is addressing this properly by removing unnecessary relocations

[jira] [Created] (HUDI-5733) TestHoodieDeltaStreamer.testHoodieIndexer failure

2023-02-08 Thread Jonathan Vexler (Jira)
Jonathan Vexler created HUDI-5733: - Summary: TestHoodieDeltaStreamer.testHoodieIndexer failure Key: HUDI-5733 URL: https://issues.apache.org/jira/browse/HUDI-5733 Project: Apache Hudi Issue

[GitHub] [hudi] hudi-bot commented on pull request #7872: [HUDI-5716] Cleaning up `Partitioner`s hierarchy

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7872: URL: https://github.com/apache/hudi/pull/7872#issuecomment-1423318544 ## CI report: * 8f42e8c18690c8ae76121c714c2c0cda21841264 UNKNOWN * bb3bd527c1c20fb046c23cd4d34e218fb7a06f82 UNKNOWN * de96182a55e0574c96d2b384734a4808c8ba6399 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7847: [HUDI-5697] Revisiting refreshing of Hudi relations after write operations on the tables

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7847: URL: https://github.com/apache/hudi/pull/7847#issuecomment-1423362089 ## CI report: * 28ce832318206166f9d72f58510d46b50ba652d2 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7891: [HUDI-5728] HoodieTimelineArchiver archives the latest instant before inflight replacecommit

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7891: URL: https://github.com/apache/hudi/pull/7891#issuecomment-1423362343 ## CI report: * 7b6cf690564944cfeacf6d2e29e029f86fddec51 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7900: [HUDI-5731] Cleaning up unnecessary relocation for com.google.common packages

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7900: URL: https://github.com/apache/hudi/pull/7900#issuecomment-1423362413 ## CI report: * a7c7f17108423f5d6f563faec66eb715d1a8f539 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #7752: [MINOR] De-duplicating Iterator implementations

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7752: URL: https://github.com/apache/hudi/pull/7752#issuecomment-1423361929 ## CI report: * 3ebe28ff5e180f1322cbd7621e57daa1234eb1dd Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7885: [MINOR] Make sure FTs are run in GH CI

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7885: URL: https://github.com/apache/hudi/pull/7885#issuecomment-1423312566 ## CI report: * c3d027696958b320912712447bcf41c3f2d28221 UNKNOWN * b132eda24a8e705f210d1116dab543632fa09b0c Azure:

[GitHub] [hudi] liaotian1005 commented on pull request #7633: Fix Deletes issued without any prior commits

2023-02-08 Thread via GitHub
liaotian1005 commented on PR #7633: URL: https://github.com/apache/hudi/pull/7633#issuecomment-1423484954 @hudi-bot run azure

[jira] [Updated] (HUDI-5732) Launching hudi-spark3.3 in EMR cluster w/ OSS spark fails due to timeline server (spark) NoClassDefFound

2023-02-08 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5732: -- Description: I am trying to use hudi-spark3.3. bundle in EMR cluster using OSS spark. 

[jira] [Updated] (HUDI-5732) Launching hudi-spark3.3 in EMR cluster w/ OSS spark fails due to timeline server (spark) NoClassDefFound

2023-02-08 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5732: -- Description: I am trying to use hudi-spark3.3. bundle in EMR cluster using OSS spark. 

[GitHub] [hudi] hudi-bot commented on pull request #7901: [HUDI-5665] Adding support to re-use table configs

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7901: URL: https://github.com/apache/hudi/pull/7901#issuecomment-1423467799 ## CI report: * 4f81cc10efc5863beb8f9656c05fc2e03ce6c7ee UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #7633: Fix Deletes issued without any prior commits

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7633: URL: https://github.com/apache/hudi/pull/7633#issuecomment-1423507868 ## CI report: * a3e25d91fe89abb52b2019c5f5a68f28a321a1f8 Azure:

[jira] [Updated] (HUDI-5734) Fix flink batch read skip clustering data lost

2023-02-08 Thread HBG (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] HBG updated HUDI-5734: -- Summary: Fix flink batch read skip clustering data lost (was: Fix data lost because skip clustering when incremental

[jira] [Updated] (HUDI-5562) Add maven wrapper

2023-02-08 Thread wuzhenhua (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wuzhenhua updated HUDI-5562: Description: In a project that uses Maven and often changes the required version, it might be easier to

[GitHub] [hudi] hudi-bot commented on pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7903: URL: https://github.com/apache/hudi/pull/7903#issuecomment-1423568635 ## CI report: * ee465d312a5953c8b8337d7fa4f6d7dbc97142a2 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7889: [HUDI-5727] Close stable PRs

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7889: URL: https://github.com/apache/hudi/pull/7889#issuecomment-1423255276 ## CI report: * 2297818b2c25f9afa23779c288fdab968d37ad59 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7898: [HUDI-5731] Add guava dependency to Spark and MR bundle

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7898: URL: https://github.com/apache/hudi/pull/7898#issuecomment-1423255409 ## CI report: * e10025521ff7a24978b9f22b1180bce55e2238fe Azure:

[GitHub] [hudi] alexeykudinkin opened a new pull request, #7900: [MINOR] Cleaning up unnecessary relocation for com.google.common packages

2023-02-08 Thread via GitHub
alexeykudinkin opened a new pull request, #7900: URL: https://github.com/apache/hudi/pull/7900 ### Change Logs TBA ### Impact _Describe any public API or user-facing feature change or any performance impact._ ### Risk level (write none, low medium or high below)

[jira] [Assigned] (HUDI-5731) Fix com.google.common classes still being relocated in Hudi Spark bundle

2023-02-08 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin reassigned HUDI-5731: - Assignee: Alexey Kudinkin > Fix com.google.common classes still being relocated in Hudi

[jira] [Updated] (HUDI-5731) Fix com.google.common classes still being relocated in Hudi Spark bundle

2023-02-08 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-5731: -- Summary: Fix com.google.common classes still being relocated in Hudi Spark bundle (was: Add

[GitHub] [hudi] hudi-bot commented on pull request #7900: [HUDI-5731] Cleaning up unnecessary relocation for com.google.common packages

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7900: URL: https://github.com/apache/hudi/pull/7900#issuecomment-1423367486 ## CI report: * a7c7f17108423f5d6f563faec66eb715d1a8f539 Azure:

[GitHub] [hudi] nbeeee opened a new issue, #7902: [SUPPORT].UnresolvedUnionException: Not in union exception occurred when writing data through spark

2023-02-08 Thread via GitHub
nb opened a new issue, #7902: URL: https://github.com/apache/hudi/issues/7902 **_Tips before filing an issue_** - Have you gone through our [FAQs](https://hudi.apache.org/learn/faq/)? - Join the mailing list to engage in conversations and get faster support at

[GitHub] [hudi] hudi-bot commented on pull request #7894: [HUDI-5729]Fix RowDataKeyGen method getRecordKey

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7894: URL: https://github.com/apache/hudi/pull/7894#issuecomment-1423564610 ## CI report: * f5abcc66d670acbf7543915f127c96bd7622e01e Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7903: URL: https://github.com/apache/hudi/pull/7903#issuecomment-1423564644 ## CI report: * ee465d312a5953c8b8337d7fa4f6d7dbc97142a2 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] alexeykudinkin commented on pull request #6240: [HUDI-4482] remove guava and use caffeine instead for cache

2023-02-08 Thread via GitHub
alexeykudinkin commented on PR #6240: URL: https://github.com/apache/hudi/pull/6240#issuecomment-1423211987 @yihua removing Guava is still the right long-term call -- Guava is an eternal source of conflicts and any library should avoid packaging it at all costs

[jira] [Updated] (HUDI-5731) Fix com.google.common classes still being relocated in Hudi Spark bundle

2023-02-08 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-5731: -- Fix Version/s: 0.13.1 > Fix com.google.common classes still being relocated in Hudi Spark

[jira] [Updated] (HUDI-5731) Fix com.google.common classes still being relocated in Hudi Spark bundle

2023-02-08 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-5731: -- Description: As originally reported in:

[GitHub] [hudi] hudi-bot commented on pull request #7885: [MINOR] Make sure FTs are run in GH CI

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7885: URL: https://github.com/apache/hudi/pull/7885#issuecomment-1423355449 ## CI report: * c3d027696958b320912712447bcf41c3f2d28221 UNKNOWN * b132eda24a8e705f210d1116dab543632fa09b0c Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7890: [MINOR] bot.yml ignore more filetype.

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7890: URL: https://github.com/apache/hudi/pull/7890#issuecomment-1423355534 ## CI report: * e97e505b8697f2dfa6cd0d4d42e018204c08215f Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7872: [HUDI-5716] Cleaning up `Partitioner`s hierarchy

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7872: URL: https://github.com/apache/hudi/pull/7872#issuecomment-1423355371 ## CI report: * 8f42e8c18690c8ae76121c714c2c0cda21841264 UNKNOWN * bb3bd527c1c20fb046c23cd4d34e218fb7a06f82 UNKNOWN * de96182a55e0574c96d2b384734a4808c8ba6399 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7901: [HUDI-5665] Adding support to re-use table configs

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7901: URL: https://github.com/apache/hudi/pull/7901#issuecomment-1423472994 ## CI report: * 4f81cc10efc5863beb8f9656c05fc2e03ce6c7ee Azure:

[jira] [Created] (HUDI-5732) Launching hudi-spark3.3 in EMR cluster w/ OSS spark fails due to timeline server (spark) NoClassDefFound

2023-02-08 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-5732: - Summary: Launching hudi-spark3.3 in EMR cluster w/ OSS spark fails due to timeline server (spark) NoClassDefFound Key: HUDI-5732 URL:

[GitHub] [hudi] alexeykudinkin commented on a diff in pull request #7752: [MINOR] De-duplicating Iterator implementations

2023-02-08 Thread via GitHub
alexeykudinkin commented on code in PR #7752: URL: https://github.com/apache/hudi/pull/7752#discussion_r1100759052 ## hudi-common/src/main/java/org/apache/hudi/common/util/ClosableIterator.java: ## @@ -24,8 +24,29 @@ * An iterator that give a chance to release resources. *

[jira] [Updated] (HUDI-5734) Fix flink batch read skip clustering data lost

2023-02-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-5734: - Labels: pull-request-available (was: ) > Fix flink batch read skip clustering data lost >

[GitHub] [hudi] hbgstc123 opened a new pull request, #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
hbgstc123 opened a new pull request, #7903: URL: https://github.com/apache/hudi/pull/7903 ### Change Logs Disable the skip_clustering config for Flink incremental batch reads, because skip_clustering could lose data when old commits are cleaned. ### Impact no ###

[GitHub] [hudi] wuzhenhua01 commented on pull request #7678: [HUDI-5562] Add maven wrapper

2023-02-08 Thread via GitHub
wuzhenhua01 commented on PR #7678: URL: https://github.com/apache/hudi/pull/7678#issuecomment-1423537795 cc @alexeykudinkin

[jira] [Updated] (HUDI-915) Partition Columns missing in files upserted after Metadata Bootstrap

2023-02-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-915: Labels: pull-request-available (was: ) > Partition Columns missing in files upserted after Metadata

[GitHub] [hudi] hudi-bot commented on pull request #7868: [HUDI-1593] Add support for "show restores" and "show restore" commands in hudi-cli

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7868: URL: https://github.com/apache/hudi/pull/7868#issuecomment-1423600470 ## CI report: * 5b6f539ecdc4ba84b7b509b43bf4c3836c575dca Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7804: [HUDI-915][HUDI-5656] Rebased `HoodieBootstrapRelation` onto `HoodieBaseRelation`

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7804: URL: https://github.com/apache/hudi/pull/7804#issuecomment-1423600291 ## CI report: * 214938fa79f087400977256140ef633dace60663 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7860: [HUDI-5673] Support multi writer for bucket index with guarded lock

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7860: URL: https://github.com/apache/hudi/pull/7860#issuecomment-1423600420 ## CI report: * e72f988f68e3021f857b43b14b8721be3f988df5 Azure:

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1100978930 ## hudi-flink-datasource/hudi-flink/src/main/java/org/apache/hudi/table/HoodieTableSource.java: ## @@ -380,7 +380,8 @@ private List buildFileIndex() {

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1100978930 ## hudi-flink-datasource/hudi-flink/src/main/java/org/apache/hudi/table/HoodieTableSource.java: ## @@ -380,7 +380,8 @@ private List buildFileIndex() {

[jira] [Updated] (HUDI-5736) De-coupling column drop flag and schema validation flag in Flink

2023-02-08 Thread Alexander Trushev (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander Trushev updated HUDI-5736: Component/s: flink writer-core > De-coupling column drop flag and schema

[jira] [Updated] (HUDI-5736) De-coupling column drop flag and schema validation flag in Flink

2023-02-08 Thread Alexander Trushev (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander Trushev updated HUDI-5736: Description: Fix https://issues.apache.org/jira/browse/HUDI-5704 for Flink engine (was:

[jira] [Updated] (HUDI-5736) De-coupling column drop flag and schema validation flag in Flink

2023-02-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-5736: - Labels: pull-request-available (was: ) > De-coupling column drop flag and schema validation flag

[GitHub] [hudi] hudi-bot commented on pull request #7872: [HUDI-5716] Cleaning up `Partitioner`s hierarchy

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7872: URL: https://github.com/apache/hudi/pull/7872#issuecomment-1423652742 ## CI report: * 8f42e8c18690c8ae76121c714c2c0cda21841264 UNKNOWN * bb3bd527c1c20fb046c23cd4d34e218fb7a06f82 UNKNOWN * 20cba2df6bd792c5173b2ef7780ca093e2fac2b5 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7895: [HUDI-5736] De-coupling column drop flag and schema validation flag in Flink

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7895: URL: https://github.com/apache/hudi/pull/7895#issuecomment-1423652843 ## CI report: * 3b6dbf0bcc7059a7dc4a5132bf45d0d1451d7fb0 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1100996687 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101042060 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101042060 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void

[GitHub] [hudi] Zouxxyy commented on pull request #7876: [MINOR] Improve RunClusteringProcedure with partition selected

2023-02-08 Thread via GitHub
Zouxxyy commented on PR #7876: URL: https://github.com/apache/hudi/pull/7876#issuecomment-1423718585 Hi, In fact, `RunClusteringProcedure with partition selected` has been supported

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101042060 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void

[GitHub] [hudi] hbgstc123 commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
hbgstc123 commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101048291 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void

[GitHub] [hudi] hbgstc123 commented on pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
hbgstc123 commented on PR #7903: URL: https://github.com/apache/hudi/pull/7903#issuecomment-1423760246 > thanks for review and advice

[GitHub] [hudi] pramodbiligiri commented on a diff in pull request #7864: [HUDI-5688] Small workaround that can prevent NPE of EmptyRelation.schema

2023-02-08 Thread via GitHub
pramodbiligiri commented on code in PR #7864: URL: https://github.com/apache/hudi/pull/7864#discussion_r1100984391 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/DefaultSource.scala: ## @@ -241,7 +241,12 @@ object DefaultSource { } if

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1100996687 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101012578 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void

[GitHub] [hudi] hudi-bot commented on pull request #7885: [MINOR] Make sure FTs are run in GH CI

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7885: URL: https://github.com/apache/hudi/pull/7885#issuecomment-1423683208 ## CI report: * c3d027696958b320912712447bcf41c3f2d28221 UNKNOWN * fe84e4662e1853b8ab23484e0c3a679e52a9d1cb Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7895: [HUDI-5736] De-coupling column drop flag and schema validation flag in Flink

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7895: URL: https://github.com/apache/hudi/pull/7895#issuecomment-1423683310 ## CI report: * 3b6dbf0bcc7059a7dc4a5132bf45d0d1451d7fb0 UNKNOWN * 192a62704a96fe3c67e5017d624e456b6722f02f UNKNOWN Bot commands @hudi-bot supports the

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101042060 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void

[GitHub] [hudi] hudi-bot commented on pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7903: URL: https://github.com/apache/hudi/pull/7903#issuecomment-1423736291 ## CI report: * ee465d312a5953c8b8337d7fa4f6d7dbc97142a2 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7752: [MINOR] De-duplicating Iterator implementations

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7752: URL: https://github.com/apache/hudi/pull/7752#issuecomment-1423735518 ## CI report: * dec6178b4b835160cc59964bdd25ad7fb1fdd41e Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7903: URL: https://github.com/apache/hudi/pull/7903#issuecomment-1423742547 ## CI report: * ee465d312a5953c8b8337d7fa4f6d7dbc97142a2 Azure:

[GitHub] [hudi] hbgstc123 commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
hbgstc123 commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101048291 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void

[GitHub] [hudi] liaotian1005 commented on pull request #7633: Fix Deletes issued without any prior commits

2023-02-08 Thread via GitHub
liaotian1005 commented on PR #7633: URL: https://github.com/apache/hudi/pull/7633#issuecomment-1423572273 @hudi-bot run azure

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1100991761 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void

[GitHub] [hudi] hudi-bot commented on pull request #7904: [HUDI-5735] Flink-hudi write time format data UTC time zone problem

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7904: URL: https://github.com/apache/hudi/pull/7904#issuecomment-1423697590 ## CI report: * ac26a880833f1f19aea723f17a13c6efbd86f5ca Azure:

[jira] [Created] (HUDI-5735) Fix: Flink-hudi write time format data UTC time zone problem

2023-02-08 Thread luckily (Jira)
luckily created HUDI-5735: - Summary: Fix: Flink-hudi write time format data UTC time zone problem Key: HUDI-5735 URL: https://issues.apache.org/jira/browse/HUDI-5735 Project: Apache Hudi Issue

[GitHub] [hudi] hudi-bot commented on pull request #7633: Fix Deletes issued without any prior commits

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7633: URL: https://github.com/apache/hudi/pull/7633#issuecomment-1423600051 ## CI report: * a3e25d91fe89abb52b2019c5f5a68f28a321a1f8 Azure:

[GitHub] [hudi] koochiswathiTR commented on issue #3739: Hoodie clean is not deleting old files

2023-02-08 Thread via GitHub
koochiswathiTR commented on issue #3739: URL: https://github.com/apache/hudi/issues/3739#issuecomment-1423611752 @nsivabalan, @vinothchandar, @bhasudha, @bvaradar, @n3nash Cleanup triggers after compaction? or Cleanup runs when an upsert on hudi dataset? or cleanup triggers when

[GitHub] [hudi] liaotian1005 opened a new pull request, #7904: [HUDI-5735] Flink-hudi write time format data UTC time zone problem

2023-02-08 Thread via GitHub
liaotian1005 opened a new pull request, #7904: URL: https://github.com/apache/hudi/pull/7904 Flink-hudi write data type of timestamp format UTC time zone problem. 1. The time zone written by flink is local, but the time zone read is not local 2. flink writes timestamp data, but

[GitHub] [hudi] koochiswathiTR commented on issue #3739: Hoodie clean is not deleting old files

2023-02-08 Thread via GitHub
koochiswathiTR commented on issue #3739: URL: https://github.com/apache/hudi/issues/3739#issuecomment-1423673376 Can we change and schedule cleanup so that it runs only in one batch? The processing time of the other batches would then be faster. @nsivabalan

[GitHub] [hudi] hudi-bot commented on pull request #7904: [HUDI-5735] Flink-hudi write time format data UTC time zone problem

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7904: URL: https://github.com/apache/hudi/pull/7904#issuecomment-1423693027 ## CI report: * ac26a880833f1f19aea723f17a13c6efbd86f5ca UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #7895: [HUDI-5736] De-coupling column drop flag and schema validation flag in Flink

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7895: URL: https://github.com/apache/hudi/pull/7895#issuecomment-1423692951 ## CI report: * 3b6dbf0bcc7059a7dc4a5132bf45d0d1451d7fb0 UNKNOWN * 192a62704a96fe3c67e5017d624e456b6722f02f Azure:

[GitHub] [hudi] hbgstc123 commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
hbgstc123 commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101029083 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101028481 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void

[GitHub] [hudi] hbgstc123 commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
hbgstc123 commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1101069152 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -359,6 +360,39 @@ void

[GitHub] [hudi] hudi-bot commented on pull request #7808: [MINOR] use ExecutorFactory in BootstrapHandler

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7808: URL: https://github.com/apache/hudi/pull/7808#issuecomment-1423604986 ## CI report: * dbedf67bd39cf8ff13b7dbe1294be86bc5f9718f Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7804: [HUDI-915][HUDI-5656] Rebased `HoodieBootstrapRelation` onto `HoodieBaseRelation`

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7804: URL: https://github.com/apache/hudi/pull/7804#issuecomment-1423604915 ## CI report: * 214938fa79f087400977256140ef633dace60663 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7860: [HUDI-5673] Support multi writer for bucket index with guarded lock

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7860: URL: https://github.com/apache/hudi/pull/7860#issuecomment-1423605075 ## CI report: * e72f988f68e3021f857b43b14b8721be3f988df5 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7633: Fix Deletes issued without any prior commits

2023-02-08 Thread via GitHub
hudi-bot commented on PR #7633: URL: https://github.com/apache/hudi/pull/7633#issuecomment-1423604711 ## CI report: * 8b89f3d81e3df42d79d5e1a55672bb9beefee0a9 Azure:

[GitHub] [hudi] SteNicholas commented on a diff in pull request #7903: [HUDI-5734]Fix flink batch read skip clustering data lost

2023-02-08 Thread via GitHub
SteNicholas commented on code in PR #7903: URL: https://github.com/apache/hudi/pull/7903#discussion_r1100978401 ## hudi-flink-datasource/hudi-flink/src/main/java/org/apache/hudi/table/HoodieTableSource.java: ## @@ -380,7 +380,8 @@ private List buildFileIndex() {

[jira] [Updated] (HUDI-5672) Lockless multi writer support

2023-02-08 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen updated HUDI-5672: - Summary: Lockless multi writer support (was: Flink multi writer support) > Lockless multi writer support

[GitHub] [hudi] veenaypatil commented on issue #6014: [SUPPORT] High runtime for a batch in SparkWriteHelper stage

2023-02-08 Thread via GitHub
veenaypatil commented on issue #6014: URL: https://github.com/apache/hudi/issues/6014#issuecomment-1423646531 @nsivabalan sorry for late response on this issue, I am not seeing this issue as of now, we were only seeing this issue when we killed the job and restarted it. > I see

[GitHub] [hudi] pan3793 commented on pull request #7900: [HUDI-5731] Cleaning up unnecessary relocation for com.google.common packages

2023-02-08 Thread via GitHub
pan3793 commented on PR #7900: URL: https://github.com/apache/hudi/pull/7900#issuecomment-1423650689 Thanks for fixing this issue. And I think curator should be relocated/removed as well. The issue happens on Kyuubi IT because 1. Kyuubi invokes the curator to access ZK 2.

[GitHub] [hudi] voonhous commented on pull request #6868: [Hudi-4882] Multiple ordering fields and null value update for partial update to handle out-of-order events

2023-02-08 Thread via GitHub
voonhous commented on PR #6868: URL: https://github.com/apache/hudi/pull/6868#issuecomment-1423752841 Commenting for visibility -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[jira] [Updated] (HUDI-5735) Fix: Flink-hudi write time format data UTC time zone problem

2023-02-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-5735: - Labels: pull-request-available (was: ) > Fix: Flink-hudi write time format data UTC time zone
