[GitHub] [hudi] hudi-bot commented on pull request #6815: [HUDI-4937][Stacked on 7702] Fix `HoodieTable` injecting non-reusable `HoodieBackedTableMetadata` aggressively flushing MT readers

2023-01-19 Thread GitBox
hudi-bot commented on PR #6815: URL: https://github.com/apache/hudi/pull/6815#issuecomment-1397910792 ## CI report: * 13fb78850890b96b86b66d7df060feb11950ec0c UNKNOWN * 29de073c80985fa18576e7a01ca47b61d32ac944 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6133: [HUDI-1575] Early Conflict Detection For Multi-writer

2023-01-19 Thread GitBox
hudi-bot commented on PR #6133: URL: https://github.com/apache/hudi/pull/6133#issuecomment-1397910520 ## CI report: * dbe3db845908d261baa5a1aa71d19e0db55816de UNKNOWN * 678cce4a9748cb54a90a559384a0cb0443082535 UNKNOWN * 6fc5bf1ce7921bf25acc3659565457264d8b9dc2 UNKNOWN *

[GitHub] [hudi] BalaMahesh commented on pull request #7687: Update to handle deletes in postgres debezium

2023-01-19 Thread GitBox
BalaMahesh commented on PR #7687: URL: https://github.com/apache/hudi/pull/7687#issuecomment-1397908663 > Thanks @BalaMahesh can we fire a JIRA issue and change the commit title to: [HUDI-${JIRA_ID}] ${you commit title} @danny0405 - How do I get access to jira to create the issue

[GitHub] [hudi] hudi-bot commented on pull request #7713: [HUDI-5589] Fix Hudi config inference

2023-01-19 Thread GitBox
hudi-bot commented on PR #7713: URL: https://github.com/apache/hudi/pull/7713#issuecomment-1397908288 ## CI report: * 198a828b76b654e4b8f3ef8ac133f672a682cdf8 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7703: [HUDI-1575][DO NOT MERGE] Testing early conflict detection with feature flag enabled by default

2023-01-19 Thread GitBox
hudi-bot commented on PR #7703: URL: https://github.com/apache/hudi/pull/7703#issuecomment-1397908217 ## CI report: * d5a19b738146a7003c7957b3a1cd63c5cfbd348d Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7702: [HUDI-5579] Fixing Kryo registration to be properly wired into Spark sessions

2023-01-19 Thread GitBox
hudi-bot commented on PR #7702: URL: https://github.com/apache/hudi/pull/7702#issuecomment-1397908181 ## CI report: * 384a9774018272e13b967817b0e48b1596a23dcc UNKNOWN * 45b15748e01531fb144e37f5b04b34b811ab1474 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7512: [HUDI-5417] support to read avro from non-legacy map/list in parquet log

2023-01-19 Thread GitBox
hudi-bot commented on PR #7512: URL: https://github.com/apache/hudi/pull/7512#issuecomment-1397907882 ## CI report: * e24249f8bc11200e709f5c6b873904085ba61d3b Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6133: [HUDI-1575] Early Conflict Detection For Multi-writer

2023-01-19 Thread GitBox
hudi-bot commented on PR #6133: URL: https://github.com/apache/hudi/pull/6133#issuecomment-1397907106 ## CI report: * dbe3db845908d261baa5a1aa71d19e0db55816de UNKNOWN * 678cce4a9748cb54a90a559384a0cb0443082535 UNKNOWN * 6fc5bf1ce7921bf25acc3659565457264d8b9dc2 UNKNOWN *

[GitHub] [hudi] hudi-bot commented on pull request #7703: [HUDI-1575][DO NOT MERGE] Testing early conflict detection with feature flag enabled by default

2023-01-19 Thread GitBox
hudi-bot commented on PR #7703: URL: https://github.com/apache/hudi/pull/7703#issuecomment-1397903924 ## CI report: * 60e2e430cd0915cc2270cad6277d6327c9f47a15 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7713: [HUDI-5589] Fix Hudi config inference

2023-01-19 Thread GitBox
hudi-bot commented on PR #7713: URL: https://github.com/apache/hudi/pull/7713#issuecomment-1397904015 ## CI report: * 198a828b76b654e4b8f3ef8ac133f672a682cdf8 Azure:

[GitHub] [hudi] alexeykudinkin commented on a diff in pull request #7512: [HUDI-5417] support to read avro from non-legacy map/list in parquet log

2023-01-19 Thread GitBox
alexeykudinkin commented on code in PR #7512: URL: https://github.com/apache/hudi/pull/7512#discussion_r1082093165 ## hudi-common/src/main/java/org/apache/parquet/avro/HoodieAvroReadSupport.java: ## @@ -0,0 +1,133 @@ +/* + * Licensed to the Apache Software Foundation (ASF)

[GitHub] [hudi] alexeykudinkin commented on a diff in pull request #7714: [HUDI-5549] SparkRecordManager support avro data block

2023-01-19 Thread GitBox
alexeykudinkin commented on code in PR #7714: URL: https://github.com/apache/hudi/pull/7714#discussion_r1082080586 ## hudi-client/hudi-spark-client/src/main/scala/org/apache/spark/sql/HoodieInternalRowUtils.scala: ## @@ -210,6 +220,26 @@ object HoodieInternalRowUtils { }

[GitHub] [hudi] alexeykudinkin commented on a diff in pull request #7512: [HUDI-5417] support to read avro from non-legacy map/list in parquet log

2023-01-19 Thread GitBox
alexeykudinkin commented on code in PR #7512: URL: https://github.com/apache/hudi/pull/7512#discussion_r1082078272 ## hudi-common/src/main/java/org/apache/parquet/avro/HoodieAvroParquetReaderBuilder.java: ## @@ -0,0 +1,44 @@ +/* + * Licensed to the Apache Software Foundation

[GitHub] [hudi] hudi-bot commented on pull request #7714: [HUDI-5549] SparkRecordManager support avro data block

2023-01-19 Thread GitBox
hudi-bot commented on PR #7714: URL: https://github.com/apache/hudi/pull/7714#issuecomment-1397872078 ## CI report: * 462a4041585cfeb6c16354bbc8b964bdd08ed301 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7714: [HUDI-5549] SparkRecordManager support avro data block

2023-01-19 Thread GitBox
hudi-bot commented on PR #7714: URL: https://github.com/apache/hudi/pull/7714#issuecomment-1397865891 ## CI report: * 462a4041585cfeb6c16354bbc8b964bdd08ed301 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #7713: [HUDI-5589] Fix Hudi config inference

2023-01-19 Thread GitBox
hudi-bot commented on PR #7713: URL: https://github.com/apache/hudi/pull/7713#issuecomment-1397865854 ## CI report: * 198a828b76b654e4b8f3ef8ac133f672a682cdf8 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7703: [HUDI-1575][DO NOT MERGE] Testing early conflict detection with feature flag enabled by default

2023-01-19 Thread GitBox
hudi-bot commented on PR #7703: URL: https://github.com/apache/hudi/pull/7703#issuecomment-1397865798 ## CI report: * 60e2e430cd0915cc2270cad6277d6327c9f47a15 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7512: [HUDI-5417] support to read avro from non-legacy map/list in parquet log

2023-01-19 Thread GitBox
hudi-bot commented on PR #7512: URL: https://github.com/apache/hudi/pull/7512#issuecomment-1397865498 ## CI report: * e24249f8bc11200e709f5c6b873904085ba61d3b Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6384: [HUDI-4613] Avoid the use of regex expressions when call hoodieFileGroup#addLogFile function

2023-01-19 Thread GitBox
hudi-bot commented on PR #6384: URL: https://github.com/apache/hudi/pull/6384#issuecomment-1397864762 ## CI report: * 57263199ad155c0513e54d197ff6998c9531a783 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7713: [HUDI-5589] Fix Hudi config inference

2023-01-19 Thread GitBox
hudi-bot commented on PR #7713: URL: https://github.com/apache/hudi/pull/7713#issuecomment-1397862421 ## CI report: * 198a828b76b654e4b8f3ef8ac133f672a682cdf8 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #7703: [HUDI-1575][DO NOT MERGE] Testing early conflict detection with feature flag enabled by default

2023-01-19 Thread GitBox
hudi-bot commented on PR #7703: URL: https://github.com/apache/hudi/pull/7703#issuecomment-1397862363 ## CI report: * 60e2e430cd0915cc2270cad6277d6327c9f47a15 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7512: [HUDI-5417] support to read avro from non-legacy map/list in parquet log

2023-01-19 Thread GitBox
hudi-bot commented on PR #7512: URL: https://github.com/apache/hudi/pull/7512#issuecomment-1397861977 ## CI report: * e24249f8bc11200e709f5c6b873904085ba61d3b Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6384: [HUDI-4613] Avoid the use of regex expressions when call hoodieFileGroup#addLogFile function

2023-01-19 Thread GitBox
hudi-bot commented on PR #6384: URL: https://github.com/apache/hudi/pull/6384#issuecomment-1397861259 ## CI report: * 57263199ad155c0513e54d197ff6998c9531a783 Azure:

[GitHub] [hudi] wzx140 opened a new pull request, #7714: [HUDI-5549] SparkRecordManager support avro data block

2023-01-19 Thread GitBox
wzx140 opened a new pull request, #7714: URL: https://github.com/apache/hudi/pull/7714 ### Change Logs SparkRecordManager support avro data block ### Impact Not change public API ### Risk level (write none, low medium or high below) None ###

[GitHub] [hudi] hudi-bot commented on pull request #7232: [HUDI-5235] clustering target size should larger than small file limit

2023-01-19 Thread GitBox
hudi-bot commented on PR #7232: URL: https://github.com/apache/hudi/pull/7232#issuecomment-1397857797 ## CI report: * 08239e5b8d4d49da4b5b3d814233251f81b3d0b0 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7159: [HUDI-5173]Skip if there is only one file in clusteringGroup

2023-01-19 Thread GitBox
hudi-bot commented on PR #7159: URL: https://github.com/apache/hudi/pull/7159#issuecomment-1397857684 ## CI report: * 15ecd91180d32c7fa1905c11408f4bc23347e682 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] wzx140 commented on a diff in pull request #7512: [HUDI-5417] support to read avro from non-legacy map/list in parquet log

2023-01-19 Thread GitBox
wzx140 commented on code in PR #7512: URL: https://github.com/apache/hudi/pull/7512#discussion_r1082063980 ## hudi-common/src/main/java/org/apache/parquet/avro/HoodieAvroParquetReaderBuilder.java: ## @@ -0,0 +1,44 @@ +/* + * Licensed to the Apache Software Foundation (ASF)

[GitHub] [hudi] zhuanshenbsj1 commented on pull request #7232: [HUDI-5235] clustering target size should larger than small file limit

2023-01-19 Thread GitBox
zhuanshenbsj1 commented on PR #7232: URL: https://github.com/apache/hudi/pull/7232#issuecomment-1397856020 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [hudi] zhuanshenbsj1 commented on pull request #7159: [HUDI-5173]Skip if there is only one file in clusteringGroup

2023-01-19 Thread GitBox
zhuanshenbsj1 commented on PR #7159: URL: https://github.com/apache/hudi/pull/7159#issuecomment-1397855643 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [hudi] danny0405 merged pull request #7626: [HUDI-5516] Reduce memory footprint on workload with thousand active partitions

2023-01-19 Thread GitBox
danny0405 merged PR #7626: URL: https://github.com/apache/hudi/pull/7626 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] danny0405 commented on pull request #7626: [HUDI-5516] Reduce memory footprint on workload with thousand active partitions

2023-01-19 Thread GitBox
danny0405 commented on PR #7626: URL: https://github.com/apache/hudi/pull/7626#issuecomment-1397828189 All the Flink related tests are passed so would merge the PR soon ~ -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and

[GitHub] [hudi] danny0405 merged pull request #7677: [HUDI-5559] Support CDC for flink bounded source

2023-01-19 Thread GitBox
danny0405 merged PR #7677: URL: https://github.com/apache/hudi/pull/7677 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] yihua merged pull request #7612: [HUDI-5336] Fixing log file pattern match to ignore extraneous files

2023-01-19 Thread GitBox
yihua merged PR #7612: URL: https://github.com/apache/hudi/pull/7612 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] yihua opened a new pull request, #7713: [HUDI-5589] Fix Hudi config inference

2023-01-19 Thread GitBox
yihua opened a new pull request, #7713: URL: https://github.com/apache/hudi/pull/7713 ### Change Logs For a config defined by `ConfigProperty` with no default value set, e.g., `DynamoDbBasedLockConfig.DYNAMODB_LOCK_PARTITION_KEY`, even if the inference function is set with

[GitHub] [hudi] xushiyan opened a new pull request, #7712: [WIP][DNM] Check CI timeout

2023-01-19 Thread GitBox
xushiyan opened a new pull request, #7712: URL: https://github.com/apache/hudi/pull/7712 ### Change Logs _Describe context and summary for this change. Highlight if any code was copied._ ### Impact _Describe any public API or user-facing feature change or any

[GitHub] [hudi] hemanth-gowda-12 commented on issue #7654: [SUPPORT] Starvation on Hudi Java Client in OCC mode

2023-01-19 Thread GitBox
hemanth-gowda-12 commented on issue #7654: URL: https://github.com/apache/hudi/issues/7654#issuecomment-1397816372 @fengjian428 One additional thing, doing a tree on the output dir, I see duplicates on upserts sometimes. (Looking at the different file groups under partition 8).

[GitHub] [hudi] hudi-bot commented on pull request #7705: Consistent Hashing CI Test

2023-01-19 Thread GitBox
hudi-bot commented on PR #7705: URL: https://github.com/apache/hudi/pull/7705#issuecomment-1397810986 ## CI report: * 4798f6bd730194dcecfc517aaa737c568c09157b UNKNOWN * c4f67651c860abedb1396f5d3212760c8691145c Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7704: [HUDI-5583] multiple writers dead lock while cleaning

2023-01-19 Thread GitBox
hudi-bot commented on PR #7704: URL: https://github.com/apache/hudi/pull/7704#issuecomment-1397810949 ## CI report: * 0ee4dc0391d7e1d4b0e155ceabf2a16f4de160bb UNKNOWN * 65ea612da8874c1d67cee52d38cc29c943f7395e Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7694: [HUDI-5572] Flink write need to skip check the compatibility of Schem…

2023-01-19 Thread GitBox
hudi-bot commented on PR #7694: URL: https://github.com/apache/hudi/pull/7694#issuecomment-1397810870 ## CI report: * 97fdc558722b8d5152f9e21112045adb73eca9fe Azure:

[GitHub] [hudi] nsivabalan commented on a diff in pull request #7711: [HUDI-5569] Files written by first commit/delta commit if it failed are detected as valid data files

2023-01-19 Thread GitBox
nsivabalan commented on code in PR #7711: URL: https://github.com/apache/hudi/pull/7711#discussion_r1082026700 ## hudi-common/src/main/java/org/apache/hudi/common/table/view/AbstractTableFileSystemView.java: ## @@ -104,9 +104,9 @@ private String

[GitHub] [hudi] hudi-bot commented on pull request #7512: [HUDI-5417] support to read avro from non-legacy map/list in parquet log

2023-01-19 Thread GitBox
hudi-bot commented on PR #7512: URL: https://github.com/apache/hudi/pull/7512#issuecomment-1397810482 ## CI report: * e24249f8bc11200e709f5c6b873904085ba61d3b Azure:

[GitHub] [hudi] LinMingQiang commented on pull request #7694: [HUDI-5572] Flink write need to skip check the compatibility of Schem…

2023-01-19 Thread GitBox
LinMingQiang commented on PR #7694: URL: https://github.com/apache/hudi/pull/7694#issuecomment-1397808311 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [hudi] alexeykudinkin commented on pull request #7702: [HUDI-5579] Fixing Kryo registration to be properly wired into Spark sessions

2023-01-19 Thread GitBox
alexeykudinkin commented on PR #7702: URL: https://github.com/apache/hudi/pull/7702#issuecomment-1397788798 > but this is a very broad usability change. we should have brought this up for highlighting earlier. Agreed, not ideal, but unavoidable unfortunately -- w/o we'd be passing

[GitHub] [hudi] xushiyan commented on pull request #7702: [HUDI-5579] Fixing Kryo registration to be properly wired into Spark sessions

2023-01-19 Thread GitBox
xushiyan commented on PR #7702: URL: https://github.com/apache/hudi/pull/7702#issuecomment-1397760535 > --conf spark.kryo.registrator=org.apache.spark.HoodieSparkKryoRegistrar but this is a very broad usability change. we should have brought this up for highlighting earlier. --

[GitHub] [hudi] hudi-bot commented on pull request #7711: [HUDI-5569] Files written by first commit/delta commit if it failed are detected as valid data files

2023-01-19 Thread GitBox
hudi-bot commented on PR #7711: URL: https://github.com/apache/hudi/pull/7711#issuecomment-1397759988 ## CI report: * fb1f2609baf5b3f12a4ca5243f5205f2ba8f6367 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7661: [DO NOT MERGE] Release testing record merger

2023-01-19 Thread GitBox
hudi-bot commented on PR #7661: URL: https://github.com/apache/hudi/pull/7661#issuecomment-1397759656 ## CI report: * f698f26db2314cbbbee30d37df0d6fd343317796 UNKNOWN * 4a2dbb50cff97211589a22059ac7fb1ffcf605a8 UNKNOWN * 36b64551a0b0acb165f89c5a5052bcda46bb4ee4 Azure:

[GitHub] [hudi] nsivabalan commented on pull request #7612: [HUDI-5336] Fixing log file pattern match to ignore extraneous files

2023-01-19 Thread GitBox
nsivabalan commented on PR #7612: URL: https://github.com/apache/hudi/pull/7612#issuecomment-1397758256 Failed due to flaky flink IT test ![Screen Shot 2023-01-19 at 3 57 41 PM](https://user-images.githubusercontent.com/513218/213588359-65a53609-4caf-4cb7-b3e0-07c4a705a772.png) --

[GitHub] [hudi] hemanth-gowda-12 commented on issue #7654: [SUPPORT] Starvation on Hudi Java Client in OCC mode

2023-01-19 Thread GitBox
hemanth-gowda-12 commented on issue #7654: URL: https://github.com/apache/hudi/issues/7654#issuecomment-1397757982 @fengjian428 , I see the same issue adding the above line. Will we need to create a Jira for the above? Again, thanks a lot for the effort. ```

[GitHub] [hudi] hudi-bot commented on pull request #7711: [HUDI-5569] Files written by first commit/delta commit if it failed are detected as valid data files

2023-01-19 Thread GitBox
hudi-bot commented on PR #7711: URL: https://github.com/apache/hudi/pull/7711#issuecomment-1397755517 ## CI report: * fb1f2609baf5b3f12a4ca5243f5205f2ba8f6367 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #7661: [DO NOT MERGE] Release testing record merger

2023-01-19 Thread GitBox
hudi-bot commented on PR #7661: URL: https://github.com/apache/hudi/pull/7661#issuecomment-1397755335 ## CI report: * f698f26db2314cbbbee30d37df0d6fd343317796 UNKNOWN * c8e78a45c0001e62346b403ebb257ddfc80c213e Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7642: [HUDI-5534][Stacked on 6815] Optimizing Bloom Index lookup when using Bloom Filters from Metadata Table

2023-01-19 Thread GitBox
hudi-bot commented on PR #7642: URL: https://github.com/apache/hudi/pull/7642#issuecomment-1397755283 ## CI report: * dd82fb612d88d6c9ba4f06be2989ec5061052047 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7612: [HUDI-5336] Fixing log file pattern match to ignore extraneous files

2023-01-19 Thread GitBox
hudi-bot commented on PR #7612: URL: https://github.com/apache/hudi/pull/7612#issuecomment-1397749993 ## CI report: * 18c64ac0163f66dbbfb2ffd9ecebfd5982536e78 Azure:

[GitHub] [hudi] alexeykudinkin commented on a diff in pull request #7661: [DO NOT MERGE] Release testing record merger

2023-01-19 Thread GitBox
alexeykudinkin commented on code in PR #7661: URL: https://github.com/apache/hudi/pull/7661#discussion_r1081969266 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/HoodieAppendHandle.java: ## @@ -599,21 +600,22 @@ private void

[GitHub] [hudi] jonvex opened a new pull request, #7711: [HUDI-5569] Files written by first commit/delta commit if it failed are detected as valid data files

2023-01-19 Thread GitBox
jonvex opened a new pull request, #7711: URL: https://github.com/apache/hudi/pull/7711 ### Change Logs _Describe context and summary for this change. Highlight if any code was copied._ ### Impact _Describe any public API or user-facing feature change or any performance

[GitHub] [hudi] hudi-bot commented on pull request #7702: [HUDI-5579] Fixing Kryo registration to be properly wired into Spark sessions

2023-01-19 Thread GitBox
hudi-bot commented on PR #7702: URL: https://github.com/apache/hudi/pull/7702#issuecomment-1397701234 ## CI report: * 384a9774018272e13b967817b0e48b1596a23dcc UNKNOWN * 61e621587610d74d3d3820d68d1f18ddaa4538d8 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7642: [HUDI-5534][Stacked on 6815] Optimizing Bloom Index lookup when using Bloom Filters from Metadata Table

2023-01-19 Thread GitBox
hudi-bot commented on PR #7642: URL: https://github.com/apache/hudi/pull/7642#issuecomment-1397700970 ## CI report: * dd82fb612d88d6c9ba4f06be2989ec5061052047 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6815: [HUDI-4937][Stacked on 7702] Fix `HoodieTable` injecting non-reusable `HoodieBackedTableMetadata` aggressively flushing MT readers

2023-01-19 Thread GitBox
hudi-bot commented on PR #6815: URL: https://github.com/apache/hudi/pull/6815#issuecomment-1397699828 ## CI report: * 13fb78850890b96b86b66d7df060feb11950ec0c UNKNOWN * 1aa280f0c422a12057c3478290c7d8cef171516d Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7702: [HUDI-5579] Fixing Kryo registration to be properly wired into Spark sessions

2023-01-19 Thread GitBox
hudi-bot commented on PR #7702: URL: https://github.com/apache/hudi/pull/7702#issuecomment-1397694848 ## CI report: * 384a9774018272e13b967817b0e48b1596a23dcc UNKNOWN * 61e621587610d74d3d3820d68d1f18ddaa4538d8 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6815: [HUDI-4937][Stacked on 7702] Fix `HoodieTable` injecting non-reusable `HoodieBackedTableMetadata` aggressively flushing MT readers

2023-01-19 Thread GitBox
hudi-bot commented on PR #6815: URL: https://github.com/apache/hudi/pull/6815#issuecomment-1397693509 ## CI report: * 13fb78850890b96b86b66d7df060feb11950ec0c UNKNOWN * 1aa280f0c422a12057c3478290c7d8cef171516d Azure:

[GitHub] [hudi] alexeykudinkin commented on a diff in pull request #7512: [HUDI-5417] support to read avro from non-legacy map/list in parquet log

2023-01-19 Thread GitBox
alexeykudinkin commented on code in PR #7512: URL: https://github.com/apache/hudi/pull/7512#discussion_r1081925427 ## hudi-common/src/main/java/org/apache/parquet/avro/HoodieAvroParquetReaderBuilder.java: ## @@ -0,0 +1,44 @@ +/* + * Licensed to the Apache Software Foundation

[GitHub] [hudi] hudi-bot commented on pull request #7706: [HUDI-5585][flink]Fix flink creates and writes the table, the spark alter table reports an error

2023-01-19 Thread GitBox
hudi-bot commented on PR #7706: URL: https://github.com/apache/hudi/pull/7706#issuecomment-1397688233 ## CI report: * 126951c4f2e2581ffbfb996df3d2ea325290f7f6 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7702: [HUDI-5579] Fixing Kryo registration to be properly wired into Spark sessions

2023-01-19 Thread GitBox
hudi-bot commented on PR #7702: URL: https://github.com/apache/hudi/pull/7702#issuecomment-1397688130 ## CI report: * 384a9774018272e13b967817b0e48b1596a23dcc UNKNOWN * 61e621587610d74d3d3820d68d1f18ddaa4538d8 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6815: [HUDI-4937][Stacked on 7702] Fix `HoodieTable` injecting non-reusable `HoodieBackedTableMetadata` aggressively flushing MT readers

2023-01-19 Thread GitBox
hudi-bot commented on PR #6815: URL: https://github.com/apache/hudi/pull/6815#issuecomment-1397686276 ## CI report: * 13fb78850890b96b86b66d7df060feb11950ec0c UNKNOWN * 1aa280f0c422a12057c3478290c7d8cef171516d Azure:

[GitHub] [hudi] alexeykudinkin commented on a diff in pull request #6384: [HUDI-4613] Avoid the use of regex expressions when call hoodieFileGroup#addLogFile function

2023-01-19 Thread GitBox
alexeykudinkin commented on code in PR #6384: URL: https://github.com/apache/hudi/pull/6384#discussion_r1081919293 ## hudi-common/src/main/java/org/apache/hudi/common/model/HoodieLogFile.java: ## @@ -48,59 +52,72 @@ public class HoodieLogFile implements Serializable {

[GitHub] [hudi] hudi-bot commented on pull request #7423: [HUDI-5384] Adding optimization rule to appropriately push down filters into the `HoodieFileIndex`

2023-01-19 Thread GitBox
hudi-bot commented on PR #7423: URL: https://github.com/apache/hudi/pull/7423#issuecomment-1397615128 ## CI report: * 78a6da0b0d5d65f8e7f4c59b495a2820e1f9877f UNKNOWN * cc679d4d5e34a00ab07be711b35dce914bd4b466 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7423: [HUDI-5384] Adding optimization rule to appropriately push down filters into the `HoodieFileIndex`

2023-01-19 Thread GitBox
hudi-bot commented on PR #7423: URL: https://github.com/apache/hudi/pull/7423#issuecomment-1397607321 ## CI report: * 78a6da0b0d5d65f8e7f4c59b495a2820e1f9877f UNKNOWN * cc679d4d5e34a00ab07be711b35dce914bd4b466 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7661: [DO NOT MERGE] Release testing record merger

2023-01-19 Thread GitBox
hudi-bot commented on PR #7661: URL: https://github.com/apache/hudi/pull/7661#issuecomment-1397598033 ## CI report: * f698f26db2314cbbbee30d37df0d6fd343317796 UNKNOWN * c8e78a45c0001e62346b403ebb257ddfc80c213e Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7423: [HUDI-5384] Adding optimization rule to appropriately push down filters into the `HoodieFileIndex`

2023-01-19 Thread GitBox
hudi-bot commented on PR #7423: URL: https://github.com/apache/hudi/pull/7423#issuecomment-1397596949 ## CI report: * 78a6da0b0d5d65f8e7f4c59b495a2820e1f9877f UNKNOWN * cc679d4d5e34a00ab07be711b35dce914bd4b466 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7232: [HUDI-5235] clustering target size should larger than small file limit

2023-01-19 Thread GitBox
hudi-bot commented on PR #7232: URL: https://github.com/apache/hudi/pull/7232#issuecomment-1397596345 ## CI report: * 08239e5b8d4d49da4b5b3d814233251f81b3d0b0 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7709: [HUDI-5582] Do not let users override internal metadata configs

2023-01-19 Thread GitBox
hudi-bot commented on PR #7709: URL: https://github.com/apache/hudi/pull/7709#issuecomment-1397583495 ## CI report: * 8048762c6d565c09289564d812c4bba77cc90d61 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7680: [HUDI-5548] spark sql show|update hudi's table properties

2023-01-19 Thread GitBox
hudi-bot commented on PR #7680: URL: https://github.com/apache/hudi/pull/7680#issuecomment-1397583044 ## CI report: * df3a787ab69d1a3ac0ff854b671699e0a55dc01d Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7677: [HUDI-5559] Support CDC for flink bounded source

2023-01-19 Thread GitBox
hudi-bot commented on PR #7677: URL: https://github.com/apache/hudi/pull/7677#issuecomment-1397582901 ## CI report: * 21392a9c797c024316f63df08774542f6a6a32b7 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7661: [DO NOT MERGE] Release testing record merger

2023-01-19 Thread GitBox
hudi-bot commented on PR #7661: URL: https://github.com/apache/hudi/pull/7661#issuecomment-1397582619 ## CI report: * f698f26db2314cbbbee30d37df0d6fd343317796 UNKNOWN * c8e78a45c0001e62346b403ebb257ddfc80c213e Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7626: [HUDI-5516] Reduce memory footprint on workload with thousand active partitions

2023-01-19 Thread GitBox
hudi-bot commented on PR #7626: URL: https://github.com/apache/hudi/pull/7626#issuecomment-1397582224 ## CI report: * 1cfe72cb4a06010b19c96abf25b3ebf9d7f6e895 Azure:

[GitHub] [hudi] nsivabalan commented on pull request #7709: [HUDI-5582] Do not let users override internal metadata configs

2023-01-19 Thread GitBox
nsivabalan commented on PR #7709: URL: https://github.com/apache/hudi/pull/7709#issuecomment-1397580211 have cancelled CI run to prioritize other patches. please re-trigger later tonight. -- This is an automated message from the Apache Git Service. To respond to the message, please

[GitHub] [hudi] nsivabalan commented on a diff in pull request #7709: [HUDI-5582] Do not let users override internal metadata configs

2023-01-19 Thread GitBox
nsivabalan commented on code in PR #7709: URL: https://github.com/apache/hudi/pull/7709#discussion_r1081777492 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/metadata/HoodieBackedTableMetadataWriter.java: ## @@ -235,10 +235,7 @@ private void

[GitHub] [hudi] zhuanshenbsj1 commented on pull request #7159: [HUDI-5173]Skip if there is only one file in clusteringGroup

2023-01-19 Thread GitBox
zhuanshenbsj1 commented on PR #7159: URL: https://github.com/apache/hudi/pull/7159#issuecomment-1397504797 > @danny0405, @zhuanshenbsj1, IMO, the benefit of this pull request is not great for incremental clustering, meanwhile there is no benefit for full Clustering. Therefore, this change

[GitHub] [hudi] hudi-bot commented on pull request #7661: [DO NOT MERGE] Release testing record merger

2023-01-19 Thread GitBox
hudi-bot commented on PR #7661: URL: https://github.com/apache/hudi/pull/7661#issuecomment-1397503939 ## CI report: * f698f26db2314cbbbee30d37df0d6fd343317796 UNKNOWN * c8e78a45c0001e62346b403ebb257ddfc80c213e Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6384: [HUDI-4613] Avoid the use of regex expressions when call hoodieFileGroup#addLogFile function

2023-01-19 Thread GitBox
hudi-bot commented on PR #6384: URL: https://github.com/apache/hudi/pull/6384#issuecomment-1397494254 ## CI report: * 57263199ad155c0513e54d197ff6998c9531a783 Azure:

[GitHub] [hudi] vectorijk opened a new pull request, #7710: remove duplicated match clause in MergeInto syntax

2023-01-19 Thread GitBox
vectorijk opened a new pull request, #7710: URL: https://github.com/apache/hudi/pull/7710 ### Documentation Update remove duplicated match clause in MergeInto syntax ### Contributor's checklist - [x] Read through [contributor's

[GitHub] [hudi] hudi-bot commented on pull request #7709: [HUDI-5582] Do not let users override internal metadata configs

2023-01-19 Thread GitBox
hudi-bot commented on PR #7709: URL: https://github.com/apache/hudi/pull/7709#issuecomment-1397400826 ## CI report: * 170178b86673e1382178a1cb0149e90c7959d9f7 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7702: [HUDI-5579] Fixing Kryo registration to be properly wired into Spark sessions

2023-01-19 Thread GitBox
hudi-bot commented on PR #7702: URL: https://github.com/apache/hudi/pull/7702#issuecomment-1397400645 ## CI report: * 384a9774018272e13b967817b0e48b1596a23dcc UNKNOWN * e95e38b33e134cbc74d4c08065fe0c2ae1418937 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7642: [HUDI-5534][Stacked on 6815] Optimizing Bloom Index lookup when using Bloom Filters from Metadata Table

2023-01-19 Thread GitBox
hudi-bot commented on PR #7642: URL: https://github.com/apache/hudi/pull/7642#issuecomment-1397400077 ## CI report: * dd82fb612d88d6c9ba4f06be2989ec5061052047 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6815: [HUDI-4937][Stacked on 7702] Fix `HoodieTable` injecting non-reusable `HoodieBackedTableMetadata` aggressively flushing MT readers

2023-01-19 Thread GitBox
hudi-bot commented on PR #6815: URL: https://github.com/apache/hudi/pull/6815#issuecomment-1397398092 ## CI report: * 13fb78850890b96b86b66d7df060feb11950ec0c UNKNOWN * e33e7f6ab1c2b54bdb255e09ff557ac9f49b3ef9 Azure:

[GitHub] [hudi] zhuanshenbsj1 commented on pull request #7159: [HUDI-5173]Skip if there is only one file in clusteringGroup

2023-01-19 Thread GitBox
zhuanshenbsj1 commented on PR #7159: URL: https://github.com/apache/hudi/pull/7159#issuecomment-1397391259 > @danny0405, @zhuanshenbsj1, IMO, the benefit of this pull request is not great for incremental clustering, meanwhile there is no benefit for full Clustering. Therefore, this change

[GitHub] [hudi] hudi-bot commented on pull request #7709: [HUDI-5582] Do not let users override internal metadata configs

2023-01-19 Thread GitBox
hudi-bot commented on PR #7709: URL: https://github.com/apache/hudi/pull/7709#issuecomment-1397390030 ## CI report: * 170178b86673e1382178a1cb0149e90c7959d9f7 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7702: [HUDI-5579] Fixing Kryo registration to be properly wired into Spark sessions

2023-01-19 Thread GitBox
hudi-bot commented on PR #7702: URL: https://github.com/apache/hudi/pull/7702#issuecomment-1397389852 ## CI report: * 384a9774018272e13b967817b0e48b1596a23dcc UNKNOWN * e95e38b33e134cbc74d4c08065fe0c2ae1418937 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7642: [HUDI-5534][Stacked on 6815] Optimizing Bloom Index lookup when using Bloom Filters from Metadata Table

2023-01-19 Thread GitBox
hudi-bot commented on PR #7642: URL: https://github.com/apache/hudi/pull/7642#issuecomment-1397389424 ## CI report: * 6988dfce1a01dbb2e89bffabd0d2475134015756 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7159: [HUDI-5173]Skip if there is only one file in clusteringGroup

2023-01-19 Thread GitBox
hudi-bot commented on PR #7159: URL: https://github.com/apache/hudi/pull/7159#issuecomment-1397387785 ## CI report: * 15ecd91180d32c7fa1905c11408f4bc23347e682 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #6815: [HUDI-4937][Stacked on 7702] Fix `HoodieTable` injecting non-reusable `HoodieBackedTableMetadata` aggressively flushing MT readers

2023-01-19 Thread GitBox
hudi-bot commented on PR #6815: URL: https://github.com/apache/hudi/pull/6815#issuecomment-1397386572 ## CI report: * 13fb78850890b96b86b66d7df060feb11950ec0c UNKNOWN * e33e7f6ab1c2b54bdb255e09ff557ac9f49b3ef9 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #7709: [HUDI-5582] Do not let users override internal metadata configs

2023-01-19 Thread GitBox
hudi-bot commented on PR #7709: URL: https://github.com/apache/hudi/pull/7709#issuecomment-1397377343 ## CI report: * 170178b86673e1382178a1cb0149e90c7959d9f7 Azure:

[GitHub] [hudi] zhuanshenbsj1 commented on pull request #7159: [HUDI-5173]Skip if there is only one file in clusteringGroup

2023-01-19 Thread GitBox
zhuanshenbsj1 commented on PR #7159: URL: https://github.com/apache/hudi/pull/7159#issuecomment-1397373241 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [hudi] alexeykudinkin commented on a diff in pull request #7702: [HUDI-5579] Fixing Kryo registration to be properly wired into Spark sessions

2023-01-19 Thread GitBox
alexeykudinkin commented on code in PR #7702: URL: https://github.com/apache/hudi/pull/7702#discussion_r1081601845 ## hudi-client/hudi-spark-client/src/main/scala/org/apache/spark/HoodieSparkKryoRegistrar.scala: ## @@ -42,22 +43,31 @@ import

[GitHub] [hudi] alexeykudinkin closed pull request #7478: [WIP][HUDI-5354] Fixing CSI index test

2023-01-19 Thread GitBox
alexeykudinkin closed pull request #7478: [WIP][HUDI-5354] Fixing CSI index test URL: https://github.com/apache/hudi/pull/7478 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

[GitHub] [hudi] alexeykudinkin merged pull request #6782: [HUDI-4911][HUDI-3301] Fixing `HoodieMetadataLogRecordReader` to avoid flushing cache for every lookup

2023-01-19 Thread GitBox
alexeykudinkin merged PR #6782: URL: https://github.com/apache/hudi/pull/6782 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] alexeykudinkin commented on pull request #6782: [HUDI-4911][HUDI-3301] Fixing `HoodieMetadataLogRecordReader` to avoid flushing cache for every lookup

2023-01-19 Thread GitBox
alexeykudinkin commented on PR #6782: URL: https://github.com/apache/hudi/pull/6782#issuecomment-1397321632 CI is green: https://user-images.githubusercontent.com/428277/213512868-ea6da426-deb7-46b7-b5c7-0270069648c1.png;>

[GitHub] [hudi] hudi-bot commented on pull request #7709: [HUDI-5582] Do not let users override internal metadata configs

2023-01-19 Thread GitBox
hudi-bot commented on PR #7709: URL: https://github.com/apache/hudi/pull/7709#issuecomment-1397281145 ## CI report: * 170178b86673e1382178a1cb0149e90c7959d9f7 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] jonvex opened a new pull request, #7709: [HUDI-5582] Do not let users override internal metadata configs

2023-01-19 Thread GitBox
jonvex opened a new pull request, #7709: URL: https://github.com/apache/hudi/pull/7709 ### Change Logs hoodie.metadata.clean.async hoodie.metadata.cleaner.commits.retained hoodie.metadata.enable.full.scan.log.files hoodie.metadata.insert.parallelism

[GitHub] [hudi] hudi-bot commented on pull request #7702: [HUDI-5579] Fixing Kryo registration to be properly wired into Spark sessions

2023-01-19 Thread GitBox
hudi-bot commented on PR #7702: URL: https://github.com/apache/hudi/pull/7702#issuecomment-1397247078 ## CI report: * 384a9774018272e13b967817b0e48b1596a23dcc UNKNOWN * e95e38b33e134cbc74d4c08065fe0c2ae1418937 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6782: [HUDI-4911][HUDI-3301] Fixing `HoodieMetadataLogRecordReader` to avoid flushing cache for every lookup

2023-01-19 Thread GitBox
hudi-bot commented on PR #6782: URL: https://github.com/apache/hudi/pull/6782#issuecomment-1397242092 ## CI report: * 51f169f9e2c563c69cf1c922cbfa7d84e47958c6 Azure:

<    1   2   3   4   5   6   7   8   9   10   >