[GitHub] [hudi] hudi-bot commented on pull request #5093: [HUDI-3539] Flink bucket index bucketID bootstrap optimization.

2022-03-27 Thread GitBox
hudi-bot commented on pull request #5093: URL: https://github.com/apache/hudi/pull/5093#issuecomment-1080223446 ## CI report: * 84fbc2eee9f31fdf66823f10f4e023bfaf2e2a09 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #5093: [HUDI-3539] Flink bucket index bucketID bootstrap optimization.

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #5093: URL: https://github.com/apache/hudi/pull/5093#issuecomment-1080221964 ## CI report: * 84fbc2eee9f31fdf66823f10f4e023bfaf2e2a09 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5093: [HUDI-3539] Flink bucket index bucketID bootstrap optimization.

2022-03-27 Thread GitBox
hudi-bot commented on pull request #5093: URL: https://github.com/apache/hudi/pull/5093#issuecomment-1080221964 ## CI report: * 84fbc2eee9f31fdf66823f10f4e023bfaf2e2a09 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #5093: [HUDI-3539] Flink bucket index bucketID bootstrap optimization.

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #5093: URL: https://github.com/apache/hudi/pull/5093#issuecomment-1077913093 ## CI report: * 84fbc2eee9f31fdf66823f10f4e023bfaf2e2a09 Azure:

[jira] [Commented] (HUDI-1180) Upgrade HBase to 2.x

2022-03-27 Thread rex xiong (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17513153#comment-17513153 ] rex xiong commented on HUDI-1180: - [~guoyihua] For Hadoop 3.2.1, the current HBase 2.x version has incompatible

[jira] [Updated] (HUDI-1180) Upgrade HBase to 2.x

2022-03-27 Thread rex xiong (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] rex xiong updated HUDI-1180: Attachment: image-2022-03-28-13-48-58-149.png > Upgrade HBase to 2.x > > >

[GitHub] [hudi] hudi-bot commented on pull request #5042: Three bulk_insert files are concurrently submitted and executed with a difference of 2s, the insert fails occasionally.

2022-03-27 Thread GitBox
hudi-bot commented on pull request #5042: URL: https://github.com/apache/hudi/pull/5042#issuecomment-1080214434 ## CI report: * 7ee24be4d11864af37bf300250d571e15d5f9ae9 UNKNOWN * 9a9e544ba48a52c7b54134fc9533c3e5a51ccfff UNKNOWN * f69bbf06e9cb669dfe0785b5eee8501ba56871fe

[GitHub] [hudi] hudi-bot removed a comment on pull request #5042: Three bulk_insert files are concurrently submitted and executed with a difference of 2s, the insert fails occasionally.

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #5042: URL: https://github.com/apache/hudi/pull/5042#issuecomment-1080113560 ## CI report: * 7ee24be4d11864af37bf300250d571e15d5f9ae9 UNKNOWN * 9a9e544ba48a52c7b54134fc9533c3e5a51ccfff UNKNOWN *

[GitHub] [hudi] hudi-bot commented on pull request #4945: [HUDI-3538] Support Compaction Command Based on Call Procedure Command for Spark SQL

2022-03-27 Thread GitBox
hudi-bot commented on pull request #4945: URL: https://github.com/apache/hudi/pull/4945#issuecomment-1080193543 ## CI report: * faf4bc6134e15983b4199ce972f72f8471b1207f Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4945: [HUDI-3538] Support Compaction Command Based on Call Procedure Command for Spark SQL

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #4945: URL: https://github.com/apache/hudi/pull/4945#issuecomment-1080113468 ## CI report: * 3802bb3e07923b1904f141d62fb744667493ddfb Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #5147: [HUDI-2520] fix drop partition issue when sync to hive

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #5147: URL: https://github.com/apache/hudi/pull/5147#issuecomment-1080110471 ## CI report: * 38dd1f251d021d49f65f9a414f5bec9b8a2440da Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5147: [HUDI-2520] fix drop partition issue when sync to hive

2022-03-27 Thread GitBox
hudi-bot commented on pull request #5147: URL: https://github.com/apache/hudi/pull/5147#issuecomment-1080178077 ## CI report: * 58e26cd6c9b233b731173c8f3d648864dd2f2941 Azure:

[jira] [Commented] (HUDI-3097) Address dependency issue with hudi-trino-bundle in connector

2022-03-27 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17513129#comment-17513129 ] Sagar Sumit commented on HUDI-3097: --- I see this is addressed in

[GitHub] [hudi] codope commented on a change in pull request #4693: [HUDI-2488][HUDI-3175] Implement async metadata indexing

2022-03-27 Thread GitBox
codope commented on a change in pull request #4693: URL: https://github.com/apache/hudi/pull/4693#discussion_r836041910 ## File path: hudi-common/src/main/java/org/apache/hudi/common/table/timeline/HoodieDefaultTimeline.java ## @@ -112,6 +114,25 @@ public

[jira] [Updated] (HUDI-3720) Rollback reattempt fails if the commit to roll back is not present

2022-03-27 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-3720: Status: Patch Available (was: In Progress) > Rollback reattempt fails if the commit to roll back is not

[GitHub] [hudi] watermelon12138 commented on pull request #4925: [HUDI-3103] Enable MultiTableDeltaStreamer to update a single sink table from multiple source tables

2022-03-27 Thread GitBox
watermelon12138 commented on pull request #4925: URL: https://github.com/apache/hudi/pull/4925#issuecomment-1080153842 @xushiyan Thank you for your attention to this. I hope to land this in the next few days for 0.11.0; I'm debugging the unit tests. -- This is an automated message from the

[GitHub] [hudi] codope commented on a change in pull request #4693: [HUDI-2488][HUDI-3175] Implement async metadata indexing

2022-03-27 Thread GitBox
codope commented on a change in pull request #4693: URL: https://github.com/apache/hudi/pull/4693#discussion_r836034276 ## File path: hudi-common/src/main/java/org/apache/hudi/common/bloom/BloomFilter.java ## @@ -30,6 +34,13 @@ */ void add(String key); + /** + *

[jira] [Reopened] (HUDI-3368) Support metadata bloom index for secondary keys

2022-03-27 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit reopened HUDI-3368: --- > Support metadata bloom index for secondary keys > --- > >

[jira] [Updated] (HUDI-3727) Add metrics for async indexer

2022-03-27 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3727: -- Issue Type: Task (was: Improvement) > Add metrics for async indexer > - >

[jira] [Updated] (HUDI-3718) Support concurrent writes while dropping index

2022-03-27 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3718: -- Issue Type: Improvement (was: New Feature) > Support concurrent writes while dropping index >

[jira] [Updated] (HUDI-1590) Support async clustering w/ test suite job

2022-03-27 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1590: - Fix Version/s: (was: 0.11.0) > Support async clustering w/ test suite job >

[jira] [Commented] (HUDI-1590) Support async clustering w/ test suite job

2022-03-27 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17513122#comment-17513122 ] Raymond Xu commented on HUDI-1590: -- [~legendtkl] no worries.  > Support async clustering w/ test suite

[GitHub] [hudi] hudi-bot commented on pull request #5148: [HUDI-3720] Fix the logic of reattempting pending rollback

2022-03-27 Thread GitBox
hudi-bot commented on pull request #5148: URL: https://github.com/apache/hudi/pull/5148#issuecomment-1080142872 ## CI report: * e900e4c8bbaa2c71f469aaf5af5c19f74f779424 UNKNOWN * 23c299a42c07675dd47c2f2129b2b95d780a Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #5148: [HUDI-3720] Fix the logic of reattempting pending rollback

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #5148: URL: https://github.com/apache/hudi/pull/5148#issuecomment-1080094062 ## CI report: * 0f0fc30c8c43e2aee2b291295072695e6a36be43 Azure:

[GitHub] [hudi] XuQianJin-Stars commented on a change in pull request #5060: [HUDI-3652] Make ObjectSizeCalculator threadlocal to reduce memory footprint

2022-03-27 Thread GitBox
XuQianJin-Stars commented on a change in pull request #5060: URL: https://github.com/apache/hudi/pull/5060#discussion_r836029823 ## File path: hudi-common/src/main/java/org/apache/hudi/common/util/ObjectSizeCalculator.java ## @@ -54,6 +54,8 @@ * @author Attila Szegedi */

[GitHub] [hudi] hudi-bot removed a comment on pull request #5087: [HUDI-3614] [DO_NOT_MERGE]Replace List with HoodieData in HoodieFlink/JavaTable and commit executors

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #5087: URL: https://github.com/apache/hudi/pull/5087#issuecomment-1080119531 ## CI report: * cb5e663b30f62899e8f518b378fa4061b2416f77 UNKNOWN * 170a4c1691307de27e688e0b043d1477a2a9a65e UNKNOWN *

[GitHub] [hudi] hudi-bot commented on pull request #5087: [HUDI-3614] [DO_NOT_MERGE]Replace List with HoodieData in HoodieFlink/JavaTable and commit executors

2022-03-27 Thread GitBox
hudi-bot commented on pull request #5087: URL: https://github.com/apache/hudi/pull/5087#issuecomment-1080141340 ## CI report: * cb5e663b30f62899e8f518b378fa4061b2416f77 UNKNOWN * 170a4c1691307de27e688e0b043d1477a2a9a65e UNKNOWN * e157958ddf6adb265f9f084aa3dcb26ce0c9cb16

[jira] [Updated] (HUDI-2520) Certify sync with Hive 3

2022-03-27 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2520: - Status: Patch Available (was: In Progress) > Certify sync with Hive 3 > > >

[GitHub] [hudi] Guanpx commented on issue #5150: [SUPPORT] bucket_bulk_insert so slow and generate too many hdfs small flie with Flink BUCKET index

2022-03-27 Thread GitBox
Guanpx commented on issue #5150: URL: https://github.com/apache/hudi/issues/5150#issuecomment-1080140118 > See the fix here: #5151 Thank you very much, I will try again now.

[GitHub] [hudi] codope commented on a change in pull request #4693: [HUDI-2488][HUDI-3175] Implement async metadata indexing

2022-03-27 Thread GitBox
codope commented on a change in pull request #4693: URL: https://github.com/apache/hudi/pull/4693#discussion_r836028462 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/action/index/RunIndexActionExecutor.java ## @@ -0,0 +1,263 @@ +/* + *

[GitHub] [hudi] hudi-bot commented on pull request #5151: [minor] Set up the sort operator parallelism to avoid data shuffle

2022-03-27 Thread GitBox
hudi-bot commented on pull request #5151: URL: https://github.com/apache/hudi/pull/5151#issuecomment-1080139944 ## CI report: * dd456954a218cd11b2fd5840f7df2ce3ead4012b Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #5151: [minor] Set up the sort operator parallelism to avoid data shuffle

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #5151: URL: https://github.com/apache/hudi/pull/5151#issuecomment-1080138206 ## CI report: * dd456954a218cd11b2fd5840f7df2ce3ead4012b UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot

[GitHub] [hudi] xushiyan commented on pull request #5060: [HUDI-3652] Make ObjectSizeCalculator threadlocal to reduce memory footprint

2022-03-27 Thread GitBox
xushiyan commented on pull request #5060: URL: https://github.com/apache/hudi/pull/5060#issuecomment-1080139809 @sekaiga can you please rebase on master? This is a small fix we can land for 0.11.

[GitHub] [hudi] danny0405 commented on a change in pull request #5141: [HUDI-3724] Fixing closure of ParquetReader

2022-03-27 Thread GitBox
danny0405 commented on a change in pull request #5141: URL: https://github.com/apache/hudi/pull/5141#discussion_r836027994 ## File path: hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/HoodieBaseRelation.scala ## @@ -333,7 +335,13 @@ object

[GitHub] [hudi] codope commented on a change in pull request #4693: [HUDI-2488][HUDI-3175] Implement async metadata indexing

2022-03-27 Thread GitBox
codope commented on a change in pull request #4693: URL: https://github.com/apache/hudi/pull/4693#discussion_r836027951 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/action/index/RunIndexActionExecutor.java ## @@ -0,0 +1,263 @@ +/* + *

[GitHub] [hudi] danny0405 commented on issue #5150: [SUPPORT] bucket_bulk_insert so slow and generate too many hdfs small flie with Flink BUCKET index

2022-03-27 Thread GitBox
danny0405 commented on issue #5150: URL: https://github.com/apache/hudi/issues/5150#issuecomment-1080138417 See the fix here: https://github.com/apache/hudi/pull/5151

[GitHub] [hudi] codope commented on a change in pull request #4693: [HUDI-2488][HUDI-3175] Implement async metadata indexing

2022-03-27 Thread GitBox
codope commented on a change in pull request #4693: URL: https://github.com/apache/hudi/pull/4693#discussion_r836027799 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/action/index/RunIndexActionExecutor.java ## @@ -0,0 +1,189 @@ +/* + *

[GitHub] [hudi] hudi-bot commented on pull request #5151: [minor] Set up the sort operator parallelism to avoid data shuffle

2022-03-27 Thread GitBox
hudi-bot commented on pull request #5151: URL: https://github.com/apache/hudi/pull/5151#issuecomment-1080138206 ## CI report: * dd456954a218cd11b2fd5840f7df2ce3ead4012b UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure`

[GitHub] [hudi] danny0405 opened a new pull request #5151: [minor] Set up the sort operator parallelism to avoid data shuffle

2022-03-27 Thread GitBox
danny0405 opened a new pull request #5151: URL: https://github.com/apache/hudi/pull/5151 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.* ## What is the

[hudi] branch master updated (f2a93ea -> d31cde2)

2022-03-27 Thread leesf
This is an automated email from the ASF dual-hosted git repository. leesf pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from f2a93ea [HUDI-3724] Fixing closure of ParquetReader (#5141) add d31cde2 [MINOR] Fix call command parser use

[GitHub] [hudi] leesf merged pull request #5144: [MINOR] fix call command parser use spark3.2

2022-03-27 Thread GitBox
leesf merged pull request #5144: URL: https://github.com/apache/hudi/pull/5144

[GitHub] [hudi] codope commented on a change in pull request #4693: [HUDI-2488][HUDI-3175] Implement async metadata indexing

2022-03-27 Thread GitBox
codope commented on a change in pull request #4693: URL: https://github.com/apache/hudi/pull/4693#discussion_r836025596 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/action/index/RunIndexActionExecutor.java ## @@ -0,0 +1,263 @@ +/* + *

[GitHub] [hudi] Guanpx edited a comment on issue #5150: [SUPPORT] bucket_bulk_insert so slow and generate too many hdfs small flie with Flink BUCKET index

2022-03-27 Thread GitBox
Guanpx edited a comment on issue #5150: URL: https://github.com/apache/hudi/issues/5150#issuecomment-1080133387 for this pr https://github.com/apache/hudi/pull/5135

[GitHub] [hudi] Guanpx commented on issue #5150: [SUPPORT] bucket_bulk_insert so slow and generate too many hdfs small flie with Flink BUCKET index

2022-03-27 Thread GitBox
Guanpx commented on issue #5150: URL: https://github.com/apache/hudi/issues/5150#issuecomment-1080133387 https://github.com/apache/hudi/pull/5135

[GitHub] [hudi] Guanpx opened a new issue #5150: [SUPPORT] bucket_bulk_insert so slow and generate too many hdfs small flie with Flink BUCKET index

2022-03-27 Thread GitBox
Guanpx opened a new issue #5150: URL: https://github.com/apache/hudi/issues/5150 **Describe the problem you faced** Flink + Hudi COW + BUCKET index + bulk_insert: bucket_bulk_insert is **so slow** and generates **too many small HDFS files** **To Reproduce** Steps to

[jira] [Created] (HUDI-3727) Add metrics for async indexer

2022-03-27 Thread Sagar Sumit (Jira)
Sagar Sumit created HUDI-3727: - Summary: Add metrics for async indexer Key: HUDI-3727 URL: https://issues.apache.org/jira/browse/HUDI-3727 Project: Apache Hudi Issue Type: Improvement

[GitHub] [hudi] codope commented on a change in pull request #4693: [HUDI-2488][HUDI-3175] Implement async metadata indexing

2022-03-27 Thread GitBox
codope commented on a change in pull request #4693: URL: https://github.com/apache/hudi/pull/4693#discussion_r835764478 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/action/index/RunIndexActionExecutor.java ## @@ -0,0 +1,189 @@ +/* + *

[GitHub] [hudi] hudi-bot removed a comment on pull request #4925: [HUDI-3103] Enable MultiTableDeltaStreamer to update a single sink table from multiple source tables

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #4925: URL: https://github.com/apache/hudi/pull/4925#issuecomment-1080125928 ## CI report: * d68f9448706f659685b357a469be1d2f40968760 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4925: [HUDI-3103] Enable MultiTableDeltaStreamer to update a single sink table from multiple source tables

2022-03-27 Thread GitBox
hudi-bot commented on pull request #4925: URL: https://github.com/apache/hudi/pull/4925#issuecomment-1080126938 ## CI report: * d68f9448706f659685b357a469be1d2f40968760 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4925: [HUDI-3103] Enable MultiTableDeltaStreamer to update a single sink table from multiple source tables

2022-03-27 Thread GitBox
hudi-bot commented on pull request #4925: URL: https://github.com/apache/hudi/pull/4925#issuecomment-1080125928 ## CI report: * d68f9448706f659685b357a469be1d2f40968760 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4925: [HUDI-3103] Enable MultiTableDeltaStreamer to update a single sink table from multiple source tables

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #4925: URL: https://github.com/apache/hudi/pull/4925#issuecomment-1080124911 ## CI report: * d68f9448706f659685b357a469be1d2f40968760 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4925: [HUDI-3103] Enable MultiTableDeltaStreamer to update a single sink table from multiple source tables

2022-03-27 Thread GitBox
hudi-bot commented on pull request #4925: URL: https://github.com/apache/hudi/pull/4925#issuecomment-1080124911 ## CI report: * d68f9448706f659685b357a469be1d2f40968760 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4925: [HUDI-3103] Enable MultiTableDeltaStreamer to update a single sink table from multiple source tables

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #4925: URL: https://github.com/apache/hudi/pull/4925#issuecomment-1079572814 ## CI report: * d68f9448706f659685b357a469be1d2f40968760 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #5089: [MINOR] Repeated execution of update status

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #5089: URL: https://github.com/apache/hudi/pull/5089#issuecomment-1080120708 ## CI report: * 86911e7f5be5afa9ea13b3db414c94ba82b755ad Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5089: [MINOR] Repeated execution of update status

2022-03-27 Thread GitBox
hudi-bot commented on pull request #5089: URL: https://github.com/apache/hudi/pull/5089#issuecomment-1080121718 ## CI report: * 86911e7f5be5afa9ea13b3db414c94ba82b755ad Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5089: [MINOR] Repeated execution of update status

2022-03-27 Thread GitBox
hudi-bot commented on pull request #5089: URL: https://github.com/apache/hudi/pull/5089#issuecomment-1080120708 ## CI report: * 86911e7f5be5afa9ea13b3db414c94ba82b755ad Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #5089: [MINOR] Repeated execution of update status

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #5089: URL: https://github.com/apache/hudi/pull/5089#issuecomment-1080118236 ## CI report: * 86911e7f5be5afa9ea13b3db414c94ba82b755ad Azure:

[GitHub] [hudi] xiarixiaoyao commented on pull request #4910: [RFC-33] [HUDI-2429][Stacked on HUDI-2560] Support full Schema evolution for Spark

2022-03-27 Thread GitBox
xiarixiaoyao commented on pull request #4910: URL: https://github.com/apache/hudi/pull/4910#issuecomment-1080120399 @bvaradar I have addressed all comments. Are there any other concerns? Let's discuss all the issues together.

[GitHub] [hudi] xiarixiaoyao commented on pull request #4910: [RFC-33] [HUDI-2429][Stacked on HUDI-2560] Support full Schema evolution for Spark

2022-03-27 Thread GitBox
xiarixiaoyao commented on pull request #4910: URL: https://github.com/apache/hudi/pull/4910#issuecomment-1080120016 Now, let's discuss the following questions. @bvaradar @YannByron **question1**: do the SerDeHelper.L

[GitHub] [hudi] hudi-bot commented on pull request #5087: [HUDI-3614] [DO_NOT_MERGE]Replace List with HoodieData in HoodieFlink/JavaTable and commit executors

2022-03-27 Thread GitBox
hudi-bot commented on pull request #5087: URL: https://github.com/apache/hudi/pull/5087#issuecomment-1080119531 ## CI report: * cb5e663b30f62899e8f518b378fa4061b2416f77 UNKNOWN * 170a4c1691307de27e688e0b043d1477a2a9a65e UNKNOWN * e157958ddf6adb265f9f084aa3dcb26ce0c9cb16

[GitHub] [hudi] hudi-bot removed a comment on pull request #5087: [HUDI-3614] [DO_NOT_MERGE]Replace List with HoodieData in HoodieFlink/JavaTable and commit executors

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #5087: URL: https://github.com/apache/hudi/pull/5087#issuecomment-1080116901 ## CI report: * cb5e663b30f62899e8f518b378fa4061b2416f77 UNKNOWN * 170a4c1691307de27e688e0b043d1477a2a9a65e UNKNOWN *

[GitHub] [hudi] hudi-bot commented on pull request #5089: [MINOR] Repeated execution of update status

2022-03-27 Thread GitBox
hudi-bot commented on pull request #5089: URL: https://github.com/apache/hudi/pull/5089#issuecomment-1080118236 ## CI report: * 86911e7f5be5afa9ea13b3db414c94ba82b755ad Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #5089: [MINOR] Repeated execution of update status

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #5089: URL: https://github.com/apache/hudi/pull/5089#issuecomment-1080088184 ## CI report: * 86911e7f5be5afa9ea13b3db414c94ba82b755ad Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5087: [HUDI-3614] [DO_NOT_MERGE]Replace List with HoodieData in HoodieFlink/JavaTable and commit executors

2022-03-27 Thread GitBox
hudi-bot commented on pull request #5087: URL: https://github.com/apache/hudi/pull/5087#issuecomment-1080116901 ## CI report: * cb5e663b30f62899e8f518b378fa4061b2416f77 UNKNOWN * 170a4c1691307de27e688e0b043d1477a2a9a65e UNKNOWN * e157958ddf6adb265f9f084aa3dcb26ce0c9cb16

[GitHub] [hudi] hudi-bot removed a comment on pull request #5087: [HUDI-3614] [DO_NOT_MERGE]Replace List with HoodieData in HoodieFlink/JavaTable and commit executors

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #5087: URL: https://github.com/apache/hudi/pull/5087#issuecomment-1079377726 ## CI report: * cb5e663b30f62899e8f518b378fa4061b2416f77 UNKNOWN * 170a4c1691307de27e688e0b043d1477a2a9a65e UNKNOWN *

[jira] [Commented] (HUDI-1590) Support async clustering w/ test suite job

2022-03-27 Thread Kelu Tao (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17513107#comment-17513107 ] Kelu Tao commented on HUDI-1590: Hi [~xushiyan], I'm very sorry about this. I am afraid that I don't have

[GitHub] [hudi] hudi-bot removed a comment on pull request #5042: Three bulk_insert files are concurrently submitted and executed with a difference of 2s, the insert fails occasionally.

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #5042: URL: https://github.com/apache/hudi/pull/5042#issuecomment-1080111977 ## CI report: * 7ee24be4d11864af37bf300250d571e15d5f9ae9 UNKNOWN * Unknown: [CANCELED](TBD) * 9a9e544ba48a52c7b54134fc9533c3e5a51ccfff UNKNOWN *

[GitHub] [hudi] hudi-bot commented on pull request #5042: Three bulk_insert files are concurrently submitted and executed with a difference of 2s, the insert fails occasionally.

2022-03-27 Thread GitBox
hudi-bot commented on pull request #5042: URL: https://github.com/apache/hudi/pull/5042#issuecomment-1080113560 ## CI report: * 7ee24be4d11864af37bf300250d571e15d5f9ae9 UNKNOWN * 9a9e544ba48a52c7b54134fc9533c3e5a51ccfff UNKNOWN * f69bbf06e9cb669dfe0785b5eee8501ba56871fe

[GitHub] [hudi] hudi-bot commented on pull request #4945: [HUDI-3538] Support Compaction Command Based on Call Procedure Command for Spark SQL

2022-03-27 Thread GitBox
hudi-bot commented on pull request #4945: URL: https://github.com/apache/hudi/pull/4945#issuecomment-1080113468 ## CI report: * 3802bb3e07923b1904f141d62fb744667493ddfb Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4945: [HUDI-3538] Support Compaction Command Based on Call Procedure Command for Spark SQL

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #4945: URL: https://github.com/apache/hudi/pull/4945#issuecomment-1080111859 ## CI report: * 3802bb3e07923b1904f141d62fb744667493ddfb Azure:

[jira] [Assigned] (HUDI-1590) Support async clustering w/ test suite job

2022-03-27 Thread Kelu Tao (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kelu Tao reassigned HUDI-1590: -- Assignee: (was: Kelu Tao) > Support async clustering w/ test suite job >

[GitHub] [hudi] hudi-bot removed a comment on pull request #5042: Three bulk_insert files are concurrently submitted and executed with a difference of 2s, the insert fails occasionally.

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #5042: URL: https://github.com/apache/hudi/pull/5042#issuecomment-1080086310 ## CI report: * 7ee24be4d11864af37bf300250d571e15d5f9ae9 UNKNOWN * Unknown: [CANCELED](TBD) * 9a9e544ba48a52c7b54134fc9533c3e5a51ccfff UNKNOWN *

[GitHub] [hudi] hudi-bot commented on pull request #5042: Three bulk_insert files are concurrently submitted and executed with a difference of 2s, the insert fails occasionally.

2022-03-27 Thread GitBox
hudi-bot commented on pull request #5042: URL: https://github.com/apache/hudi/pull/5042#issuecomment-1080111977 ## CI report: * 7ee24be4d11864af37bf300250d571e15d5f9ae9 UNKNOWN * Unknown: [CANCELED](TBD) * 9a9e544ba48a52c7b54134fc9533c3e5a51ccfff UNKNOWN *

[GitHub] [hudi] hudi-bot removed a comment on pull request #4945: [HUDI-3538] Support Compaction Command Based on Call Procedure Command for Spark SQL

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #4945: URL: https://github.com/apache/hudi/pull/4945#issuecomment-1079732557 ## CI report: * 3802bb3e07923b1904f141d62fb744667493ddfb Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4945: [HUDI-3538] Support Compaction Command Based on Call Procedure Command for Spark SQL

2022-03-27 Thread GitBox
hudi-bot commented on pull request #4945: URL: https://github.com/apache/hudi/pull/4945#issuecomment-1080111859 ## CI report: * 3802bb3e07923b1904f141d62fb744667493ddfb Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #5147: [HUDI-2520] fix drop partition issue when sync to hive

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #5147: URL: https://github.com/apache/hudi/pull/5147#issuecomment-1080108714 ## CI report: * 38dd1f251d021d49f65f9a414f5bec9b8a2440da Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5147: [HUDI-2520] fix drop partition issue when sync to hive

2022-03-27 Thread GitBox
hudi-bot commented on pull request #5147: URL: https://github.com/apache/hudi/pull/5147#issuecomment-1080110471 ## CI report: * 38dd1f251d021d49f65f9a414f5bec9b8a2440da Azure:

[GitHub] [hudi] peanut-chenzhong commented on pull request #5042: Three bulk_insert files are concurrently submitted and executed with a difference of 2s, the insert fails occasionally.

2022-03-27 Thread GitBox
peanut-chenzhong commented on pull request #5042: URL: https://github.com/apache/hudi/pull/5042#issuecomment-1080110396 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [hudi] hudi-bot removed a comment on pull request #5147: [HUDI-2520] fix drop partition issue when sync to hive

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #5147: URL: https://github.com/apache/hudi/pull/5147#issuecomment-1079964456 ## CI report: * 38dd1f251d021d49f65f9a414f5bec9b8a2440da Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5147: [HUDI-2520] fix drop partition issue when sync to hive

2022-03-27 Thread GitBox
hudi-bot commented on pull request #5147: URL: https://github.com/apache/hudi/pull/5147#issuecomment-1080108714 ## CI report: * 38dd1f251d021d49f65f9a414f5bec9b8a2440da Azure:

[GitHub] [hudi] codope commented on a change in pull request #4693: [HUDI-2488][HUDI-3175] Implement async metadata indexing

2022-03-27 Thread GitBox
codope commented on a change in pull request #4693: URL: https://github.com/apache/hudi/pull/4693#discussion_r836008617 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/metadata/HoodieBackedTableMetadataWriter.java ## @@ -392,6 +398,12 @@ public void

[GitHub] [hudi] xiarixiaoyao edited a comment on pull request #4910: [RFC-33] [HUDI-2429][Stacked on HUDI-2560] Support full Schema evolution for Spark

2022-03-27 Thread GitBox
xiarixiaoyao edited a comment on pull request #4910: URL: https://github.com/apache/hudi/pull/4910#issuecomment-1080095336 > > [#4910 (comment)](https://github.com/apache/hudi/pull/4910#discussion_r834319573) @bvaradar @YannByron this is test result > > > > Test case:  > >

[GitHub] [hudi] xiarixiaoyao commented on pull request #4910: [RFC-33] [HUDI-2429][Stacked on HUDI-2560] Support full Schema evolution for Spark

2022-03-27 Thread GitBox
xiarixiaoyao commented on pull request #4910: URL: https://github.com/apache/hudi/pull/4910#issuecomment-1080099081 > Yes, let me address that in another comment. Now focusing on the questions: One specific question: For an existing table (0.10.1 or prior), specifically,

[GitHub] [hudi] xiarixiaoyao commented on pull request #4910: [RFC-33] [HUDI-2429][Stacked on HUDI-2560] Support full Schema evolution for Spark

2022-03-27 Thread GitBox
xiarixiaoyao commented on pull request #4910: URL: https://github.com/apache/hudi/pull/4910#issuecomment-1080095336 > > [#4910 (comment)](https://github.com/apache/hudi/pull/4910#discussion_r834319573) @bvaradar @YannByron this is test result > > > > Test case:  > > dataSize:

[GitHub] [hudi] hudi-bot commented on pull request #5148: [HUDI-3720] Fix the logic of reattempting pending rollback

2022-03-27 Thread GitBox
hudi-bot commented on pull request #5148: URL: https://github.com/apache/hudi/pull/5148#issuecomment-1080094062 ## CI report: * 0f0fc30c8c43e2aee2b291295072695e6a36be43 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #5148: [HUDI-3720] Fix the logic of reattempting pending rollback

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #5148: URL: https://github.com/apache/hudi/pull/5148#issuecomment-1080092792 ## CI report: * 0f0fc30c8c43e2aee2b291295072695e6a36be43 Azure:

[GitHub] [hudi] xiarixiaoyao commented on a change in pull request #4962: [HUDI-3355] Issue with out of order commits in the timeline when ingestion writers using SparkAllowUpdateStrategy

2022-03-27 Thread GitBox
xiarixiaoyao commented on a change in pull request #4962: URL: https://github.com/apache/hudi/pull/4962#discussion_r836001123 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/utils/TransactionUtils.java ## @@ -137,4 +126,27 @@ throw new

[GitHub] [hudi] hudi-bot commented on pull request #5148: [HUDI-3720] Fix the logic of reattempting pending rollback

2022-03-27 Thread GitBox
hudi-bot commented on pull request #5148: URL: https://github.com/apache/hudi/pull/5148#issuecomment-1080092792 ## CI report: * 0f0fc30c8c43e2aee2b291295072695e6a36be43 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #5148: [HUDI-3720] Fix the logic of reattempting pending rollback

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #5148: URL: https://github.com/apache/hudi/pull/5148#issuecomment-1080072976 ## CI report: * 0f0fc30c8c43e2aee2b291295072695e6a36be43 Azure:

[hudi] branch master updated (9da2dd4 -> f2a93ea)

2022-03-27 Thread leesf
This is an automated email from the ASF dual-hosted git repository. leesf pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from 9da2dd4 [HUDI-3719] High performance costs of AvroSerizlizer in DataSource wr… (#5137) add f2a93ea [HUDI-3724]

[GitHub] [hudi] leesf merged pull request #5141: [HUDI-3724] Fixing closure of ParquetReader

2022-03-27 Thread GitBox
leesf merged pull request #5141: URL: https://github.com/apache/hudi/pull/5141

[GitHub] [hudi] xiarixiaoyao commented on pull request #4945: [HUDI-3538] Support Compaction Command Based on Call Procedure Command for Spark SQL

2022-03-27 Thread GitBox
xiarixiaoyao commented on pull request #4945: URL: https://github.com/apache/hudi/pull/4945#issuecomment-1080091463 LGTM, just a minor comment. Once it is addressed, we can merge it.

[GitHub] [hudi] xiarixiaoyao commented on a change in pull request #4945: [HUDI-3538] Support Compaction Command Based on Call Procedure Command for Spark SQL

2022-03-27 Thread GitBox
xiarixiaoyao commented on a change in pull request #4945: URL: https://github.com/apache/hudi/pull/4945#discussion_r835999620 ## File path: hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/spark/sql/hudi/command/procedures/RunCompactionProcedure.scala ## @@ -0,0

[GitHub] [hudi] hudi-bot commented on pull request #5149: [WIP][HUDI-3721] Allow rollback of commits before metadata table initialization

2022-03-27 Thread GitBox
hudi-bot commented on pull request #5149: URL: https://github.com/apache/hudi/pull/5149#issuecomment-1080088308 ## CI report: * 83d16feb46037566b487a04a1f3e20bc77f26a53 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #5149: [WIP][HUDI-3721] Allow rollback of commits before metadata table initialization

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #5149: URL: https://github.com/apache/hudi/pull/5149#issuecomment-1080054033 ## CI report: * 83d16feb46037566b487a04a1f3e20bc77f26a53 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5089: [MINOR] Repeated execution of update status

2022-03-27 Thread GitBox
hudi-bot commented on pull request #5089: URL: https://github.com/apache/hudi/pull/5089#issuecomment-1080088184 ## CI report: * 86911e7f5be5afa9ea13b3db414c94ba82b755ad Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #5089: [MINOR] Repeated execution of update status

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #5089: URL: https://github.com/apache/hudi/pull/5089#issuecomment-1078643570 ## CI report: * 86911e7f5be5afa9ea13b3db414c94ba82b755ad Azure:

[GitHub] [hudi] cuibo01 commented on pull request #5089: [MINOR] Repeated execution of update status

2022-03-27 Thread GitBox
cuibo01 commented on pull request #5089: URL: https://github.com/apache/hudi/pull/5089#issuecomment-1080087240 @hudi-bot run azure

[GitHub] [hudi] hudi-bot commented on pull request #5042: Three bulk_insert files are concurrently submitted and executed with a difference of 2s, the insert fails occasionally.

2022-03-27 Thread GitBox
hudi-bot commented on pull request #5042: URL: https://github.com/apache/hudi/pull/5042#issuecomment-1080086310 ## CI report: * 7ee24be4d11864af37bf300250d571e15d5f9ae9 UNKNOWN * Unknown: [CANCELED](TBD) * 9a9e544ba48a52c7b54134fc9533c3e5a51ccfff UNKNOWN *

[GitHub] [hudi] hudi-bot removed a comment on pull request #5042: Three bulk_insert files are concurrently submitted and executed with a difference of 2s, the insert fails occasionally.

2022-03-27 Thread GitBox
hudi-bot removed a comment on pull request #5042: URL: https://github.com/apache/hudi/pull/5042#issuecomment-1080084434 ## CI report: * 7ee24be4d11864af37bf300250d571e15d5f9ae9 UNKNOWN * Unknown: [CANCELED](TBD) * 9a9e544ba48a52c7b54134fc9533c3e5a51ccfff UNKNOWN *
