[GitHub] [hudi] codope commented on pull request #6725: [HUDI-4881] Push down filters if possible when syncing partitions to Hive

2022-09-27 Thread GitBox
codope commented on PR #6725: URL: https://github.com/apache/hudi/pull/6725#issuecomment-1260425055 Canceling the CI run to prioritize release blockers. Apologies. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

[GitHub] [hudi] codope commented on pull request #6660: [MINOR] Skip loading last completed txn for single writer

2022-09-27 Thread GitBox
codope commented on PR #6660: URL: https://github.com/apache/hudi/pull/6660#issuecomment-1260424617 Canceling the CI run to prioritize release blockers. Apologies.

[GitHub] [hudi] codope commented on pull request #6806: [HUDI-4905] Improve type handling in proto schema conversion

2022-09-27 Thread GitBox
codope commented on PR #6806: URL: https://github.com/apache/hudi/pull/6806#issuecomment-1260424078 Canceling the CI run to prioritize release blockers. Apologies.

[GitHub] [hudi] pramodbiligiri commented on pull request #6665: [HUDI-4850] Incremental Ingestion from GCS

2022-09-27 Thread GitBox
pramodbiligiri commented on PR #6665: URL: https://github.com/apache/hudi/pull/6665#issuecomment-1260421762 Responded to recent comments by @codope (GitHub is not letting me respond inline to some comments): - Moved getMissingCheckpointStrategy to IncrSourceHelper, changed

[GitHub] [hudi] pramodbiligiri commented on a diff in pull request #6665: [HUDI-4850] Incremental Ingestion from GCS

2022-09-27 Thread GitBox
pramodbiligiri commented on code in PR #6665: URL: https://github.com/apache/hudi/pull/6665#discussion_r981968905 ## hudi-utilities/pom.xml: ## @@ -443,6 +443,23 @@ aws-java-sdk-sqs ${aws.sdk.version} + + + + com.google.cloud +

[jira] [Updated] (HUDI-4937) Fix HoodieTable injecting HoodieBackedTableMetadat not reusing underlying MT readers

2022-09-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-4937: - Labels: pull-request-available (was: ) > Fix HoodieTable injecting HoodieBackedTableMetadat not

[GitHub] [hudi] alexeykudinkin opened a new pull request, #6815: [HUDI-4937] Fix `HoodieTable` injecting non-reusable `HoodieBackedTableMetadata` aggressively flushing MT readers

2022-09-27 Thread GitBox
alexeykudinkin opened a new pull request, #6815: URL: https://github.com/apache/hudi/pull/6815 ### Change Logs Currently, `HoodieTable` is holding `HoodieBackedTableMetadata` that is setup not to reuse actual LogScanner and HFileReader used to read MT itself. This is proving

[jira] [Updated] (HUDI-4937) Fix HoodieTable injecting HoodieBackedTableMetadat not reusing underlying MT readers

2022-09-27 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-4937: -- Summary: Fix HoodieTable injecting HoodieBackedTableMetadat not reusing underlying MT readers

[jira] [Created] (HUDI-4937) Fix HoodieTable injecting HoodieBackedTableMetadat not reusing underlying MT reades

2022-09-27 Thread Alexey Kudinkin (Jira)
Alexey Kudinkin created HUDI-4937: - Summary: Fix HoodieTable injecting HoodieBackedTableMetadat not reusing underlying MT reades Key: HUDI-4937 URL: https://issues.apache.org/jira/browse/HUDI-4937

[GitHub] [hudi] hudi-bot commented on pull request #6802: [HUDI-4924] Auto-tune dedup parallelism

2022-09-27 Thread GitBox
hudi-bot commented on PR #6802: URL: https://github.com/apache/hudi/pull/6802#issuecomment-1260416284 ## CI report: * f29b7651130535f6212f2f3917ad6d48800710cb Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6753: [HUDI-4734] Deltastreamer table config change validation

2022-09-27 Thread GitBox
hudi-bot commented on PR #6753: URL: https://github.com/apache/hudi/pull/6753#issuecomment-1260416235 ## CI report: * bd1d6a2403336f0863b4e4ccb4f7a46eec76a4e3 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5616: [HUDI-4936] Fix `as.of.instant` not recognized as hoodie config

2022-09-27 Thread GitBox
hudi-bot commented on PR #5616: URL: https://github.com/apache/hudi/pull/5616#issuecomment-1260415431 ## CI report: * beb60e0e8dd7b36d2fdc09c234eac5ea7f5a830b Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4015: [HUDI-2780] Fix the issue of Mor log skipping complete blocks when reading data

2022-09-27 Thread GitBox
hudi-bot commented on PR #4015: URL: https://github.com/apache/hudi/pull/4015#issuecomment-1260414955 ## CI report: * eee3fdd5be1fbb6ef4d754c69a95db0fca36c25d Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6753: [HUDI-4734] Deltastreamer table config change validation

2022-09-27 Thread GitBox
hudi-bot commented on PR #6753: URL: https://github.com/apache/hudi/pull/6753#issuecomment-1260413049 ## CI report: * bd1d6a2403336f0863b4e4ccb4f7a46eec76a4e3 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5616: [HUDI-4936] Fix `as.of.instant` not recognized as hoodie config

2022-09-27 Thread GitBox
hudi-bot commented on PR #5616: URL: https://github.com/apache/hudi/pull/5616#issuecomment-1260412074 ## CI report: * beb60e0e8dd7b36d2fdc09c234eac5ea7f5a830b Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5269: [HUDI-3636] Create new write clients for async table services in DeltaStreamer

2022-09-27 Thread GitBox
hudi-bot commented on PR #5269: URL: https://github.com/apache/hudi/pull/5269#issuecomment-1260411915 ## CI report: * e7966fbbde59b43876542fc39135157835b6c861 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4015: [HUDI-2780] Fix the issue of Mor log skipping complete blocks when reading data

2022-09-27 Thread GitBox
hudi-bot commented on PR #4015: URL: https://github.com/apache/hudi/pull/4015#issuecomment-1260411508 ## CI report: * eee3fdd5be1fbb6ef4d754c69a95db0fca36c25d Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6814: [MINOR] fixing validate async operations to poll completed clean instances

2022-09-27 Thread GitBox
hudi-bot commented on PR #6814: URL: https://github.com/apache/hudi/pull/6814#issuecomment-1260409727 ## CI report: * 8d1bd7e5add4ad564ec57f073c28d30f4bf890f8 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6355: [HUDI-4925] Should Force to use ExpressionPayload in MergeIntoTableCommand

2022-09-27 Thread GitBox
hudi-bot commented on PR #6355: URL: https://github.com/apache/hudi/pull/6355#issuecomment-1260409157 ## CI report: * 4a774eec1b4293fadf9b99638c96a0fb3bedc8d1 Azure:

[GitHub] [hudi] nsivabalan commented on pull request #6753: [HUDI-4734] Deltastreamer table config change validation

2022-09-27 Thread GitBox
nsivabalan commented on PR #6753: URL: https://github.com/apache/hudi/pull/6753#issuecomment-1260408466 I triaged as to why payloadClassUpdate test in deltastreamer was succeeding w/ this patch. I was expecting it to fail. apparently payload class is a top level config in deltastreamer and

[jira] [Updated] (HUDI-4936) as.of.instant not recognized as hoodie config

2022-09-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-4936: - Labels: pull-request-available (was: ) > as.of.instant not recognized as hoodie config >

[GitHub] [hudi] xushiyan commented on pull request #5616: [HUDI-4936] Fix `as.of.instant` not recognized as hoodie config

2022-09-27 Thread GitBox
xushiyan commented on PR #5616: URL: https://github.com/apache/hudi/pull/5616#issuecomment-1260393874 > @gnailJC can you please elaborate if this is addressing an issue that you've faced or is it just code cleanup? @alexeykudinkin this is reported in

[jira] [Assigned] (HUDI-4936) as.of.instant not recognized as hoodie config

2022-09-27 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-4936: Assignee: Raymond Xu > as.of.instant not recognized as hoodie config >

[jira] [Created] (HUDI-4936) as.of.instant not recognized as hoodie config

2022-09-27 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-4936: Summary: as.of.instant not recognized as hoodie config Key: HUDI-4936 URL: https://issues.apache.org/jira/browse/HUDI-4936 Project: Apache Hudi Issue Type: Bug

[GitHub] [hudi] codope commented on pull request #6753: [HUDI-4734] Deltastreamer table config change validation

2022-09-27 Thread GitBox
codope commented on PR #6753: URL: https://github.com/apache/hudi/pull/6753#issuecomment-1260389144 simply rebased as the flaky test was fixed recently

[jira] [Created] (HUDI-4935) Use "hoodie.timestamp.as.of" for time travel option

2022-09-27 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-4935: Summary: Use "hoodie.timestamp.as.of" for time travel option Key: HUDI-4935 URL: https://issues.apache.org/jira/browse/HUDI-4935 Project: Apache Hudi Issue Type:

[GitHub] [hudi] hudi-bot commented on pull request #6355: [HUDI-4925] Should Force to use ExpressionPayload in MergeIntoTableCommand

2022-09-27 Thread GitBox
hudi-bot commented on PR #6355: URL: https://github.com/apache/hudi/pull/6355#issuecomment-1260377382 ## CI report: * 4a774eec1b4293fadf9b99638c96a0fb3bedc8d1 Azure:

[GitHub] [hudi] fengjian428 commented on pull request #6355: [HUDI-4925] Should Force to use ExpressionPayload in MergeIntoTableCommand

2022-09-27 Thread GitBox
fengjian428 commented on PR #6355: URL: https://github.com/apache/hudi/pull/6355#issuecomment-1260376680 > @fengjian428 let's also add a test for this issue at what level? I feel it looks redundant if create a test only to check the payload class, WDYT?

[GitHub] [hudi] hudi-bot commented on pull request #6813: [HUDI-4934] Revert batch clean files

2022-09-27 Thread GitBox
hudi-bot commented on PR #6813: URL: https://github.com/apache/hudi/pull/6813#issuecomment-1260371928 ## CI report: * 2c5f43beba87a049ffc077b0394e76fa1e9d9ad6 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6805: [MINOR] copy to avoid the problem of reusing buffer underlying the Row

2022-09-27 Thread GitBox
hudi-bot commented on PR #6805: URL: https://github.com/apache/hudi/pull/6805#issuecomment-1260371889 ## CI report: * 553617898625ce54efbe0f72bdad5c44c3906743 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6657: [HUDI-4687] Avoid setAccessible which breaks strong encapsulation

2022-09-27 Thread GitBox
hudi-bot commented on PR #6657: URL: https://github.com/apache/hudi/pull/6657#issuecomment-1260371724 ## CI report: * f4d7ad25b0513ac14d78d08b58c7bd0b4b0cf374 UNKNOWN * 23928b7db6d79929b01df68d2b6e1d892263223b Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6805: [MINOR] copy to avoid the problem of reusing buffer underlying the Row

2022-09-27 Thread GitBox
hudi-bot commented on PR #6805: URL: https://github.com/apache/hudi/pull/6805#issuecomment-1260369087 ## CI report: * 553617898625ce54efbe0f72bdad5c44c3906743 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6657: [HUDI-4687] Avoid setAccessible which breaks strong encapsulation

2022-09-27 Thread GitBox
hudi-bot commented on PR #6657: URL: https://github.com/apache/hudi/pull/6657#issuecomment-1260368877 ## CI report: * f4d7ad25b0513ac14d78d08b58c7bd0b4b0cf374 UNKNOWN * 23928b7db6d79929b01df68d2b6e1d892263223b Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6810: [HUDI-4718] Add Kerberos kdestroy command support

2022-09-27 Thread GitBox
hudi-bot commented on PR #6810: URL: https://github.com/apache/hudi/pull/6810#issuecomment-1260364997 ## CI report: * 7f565d23883e4f1d2006fcd704a48b77fd25f987 Azure:

[GitHub] [hudi] YannByron commented on pull request #6805: [MINOR] copy to avoid the problem of reusing buffer underlying the Row

2022-09-27 Thread GitBox
YannByron commented on PR #6805: URL: https://github.com/apache/hudi/pull/6805#issuecomment-1260353253 @alexeykudinkin I found that serialization/deserialization can be avoided in many cases; only one case requires it, so I made some optimizations for this.

[GitHub] [hudi] hudi-bot commented on pull request #6736: [HUDI-4894] Fix ClassCastException when using fixed type defining dec…

2022-09-27 Thread GitBox
hudi-bot commented on PR #6736: URL: https://github.com/apache/hudi/pull/6736#issuecomment-1260330710 ## CI report: * 4703b2ce5a638618a2d9da14f10fec064216f1a1 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6736: [HUDI-4894] Fix ClassCastException when using fixed type defining dec…

2022-09-27 Thread GitBox
hudi-bot commented on PR #6736: URL: https://github.com/apache/hudi/pull/6736#issuecomment-1260327970 ## CI report: * 4703b2ce5a638618a2d9da14f10fec064216f1a1 Azure:

[jira] [Closed] (HUDI-4453) Support partition pruning for tables Bootstrapped from Source Hive Style partitioned tables

2022-09-27 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit closed HUDI-4453. - Fix Version/s: 0.12.1 (was: 0.13.0) Resolution: Fixed > Support partition

[hudi] branch master updated (be9b4195ea -> 96ce4b2f8e)

2022-09-27 Thread codope
This is an automated email from the ASF dual-hosted git repository. codope pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from be9b4195ea [HUDI-4913] Fix HoodieSnapshotExporter for writing to a different S3 bucket or FS (#6785) add

[GitHub] [hudi] codope merged pull request #6676: [HUDI-4453] Fix schema to include partition columns in bootstrap operation

2022-09-27 Thread GitBox
codope merged PR #6676: URL: https://github.com/apache/hudi/pull/6676

[GitHub] [hudi] hudi-bot commented on pull request #6753: [HUDI-4734] Deltastreamer table config change validation

2022-09-27 Thread GitBox
hudi-bot commented on PR #6753: URL: https://github.com/apache/hudi/pull/6753#issuecomment-1260323649 ## CI report: * bd1d6a2403336f0863b4e4ccb4f7a46eec76a4e3 Azure:

[GitHub] [hudi] boneanxs commented on pull request #6793: 【HUDI-4917】Optimized the way to get HoodieBaseFile of loadColumnRange…

2022-09-27 Thread GitBox
boneanxs commented on PR #6793: URL: https://github.com/apache/hudi/pull/6793#issuecomment-1260315164 @CodeCooker17 Could you please rebase the code to fix the flaky issue?

[GitHub] [hudi] wangxianghu commented on pull request #6736: [HUDI-4894] Fix ClassCastException when using fixed type defining dec…

2022-09-27 Thread GitBox
wangxianghu commented on PR #6736: URL: https://github.com/apache/hudi/pull/6736#issuecomment-1260301786 @shenh062326 can you also take a look at this ?

[GitHub] [hudi] hudi-bot commented on pull request #6725: [HUDI-4881] Push down filters if possible when syncing partitions to Hive

2022-09-27 Thread GitBox
hudi-bot commented on PR #6725: URL: https://github.com/apache/hudi/pull/6725#issuecomment-1260291247 ## CI report: * a0d1c46572bf9cb55107cc95bce8da7d9911400f Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6660: [MINOR] Skip loading last completed txn for single writer

2022-09-27 Thread GitBox
hudi-bot commented on PR #6660: URL: https://github.com/apache/hudi/pull/6660#issuecomment-1260291176 ## CI report: * 2294778f796f2a41333234d2d3aa300c27520ed1 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6741: [HUDI-4898] presto/hive respect payload during merge parquet file and logfile when reading mor table

2022-09-27 Thread GitBox
hudi-bot commented on PR #6741: URL: https://github.com/apache/hudi/pull/6741#issuecomment-1260291280 ## CI report: * bff3acafde6d8a1bd5574b90ce644ef30acbf0a2 UNKNOWN * ee6c9571e7c07aef6b27d05af536c4be2a5373cd Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5269: [HUDI-3636] Create new write clients for async table services in DeltaStreamer

2022-09-27 Thread GitBox
hudi-bot commented on PR #5269: URL: https://github.com/apache/hudi/pull/5269#issuecomment-1260290578 ## CI report: * b987d84d2c18e5e3cb15268a2689a6294842b2c8 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6741: [HUDI-4898] presto/hive respect payload during merge parquet file and logfile when reading mor table

2022-09-27 Thread GitBox
hudi-bot commented on PR #6741: URL: https://github.com/apache/hudi/pull/6741#issuecomment-1260288962 ## CI report: * bff3acafde6d8a1bd5574b90ce644ef30acbf0a2 UNKNOWN * ee6c9571e7c07aef6b27d05af536c4be2a5373cd Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6725: [HUDI-4881] Push down filters if possible when syncing partitions to Hive

2022-09-27 Thread GitBox
hudi-bot commented on PR #6725: URL: https://github.com/apache/hudi/pull/6725#issuecomment-1260288901 ## CI report: * a0d1c46572bf9cb55107cc95bce8da7d9911400f Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6660: [MINOR] Skip loading last completed txn for single writer

2022-09-27 Thread GitBox
hudi-bot commented on PR #6660: URL: https://github.com/apache/hudi/pull/6660#issuecomment-1260288834 ## CI report: * 2294778f796f2a41333234d2d3aa300c27520ed1 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5269: [HUDI-3636] Create new write clients for async table services in DeltaStreamer

2022-09-27 Thread GitBox
hudi-bot commented on PR #5269: URL: https://github.com/apache/hudi/pull/5269#issuecomment-1260288169 ## CI report: * f82d7aecb0d5f6b687a3e8083b562dd8fd5601c9 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6676: [HUDI-4453] Fix schema to include partition columns in bootstrap operation

2022-09-27 Thread GitBox
hudi-bot commented on PR #6676: URL: https://github.com/apache/hudi/pull/6676#issuecomment-1260286424 ## CI report: * edf92e5ee1ef5ee9df4db8649e94277875fa00db Azure:

[GitHub] [hudi] boneanxs commented on pull request #6046: [HUDI-4363] Support Clustering row writer to improve performance

2022-09-27 Thread GitBox
boneanxs commented on PR #6046: URL: https://github.com/apache/hudi/pull/6046#issuecomment-1260281685 > can you please create a Jira corresponding to your investigation and link it in here? So that it's easier to discover it Yea, sure thing

[GitHub] [hudi] hudi-bot commented on pull request #6358: [HUDI-4588][HUDI-4472] Fixing `HoodieParquetReader` to properly specify projected schema when reading Parquet file

2022-09-27 Thread GitBox
hudi-bot commented on PR #6358: URL: https://github.com/apache/hudi/pull/6358#issuecomment-1260249246 ## CI report: * 288d166c49602a4593b1e97763a467811903737d UNKNOWN * 5558316c7b224caaa00a3de53ef0d24193f8d1ad Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6806: [HUDI-4905] Improve type handling in proto schema conversion

2022-09-27 Thread GitBox
hudi-bot commented on PR #6806: URL: https://github.com/apache/hudi/pull/6806#issuecomment-1260246561 ## CI report: * f18fff886e0509bed666404800694b8aff3bb98f UNKNOWN * eca2992d65733640a0999f29a5e35e6631bbc57a Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5269: [HUDI-3636] Create new write clients for async table services in DeltaStreamer

2022-09-27 Thread GitBox
hudi-bot commented on PR #5269: URL: https://github.com/apache/hudi/pull/5269#issuecomment-1260245383 ## CI report: * f82d7aecb0d5f6b687a3e8083b562dd8fd5601c9 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6802: [HUDI-4924] Auto-tune dedup parallelism

2022-09-27 Thread GitBox
hudi-bot commented on PR #6802: URL: https://github.com/apache/hudi/pull/6802#issuecomment-1260243028 ## CI report: * 4b207ae3989df07ab53e8bf69ed3c65ffc818270 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6806: [HUDI-4905] Improve type handling in proto schema conversion

2022-09-27 Thread GitBox
hudi-bot commented on PR #6806: URL: https://github.com/apache/hudi/pull/6806#issuecomment-1260243069 ## CI report: * f18fff886e0509bed666404800694b8aff3bb98f UNKNOWN * c5dff6337535dc26770b376383318065a8f4bd1d Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5269: [HUDI-3636] Create new write clients for async table services in DeltaStreamer

2022-09-27 Thread GitBox
hudi-bot commented on PR #5269: URL: https://github.com/apache/hudi/pull/5269#issuecomment-1260241912 ## CI report: * 460ce44922c50af95bf6c68f111ceb61d07c0289 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6802: [HUDI-4924] Auto-tune dedup parallelism

2022-09-27 Thread GitBox
hudi-bot commented on PR #6802: URL: https://github.com/apache/hudi/pull/6802#issuecomment-1260179805 ## CI report: * 4b207ae3989df07ab53e8bf69ed3c65ffc818270 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6814: [MINOR] fixing validate async operations to poll completed clean instances

2022-09-27 Thread GitBox
hudi-bot commented on PR #6814: URL: https://github.com/apache/hudi/pull/6814#issuecomment-1260176786 ## CI report: * 8d1bd7e5add4ad564ec57f073c28d30f4bf890f8 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6813: [HUDI-4934] Revert batch clean files

2022-09-27 Thread GitBox
hudi-bot commented on PR #6813: URL: https://github.com/apache/hudi/pull/6813#issuecomment-1260176763 ## CI report: * 2c5f43beba87a049ffc077b0394e76fa1e9d9ad6 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6810: [HUDI-4718] Add Kerberos kdestroy command support

2022-09-27 Thread GitBox
hudi-bot commented on PR #6810: URL: https://github.com/apache/hudi/pull/6810#issuecomment-1260176739 ## CI report: * 8ee0099aa6c60063c1619a3ab8dcf4d3557b04b7 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6802: [HUDI-4924] Auto-tune dedup parallelism

2022-09-27 Thread GitBox
hudi-bot commented on PR #6802: URL: https://github.com/apache/hudi/pull/6802#issuecomment-1260176698 ## CI report: * a7e22e5456a947c9f8ad000d47f4b32bddf1937c Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5269: [HUDI-3636] Create new write clients for async table services in DeltaStreamer

2022-09-27 Thread GitBox
hudi-bot commented on PR #5269: URL: https://github.com/apache/hudi/pull/5269#issuecomment-1260175625 ## CI report: * 460ce44922c50af95bf6c68f111ceb61d07c0289 Azure:

[GitHub] [hudi] alexeykudinkin commented on a diff in pull request #6806: [HUDI-4905] Improve type handling in proto schema conversion

2022-09-27 Thread GitBox
alexeykudinkin commented on code in PR #6806: URL: https://github.com/apache/hudi/pull/6806#discussion_r981805427 ## hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/helpers/ProtoConversionUtil.java: ## @@ -314,7 +351,16 @@ private Object convertObject(Schema

[GitHub] [hudi] hudi-bot commented on pull request #6814: [MINOR] fixing validate async operations to poll completed clean instances

2022-09-27 Thread GitBox
hudi-bot commented on PR #6814: URL: https://github.com/apache/hudi/pull/6814#issuecomment-1260173497 ## CI report: * 8d1bd7e5add4ad564ec57f073c28d30f4bf890f8 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #6813: [HUDI-4934] Revert batch clean files

2022-09-27 Thread GitBox
hudi-bot commented on PR #6813: URL: https://github.com/apache/hudi/pull/6813#issuecomment-1260173450 ## CI report: * 2c5f43beba87a049ffc077b0394e76fa1e9d9ad6 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #6810: [HUDI-4718] Add Kerberos kdestroy command support

2022-09-27 Thread GitBox
hudi-bot commented on PR #6810: URL: https://github.com/apache/hudi/pull/6810#issuecomment-1260173419 ## CI report: * 8ee0099aa6c60063c1619a3ab8dcf4d3557b04b7 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6802: [HUDI-4924] Auto-tune dedup parallelism

2022-09-27 Thread GitBox
hudi-bot commented on PR #6802: URL: https://github.com/apache/hudi/pull/6802#issuecomment-1260173350 ## CI report: * a7e22e5456a947c9f8ad000d47f4b32bddf1937c Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5269: [HUDI-3636] Create new write clients for async table services in DeltaStreamer

2022-09-27 Thread GitBox
hudi-bot commented on PR #5269: URL: https://github.com/apache/hudi/pull/5269#issuecomment-1260171893 ## CI report: * 460ce44922c50af95bf6c68f111ceb61d07c0289 Azure:

[GitHub] [hudi] nsivabalan commented on pull request #5269: [HUDI-3636] Create new write clients for async table services in DeltaStreamer

2022-09-27 Thread GitBox
nsivabalan commented on PR #5269: URL: https://github.com/apache/hudi/pull/5269#issuecomment-1260167190 @hudi-bot run azure

[GitHub] [hudi] nsivabalan commented on pull request #6802: [HUDI-4924] Auto-tune dedup parallelism

2022-09-27 Thread GitBox
nsivabalan commented on PR #6802: URL: https://github.com/apache/hudi/pull/6802#issuecomment-1260162593 have rebased and pushed

[GitHub] [hudi] nsivabalan opened a new pull request, #6814: [MINOR] fixing validate async operations to poll completed clean instances

2022-09-27 Thread GitBox
nsivabalan opened a new pull request, #6814: URL: https://github.com/apache/hudi/pull/6814 ### Change Logs Fixing call to right timeline api for completed clean for ValidateAsyncOperations ### Impact stabilizes integration tests **Risk level: none **

[GitHub] [hudi] nsivabalan opened a new pull request, #6813: [HUDI-4934] Revert batch clean files

2022-09-27 Thread GitBox
nsivabalan opened a new pull request, #6813: URL: https://github.com/apache/hudi/pull/6813 ### Change Logs Revert "[HUDI-4792] Batch clean files to delete (#6580)" This reverts commit cbf9b83ca6d3dada14eea551a5bae25144ca0459. ### Impact We have some integration

[jira] [Updated] (HUDI-4934) Cleaner cleans up files touched by clustering

2022-09-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-4934: - Labels: pull-request-available (was: ) > Cleaner cleans up files touched by clustering >

[jira] [Commented] (HUDI-4199) Clean up row writer path for url encoding, consistent logical timestamp

2022-09-27 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17610282#comment-17610282 ] sivabalan narayanan commented on HUDI-4199: --- Went through the code and apparently things have

[jira] [Updated] (HUDI-4199) Clean up row writer path for url encoding, consistent logical timestamp

2022-09-27 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-4199: -- Status: Open (was: In Progress) > Clean up row writer path for url encoding,

[jira] [Closed] (HUDI-4199) Clean up row writer path for url encoding, consistent logical timestamp

2022-09-27 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan closed HUDI-4199. - Resolution: Fixed > Clean up row writer path for url encoding, consistent logical

[GitHub] [hudi] the-other-tim-brown commented on a diff in pull request #6761: [HUDI-4904] Add support for unraveling proto schemas in ProtoClassBasedSchemaProvider

2022-09-27 Thread GitBox
the-other-tim-brown commented on code in PR #6761: URL: https://github.com/apache/hudi/pull/6761#discussion_r981791563 ## hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/helpers/ProtoConversionUtil.java: ## @@ -80,17 +83,19 @@ public static GenericRecord

[jira] [Updated] (HUDI-4934) Cleaner cleans up files touched by clustering

2022-09-27 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-4934: -- Priority: Blocker (was: Major) > Cleaner cleans up files touched by clustering >

[jira] [Created] (HUDI-4934) Cleaner cleans up files touched by clustering

2022-09-27 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-4934: - Summary: Cleaner cleans up files touched by clustering Key: HUDI-4934 URL: https://issues.apache.org/jira/browse/HUDI-4934 Project: Apache Hudi

[jira] [Updated] (HUDI-4934) Cleaner cleans up files touched by clustering

2022-09-27 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-4934: -- Fix Version/s: 0.12.1 > Cleaner cleans up files touched by clustering >

[jira] [Assigned] (HUDI-4934) Cleaner cleans up files touched by clustering

2022-09-27 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-4934: - Assignee: sivabalan narayanan > Cleaner cleans up files touched by clustering >

[jira] [Updated] (HUDI-4199) Clean up row writer path for url encoding, consistent logical timestamp

2022-09-27 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-4199: -- Status: In Progress (was: Open) > Clean up row writer path for url encoding,

[GitHub] [hudi] jinyius commented on a diff in pull request #6761: [HUDI-4904] Add support for unraveling proto schemas in ProtoClassBasedSchemaProvider

2022-09-27 Thread GitBox
jinyius commented on code in PR #6761: URL: https://github.com/apache/hudi/pull/6761#discussion_r981404962 ## hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/helpers/ProtoConversionUtil.java: ## @@ -80,17 +83,19 @@ public static GenericRecord convertToAvro(Schema

[GitHub] [hudi] hudi-bot commented on pull request #5269: [HUDI-3636] Create new write clients for async table services in DeltaStreamer

2022-09-27 Thread GitBox
hudi-bot commented on PR #5269: URL: https://github.com/apache/hudi/pull/5269#issuecomment-1260132585 ## CI report: * 460ce44922c50af95bf6c68f111ceb61d07c0289 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5269: [HUDI-3636] Create new write clients for async table services in DeltaStreamer

2022-09-27 Thread GitBox
hudi-bot commented on PR #5269: URL: https://github.com/apache/hudi/pull/5269#issuecomment-1260118937 ## CI report: * a360d286f9a9bff3f60cc7231bc0abfe86675a88 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6806: [HUDI-4905] Improve type handling in proto schema conversion

2022-09-27 Thread GitBox
hudi-bot commented on PR #6806: URL: https://github.com/apache/hudi/pull/6806#issuecomment-1260109367 ## CI report: * f18fff886e0509bed666404800694b8aff3bb98f UNKNOWN * c5dff6337535dc26770b376383318065a8f4bd1d Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6753: [HUDI-4734] Deltastreamer table config change validation

2022-09-27 Thread GitBox
hudi-bot commented on PR #6753: URL: https://github.com/apache/hudi/pull/6753#issuecomment-1260109296 ## CI report: * f3bd19a7f74e4612058cd1eec09c26893d810e25 Azure:

[GitHub] [hudi] nsivabalan opened a new pull request, #6812: [WIP] Reverting offending commit with clean planner

2022-09-27 Thread GitBox
nsivabalan opened a new pull request, #6812: URL: https://github.com/apache/hudi/pull/6812 ### Change Logs _Describe context and summary for this change. Highlight if any code was copied._ ### Impact _Describe any public API or user-facing feature change or any

[GitHub] [hudi] hudi-bot commented on pull request #5269: [HUDI-3636] Create new write clients for async table services in DeltaStreamer

2022-09-27 Thread GitBox
hudi-bot commented on PR #5269: URL: https://github.com/apache/hudi/pull/5269#issuecomment-1260108067 ## CI report: * a360d286f9a9bff3f60cc7231bc0abfe86675a88 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6753: [HUDI-4734] Deltastreamer table config change validation

2022-09-27 Thread GitBox
hudi-bot commented on PR #6753: URL: https://github.com/apache/hudi/pull/6753#issuecomment-1260105388 ## CI report: * 364470c22551fb8fc17c4d7286e6b4504198d614 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #6806: [HUDI-4905] Improve type handling in proto schema conversion

2022-09-27 Thread GitBox
hudi-bot commented on PR #6806: URL: https://github.com/apache/hudi/pull/6806#issuecomment-1260105459 ## CI report: * f18fff886e0509bed666404800694b8aff3bb98f UNKNOWN * c5dff6337535dc26770b376383318065a8f4bd1d Azure:

[GitHub] [hudi] hudi-bot commented on pull request #5269: [HUDI-3636] Create new write clients for async table services in DeltaStreamer

2022-09-27 Thread GitBox
hudi-bot commented on PR #5269: URL: https://github.com/apache/hudi/pull/5269#issuecomment-1260104128 ## CI report: * a360d286f9a9bff3f60cc7231bc0abfe86675a88 Azure:

[GitHub] [hudi] the-other-tim-brown commented on a diff in pull request #6806: [HUDI-4905] Improve type handling in proto schema conversion

2022-09-27 Thread GitBox
the-other-tim-brown commented on code in PR #6806: URL: https://github.com/apache/hudi/pull/6806#discussion_r981750304 ## hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/helpers/ProtoConversionUtil.java: ## @@ -173,24 +184,35 @@ private Schema

[GitHub] [hudi] the-other-tim-brown commented on a diff in pull request #6806: [HUDI-4905] Improve type handling in proto schema conversion

2022-09-27 Thread GitBox
the-other-tim-brown commented on code in PR #6806: URL: https://github.com/apache/hudi/pull/6806#discussion_r981744618 ## hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/helpers/ProtoConversionUtil.java: ## @@ -314,7 +351,16 @@ private Object convertObject(Schema

[GitHub] [hudi] nochimow opened a new issue, #6811: [SUPPORT] Slow upsert performance

2022-09-27 Thread GitBox
nochimow opened a new issue, #6811: URL: https://github.com/apache/hudi/issues/6811 **Describe the problem you faced** After one month of data ingestion, a data ingestion pipeline started to take a very long time during the upsert operation We currently use Hudi Spark 0.12 running

[GitHub] [hudi] the-other-tim-brown commented on a diff in pull request #6806: [HUDI-4905] Improve type handling in proto schema conversion

2022-09-27 Thread GitBox
the-other-tim-brown commented on code in PR #6806: URL: https://github.com/apache/hudi/pull/6806#discussion_r981739787 ## hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/helpers/ProtoConversionUtil.java: ## @@ -173,24 +184,35 @@ private Schema

[GitHub] [hudi] the-other-tim-brown commented on a diff in pull request #6806: [HUDI-4905] Improve type handling in proto schema conversion

2022-09-27 Thread GitBox
the-other-tim-brown commented on code in PR #6806: URL: https://github.com/apache/hudi/pull/6806#discussion_r981730983 ## hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/helpers/ProtoConversionUtil.java: ## @@ -85,6 +93,8 @@ public static GenericRecord
