[jira] [Commented] (HUDI-1307) spark datasource load path format is confused for snapshot and increment read mode

2021-09-26 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17420358#comment-17420358 ] Raymond Xu commented on HUDI-1307: -- [~309637554] Any update on this improvement? definitely useful to

[jira] [Updated] (HUDI-2440) Add dependency change diff script for dependency governace

2021-09-26 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2440: - Component/s: Usability > Add dependency change diff script for dependency governace >

[jira] [Commented] (HUDI-2496) Inserts are precombined even with dedup disabled

2021-09-28 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17421584#comment-17421584 ] Raymond Xu commented on HUDI-2496: -- [~helias_an] Sure. assigned! Please ping us even with a draft PR, we

[jira] [Assigned] (HUDI-2496) Inserts are precombined even with dedup disabled

2021-09-28 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-2496: Assignee: Helias Antoniou > Inserts are precombined even with dedup disabled >

[jira] [Updated] (HUDI-1998) Provide a way to find list of commits through a pythonic API

2021-09-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1998: - Description: TimelineUtils is a java API using which one can get the latest commit or instantiate

[jira] [Created] (HUDI-2500) Spark datasource delete not working on Spark SQL created table

2021-09-29 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-2500: Summary: Spark datasource delete not working on Spark SQL created table Key: HUDI-2500 URL: https://issues.apache.org/jira/browse/HUDI-2500 Project: Apache Hudi

[jira] [Updated] (HUDI-2495) Difference in behavior between GenericRecord based key gen and Row based key gen

2021-09-30 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2495: - Parent: HUDI-2505 Issue Type: Sub-task (was: Bug) > Difference in behavior between GenericRecord

[jira] [Updated] (HUDI-2390) KeyGenerator discrepancy between DataFrame writer and SQL

2021-09-30 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2390: - Parent: HUDI-2505 Issue Type: Sub-task (was: Improvement) > KeyGenerator discrepancy between

[jira] [Updated] (HUDI-2505) [UMBRELLA] Spark DataSource APIs and Spark SQL discrepancies

2021-09-30 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2505: - Labels: sev:critical (was: ) > [UMBRELLA] Spark DataSource APIs and Spark SQL discrepancies >

[jira] [Updated] (HUDI-2500) Spark datasource delete not working on Spark SQL created table

2021-09-30 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2500: - Parent: HUDI-2505 Issue Type: Sub-task (was: Bug) > Spark datasource delete not working on Spark

[jira] [Created] (HUDI-2505) [UMBRELLA] Spark DataSource APIs and Spark SQL discrepancies

2021-09-30 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-2505: Summary: [UMBRELLA] Spark DataSource APIs and Spark SQL discrepancies Key: HUDI-2505 URL: https://issues.apache.org/jira/browse/HUDI-2505 Project: Apache Hudi

[jira] [Updated] (HUDI-2500) Spark datasource delete not working on Spark SQL created table

2021-09-30 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2500: - Description: Original issue [https://github.com/apache/hudi/issues/3670]   Script to re-produce

[jira] [Created] (HUDI-2531) [UMBRELLA] Support Dataset APIs in writer paths

2021-10-07 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-2531: Summary: [UMBRELLA] Support Dataset APIs in writer paths Key: HUDI-2531 URL: https://issues.apache.org/jira/browse/HUDI-2531 Project: Apache Hudi Issue Type: New

[jira] [Updated] (HUDI-2531) [UMBRELLA] Support Dataset APIs in writer paths

2021-10-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2531: - Labels: hudi-umbrellas sev:critical user-support-issues (was: ) > [UMBRELLA] Support Dataset APIs in

[jira] [Updated] (HUDI-2531) [UMBRELLA] Support Dataset APIs in writer paths

2021-10-07 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2531: - Description: To make use of Dataset APIs in writer paths instead of RDD. > [UMBRELLA] Support Dataset

[jira] [Updated] (HUDI-2452) spark on hudi metadata key length < 0

2021-09-22 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2452: - Labels: sev:critical (was: pull-request-available sev:critical) > spark on hudi metadata key length < 0

[jira] [Created] (HUDI-2482) Support drop partitions SQL

2021-09-22 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-2482: Summary: Support drop partitions SQL Key: HUDI-2482 URL: https://issues.apache.org/jira/browse/HUDI-2482 Project: Apache Hudi Issue Type: Improvement

[jira] [Updated] (HUDI-2482) Support drop partitions SQL

2021-09-22 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2482: - Parent: HUDI-1658 Issue Type: Sub-task (was: Improvement) > Support drop partitions SQL >

[jira] [Updated] (HUDI-2456) Support show partitions SQL

2021-09-22 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2456: - Parent: HUDI-1658 Issue Type: Sub-task (was: Improvement) > Support show partitions SQL >

[jira] [Updated] (HUDI-2482) Support drop partitions SQL

2021-09-22 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2482: - Description: (was: Spark SQL support the following syntax to show hudi tabls's partitions.

[jira] [Assigned] (HUDI-2482) Support drop partitions SQL

2021-09-22 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-2482: Assignee: (was: Yann Byron) > Support drop partitions SQL > --- > >

[jira] [Assigned] (HUDI-2108) Flaky test: TestHoodieBackedMetadata.testOnlyValidPartitionsAdded:210

2021-10-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-2108: Assignee: Raymond Xu (was: Vinoth Chandar) > Flaky test:

[jira] [Updated] (HUDI-2108) Flaky test: TestHoodieBackedMetadata.testOnlyValidPartitionsAdded:210

2021-10-02 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2108: - Status: In Progress (was: Open) > Flaky test: TestHoodieBackedMetadata.testOnlyValidPartitionsAdded:210

[jira] [Created] (HUDI-2516) Upgrade to Junit 5.8.1

2021-10-04 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-2516: Summary: Upgrade to Junit 5.8.1 Key: HUDI-2516 URL: https://issues.apache.org/jira/browse/HUDI-2516 Project: Apache Hudi Issue Type: Sub-task Components:

[jira] [Updated] (HUDI-2108) Flaky test: TestHoodieBackedMetadata.testOnlyValidPartitionsAdded:210

2021-10-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2108: - Description: org.apache.hudi.client.functional.TestHoodieBackedMetadata#testTableOperationsWithRestore  

[jira] [Updated] (HUDI-2108) Flaky test: TestHoodieBackedMetadata.testOnlyValidPartitionsAdded:210

2021-10-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2108: - Component/s: Testing > Flaky test: TestHoodieBackedMetadata.testOnlyValidPartitionsAdded:210 >

[jira] [Updated] (HUDI-2108) Flaky test: TestHoodieBackedMetadata.testOnlyValidPartitionsAdded:210

2021-10-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2108: - Description: org.apache.hudi.client.functional.TestHoodieBackedMetadata#testTableOperationsWithRestore  

[jira] [Updated] (HUDI-2528) Flaky test: [ERROR] HoodieTableType).[2] MERGE_ON_READ(testTableOperationsWithRestore

2021-10-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2528: - Parent: HUDI-1248 Issue Type: Sub-task (was: Bug) > Flaky test: [ERROR] HoodieTableType).[2] >

[jira] [Updated] (HUDI-2529) Flaky test: ITTestHoodieFlinkCompactor.testHoodieFlinkCompactor:88

2021-10-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2529: - Description: {code:java} 2021-09-30T16:45:30.4276182Z 12557 [pool-15-thread-2] ERROR

[jira] [Updated] (HUDI-2077) Flaky test: TestHoodieDeltaStreamer

2021-10-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2077: - Description: {code:java} [INFO] Results:8520[INFO] 8521[ERROR] Errors: 8522[ERROR]

[jira] [Closed] (HUDI-2075) Flaky test: TestRowDataToHoodieFunction

2021-10-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-2075. Resolution: Cannot Reproduce Don't see this in Azure. Re-open if this came back. > Flaky test:

[jira] [Updated] (HUDI-2529) Flaky test: ITTestHoodieFlinkCompactor.testHoodieFlinkCompactor:88

2021-10-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2529: - Attachment: 27.txt > Flaky test: ITTestHoodieFlinkCompactor.testHoodieFlinkCompactor:88 >

[jira] [Updated] (HUDI-2528) Flaky test: MERGE_ON_READ testTableOperationsWithRestore

2021-10-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2528: - Summary: Flaky test: MERGE_ON_READ testTableOperationsWithRestore (was: Flaky test: [ERROR]

[jira] [Updated] (HUDI-2077) Flaky test: TestHoodieDeltaStreamer

2021-10-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2077: - Description: {code:java} [INFO] Results:8520[INFO] 8521[ERROR] Errors: 8522[ERROR]

[jira] [Updated] (HUDI-2077) Flaky test: TestHoodieDeltaStreamer

2021-10-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2077: - Attachment: 28.txt > Flaky test: TestHoodieDeltaStreamer > --- > >

[jira] [Created] (HUDI-2528) Flaky test: [ERROR] HoodieTableType).[2] MERGE_ON_READ(testTableOperationsWithRestore

2021-10-05 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-2528: Summary: Flaky test: [ERROR] HoodieTableType).[2] MERGE_ON_READ(testTableOperationsWithRestore Key: HUDI-2528 URL: https://issues.apache.org/jira/browse/HUDI-2528 Project:

[jira] [Updated] (HUDI-1248) [UMBRELLA] Tests cleanup and fixes

2021-10-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1248: - Priority: Critical (was: Major) > [UMBRELLA] Tests cleanup and fixes >

[jira] [Assigned] (HUDI-2527) Flaky test: TestHoodieClientMultiWriter.testMultiWriterWithAsyncTableServicesWithConflict

2021-10-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-2527: Assignee: Raymond Xu > Flaky test: >

[jira] [Created] (HUDI-2527) Flaky test: TestHoodieClientMultiWriter.testMultiWriterWithAsyncTableServicesWithConflict

2021-10-05 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-2527: Summary: Flaky test: TestHoodieClientMultiWriter.testMultiWriterWithAsyncTableServicesWithConflict Key: HUDI-2527 URL: https://issues.apache.org/jira/browse/HUDI-2527

[jira] [Created] (HUDI-2529) Flaky test: ITTestHoodieFlinkCompactor.testHoodieFlinkCompactor:88

2021-10-05 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-2529: Summary: Flaky test: ITTestHoodieFlinkCompactor.testHoodieFlinkCompactor:88 Key: HUDI-2529 URL: https://issues.apache.org/jira/browse/HUDI-2529 Project: Apache Hudi

[jira] [Closed] (HUDI-2076) Flaky test: TestHoodieMultiTableDeltaStreamer

2021-10-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-2076. Resolution: Cannot Reproduce Don't see this in Azure. Re-open if this came back. > Flaky test:

[jira] [Closed] (HUDI-2078) Flaky test: TestCleaner

2021-10-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu closed HUDI-2078. Resolution: Cannot Reproduce Don't see this in Azure. Re-open if this came back. > Flaky test: TestCleaner

[jira] [Updated] (HUDI-2527) Flaky test: TestHoodieClientMultiWriter.testMultiWriterWithAsyncTableServicesWithConflict

2021-10-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2527: - Description: Test case does not make sense for COW table. Should remove COW from the test param.

[jira] [Updated] (HUDI-2527) Flaky test: TestHoodieClientMultiWriter.testMultiWriterWithAsyncTableServicesWithConflict

2021-10-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2527: - Description:   {code:java} [ERROR] Tests run: 6, Failures: 0, Errors: 1, Skipped: 0, Time elapsed:

[jira] [Updated] (HUDI-864) parquet schema conflict: optional binary (UTF8) is not a group

2021-09-27 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-864: Affects Version/s: 0.9.0 > parquet schema conflict: optional binary (UTF8) is not a group >

[jira] [Updated] (HUDI-864) parquet schema conflict: optional binary (UTF8) is not a group

2021-09-27 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-864: Affects Version/s: 0.6.0 0.5.3 0.7.0

[jira] [Updated] (HUDI-2390) KeyGenerator discrepancy between DataFrame writer and SQL

2021-09-27 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2390: - Labels: sev:critical (was: ) > KeyGenerator discrepancy between DataFrame writer and SQL >

[jira] [Updated] (HUDI-2390) KeyGenerator discrepancy between DataFrame writer and SQL

2021-09-27 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2390: - Priority: Critical (was: Minor) > KeyGenerator discrepancy between DataFrame writer and SQL >

[jira] [Updated] (HUDI-2495) Difference in behavior between GenericRecord based key gen and Row based key gen

2021-09-27 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2495: - Priority: Critical (was: Major) > Difference in behavior between GenericRecord based key gen and Row

[jira] [Updated] (HUDI-2495) Difference in behavior between GenericRecord based key gen and Row based key gen

2021-09-27 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2495: - Component/s: Spark Integration > Difference in behavior between GenericRecord based key gen and Row based

[jira] [Updated] (HUDI-2496) Inserts are precombined even with dedup disabled

2021-09-27 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2496: - Fix Version/s: 0.10.0 > Inserts are precombined even with dedup disabled >

[jira] [Updated] (HUDI-2496) Inserts are precombined even with dedup disabled

2021-09-27 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2496: - Component/s: Writer Core > Inserts are precombined even with dedup disabled >

[jira] [Updated] (HUDI-2496) Inserts are precombined even with dedup disabled

2021-09-27 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2496: - Labels: sev:critical (was: writer) > Inserts are precombined even with dedup disabled >

[jira] [Updated] (HUDI-2496) Inserts are precombined even with dedup disabled

2021-09-27 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2496: - Description: Original GH issue https://github.com/apache/hudi/issues/3709 Test case by [~xushiyan] :

[jira] [Updated] (HUDI-2496) Inserts are precombined even with dedup disabled

2021-09-27 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2496: - Priority: Critical (was: Major) > Inserts are precombined even with dedup disabled >

[jira] [Updated] (HUDI-2608) Support JSON schema in schema registry provider

2021-10-24 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2608: - Description: To work with JSON kafka source.   Original issue

[jira] [Updated] (HUDI-2608) Support JSON schema in schema registry provider

2021-10-24 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2608: - Labels: sev:normal user-support-issues (was: ) > Support JSON schema in schema registry provider >

[jira] [Created] (HUDI-2608) Support JSON schema in schema registry provider

2021-10-24 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-2608: Summary: Support JSON schema in schema registry provider Key: HUDI-2608 URL: https://issues.apache.org/jira/browse/HUDI-2608 Project: Apache Hudi Issue Type: New

[jira] [Created] (HUDI-2610) Fix Spark version info for hudi table CTAS from another hudi table

2021-10-24 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-2610: Summary: Fix Spark version info for hudi table CTAS from another hudi table Key: HUDI-2610 URL: https://issues.apache.org/jira/browse/HUDI-2610 Project: Apache Hudi

[jira] [Created] (HUDI-2609) Clarify small file configs in config page

2021-10-24 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-2609: Summary: Clarify small file configs in config page Key: HUDI-2609 URL: https://issues.apache.org/jira/browse/HUDI-2609 Project: Apache Hudi Issue Type: Sub-task

[jira] [Updated] (HUDI-2609) Clarify small file configs in config page

2021-10-24 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2609: - Labels: user-support-issues (was: ) > Clarify small file configs in config page >

[jira] [Created] (HUDI-2611) `create table if not exists` should print message instead of throwing error

2021-10-24 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-2611: Summary: `create table if not exists` should print message instead of throwing error Key: HUDI-2611 URL: https://issues.apache.org/jira/browse/HUDI-2611 Project: Apache Hudi

[jira] [Created] (HUDI-2617) Implement HBase Index for Dataset

2021-10-25 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-2617: Summary: Implement HBase Index for Dataset Key: HUDI-2617 URL: https://issues.apache.org/jira/browse/HUDI-2617 Project: Apache Hudi Issue Type: Sub-task

[jira] [Updated] (HUDI-1430) Implement SparkDataFrameWriteClient with SimpleIndex

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1430: - Description: End to end upsert operation, with proper functional tests coverage. > Implement

[jira] [Updated] (HUDI-2619) Make table services work with Dataset

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2619: - Description: Clustering, Compaction, Clean should also work with Dataset > Make table services work with

[jira] [Updated] (HUDI-2531) [UMBRELLA] Support Dataset APIs in writer paths

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2531: - Status: In Progress (was: Open) > [UMBRELLA] Support Dataset APIs in writer paths >

[jira] [Assigned] (HUDI-2615) Decouple HoodieRecordPayload with Hoodie table, table services, and index

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-2615: Assignee: Raymond Xu > Decouple HoodieRecordPayload with Hoodie table, table services, and index >

[jira] [Updated] (HUDI-2077) Flaky test: TestHoodieDeltaStreamer

2021-10-24 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2077: - Priority: Critical (was: Major) > Flaky test: TestHoodieDeltaStreamer >

[jira] [Updated] (HUDI-2531) [UMBRELLA] Support Dataset APIs in writer paths

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2531: - Fix Version/s: 0.10.0 > [UMBRELLA] Support Dataset APIs in writer paths >

[jira] [Updated] (HUDI-2616) Implement BloomIndex for Dataset

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2616: - Fix Version/s: 0.10.0 > Implement BloomIndex for Dataset > - > >

[jira] [Updated] (HUDI-2617) Implement HBase Index for Dataset

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2617: - Fix Version/s: 0.10.0 > Implement HBase Index for Dataset > -- > >

[jira] [Updated] (HUDI-2615) Decouple HoodieRecordPayload with Hoodie table, table services, and index

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2615: - Fix Version/s: 0.10.0 > Decouple HoodieRecordPayload with Hoodie table, table services, and index >

[jira] [Assigned] (HUDI-1869) Upgrading Spark3 To 3.1

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-1869: Assignee: Yann Byron (was: pengzhiwei) > Upgrading Spark3 To 3.1 > --- > >

[jira] [Created] (HUDI-2621) Optimize DataFrameWriter on small file handling

2021-10-25 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-2621: Summary: Optimize DataFrameWriter on small file handling Key: HUDI-2621 URL: https://issues.apache.org/jira/browse/HUDI-2621 Project: Apache Hudi Issue Type:

[jira] [Created] (HUDI-2623) Make hudi-bot comment at PR thread bottom

2021-10-25 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-2623: Summary: Make hudi-bot comment at PR thread bottom Key: HUDI-2623 URL: https://issues.apache.org/jira/browse/HUDI-2623 Project: Apache Hudi Issue Type: Improvement

[jira] [Updated] (HUDI-2287) Partition pruning not working on Hudi dataset

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2287: - Status: In Progress (was: Open) > Partition pruning not working on Hudi dataset >

[jira] [Updated] (HUDI-1706) Test flakiness w/ multiwriter test

2021-10-24 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1706: - Priority: Major (was: Blocker) > Test flakiness w/ multiwriter test > --

[jira] [Created] (HUDI-2615) Decouple HoodieRecordPayload with Hoodie table, table services, and index

2021-10-25 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-2615: Summary: Decouple HoodieRecordPayload with Hoodie table, table services, and index Key: HUDI-2615 URL: https://issues.apache.org/jira/browse/HUDI-2615 Project: Apache Hudi

[jira] [Updated] (HUDI-2531) [UMBRELLA] Support Dataset APIs in writer paths

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2531: - Priority: Blocker (was: Critical) > [UMBRELLA] Support Dataset APIs in writer paths >

[jira] [Created] (HUDI-2616) Implement BloomIndex for Dataset

2021-10-25 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-2616: Summary: Implement BloomIndex for Dataset Key: HUDI-2616 URL: https://issues.apache.org/jira/browse/HUDI-2616 Project: Apache Hudi Issue Type: Sub-task

[jira] [Updated] (HUDI-2618) Implement write operations other than upsert in SparkDataFrameWriteClient

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2618: - Story Points: 4 (was: 3) > Implement write operations other than upsert in SparkDataFrameWriteClient >

[jira] [Comment Edited] (HUDI-1970) Performance testing/certification of key SQL DMLs

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17433587#comment-17433587 ] Raymond Xu edited comment on HUDI-1970 at 10/25/21, 7:13 AM: - * 1B records

[jira] [Updated] (HUDI-1970) Performance testing/certification of key SQL DMLs

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1970: - Status: Patch Available (was: In Progress) > Performance testing/certification of key SQL DMLs >

[jira] [Updated] (HUDI-1430) Implement SparkDataFrameWriteClient with SimpleIndex

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1430: - Status: In Progress (was: Open) > Implement SparkDataFrameWriteClient with SimpleIndex >

[jira] [Updated] (HUDI-1430) Implement SparkDataFrameWriteClient with SimpleIndex

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1430: - Story Points: 2 > Implement SparkDataFrameWriteClient with SimpleIndex >

[jira] [Updated] (HUDI-2618) Implement write operations other than upsert in SparkDataFrameWriteClient

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2618: - Summary: Implement write operations other than upsert in SparkDataFrameWriteClient (was: Implement

[jira] [Updated] (HUDI-2618) Implement write operations other than upsert in SparkDataFrameWriteClient

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2618: - Description: insert, insert_prepped, insert_overwrite, insert_overwrite_table, delete, delete_partitions,

[jira] [Assigned] (HUDI-1885) Support Delete/Update Non-Pk Table

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-1885: Assignee: Yann Byron > Support Delete/Update Non-Pk Table > -- > >

[jira] [Assigned] (HUDI-2234) MERGE INTO works only ON primary key

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-2234: Assignee: Yann Byron (was: pengzhiwei) > MERGE INTO works only ON primary key >

[jira] [Commented] (HUDI-2287) Partition pruning not working on Hudi dataset

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17433586#comment-17433586 ] Raymond Xu commented on HUDI-2287: -- [~rjkumr] it's likely caused by your `hoodie.table.partition.fields`

[jira] [Updated] (HUDI-2287) Partition pruning not working on Hudi dataset

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2287: - Priority: Major (was: Blocker) > Partition pruning not working on Hudi dataset >

[jira] [Updated] (HUDI-1430) Implement SparkDataFrameWriteClient with SimpleIndex

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1430: - Parent: HUDI-2531 Issue Type: Sub-task (was: Improvement) > Implement SparkDataFrameWriteClient

[jira] [Updated] (HUDI-1430) Implement SparkDataFrameWriteClient with SimpleIndex

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1430: - Summary: Implement SparkDataFrameWriteClient with SimpleIndex (was: Support Dataset write w/o conversion

[jira] [Updated] (HUDI-2621) Enhance DataFrameWriter with small file handling

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2621: - Summary: Enhance DataFrameWriter with small file handling (was: Optimize DataFrameWriter on small file

[jira] [Updated] (HUDI-2287) Partition pruning not working on Hudi dataset

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2287: - Priority: Blocker (was: Major) > Partition pruning not working on Hudi dataset >

[jira] [Updated] (HUDI-2615) Decouple HoodieRecordPayload with Hoodie table, table services, and index

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2615: - Status: In Progress (was: Open) > Decouple HoodieRecordPayload with Hoodie table, table services, and

[jira] [Updated] (HUDI-2287) Partition pruning not working on Hudi dataset

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2287: - Status: Patch Available (was: In Progress) > Partition pruning not working on Hudi dataset >

[jira] [Commented] (HUDI-1970) Performance testing/certification of key SQL DMLs

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17433587#comment-17433587 ] Raymond Xu commented on HUDI-1970: -- * 1B records (randomized values in the example trip model) * 100

[jira] [Updated] (HUDI-1970) Performance testing/certification of key SQL DMLs

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1970: - Status: In Progress (was: Open) > Performance testing/certification of key SQL DMLs >

[jira] [Updated] (HUDI-1430) Implement SparkDataFrameWriteClient with SimpleIndex

2021-10-25 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-1430: - Story Points: 3 (was: 2) > Implement SparkDataFrameWriteClient with SimpleIndex >

<    2   3   4   5   6   7   8   9   10   11   >