[jira] [Created] (HUDI-7674) Hudi CLI : Command "metadata validate-files" not using file listing to validate

2024-04-25 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-7674: Summary: Hudi CLI : Command "metadata validate-files" not using file listing to validate Key: HUDI-7674 URL: https://issues.apache.org/jira/browse/HUDI-7674

[jira] [Assigned] (HUDI-7008) Fixing usage of Kafka Avro deserializer w/ debezium sources

2023-10-30 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan reassigned HUDI-7008: Assignee: sivabalan narayanan > Fixing usage of Kafka Avro deserializer w/

[jira] [Created] (HUDI-7008) Fixing usage of Kafka Avro deserializer w/ debezium sources

2023-10-30 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-7008: Summary: Fixing usage of Kafka Avro deserializer w/ debezium sources Key: HUDI-7008 URL: https://issues.apache.org/jira/browse/HUDI-7008 Project: Apache Hudi

[jira] [Created] (HUDI-5933) Fix NullPointer Exception in MultiTableDeltaStreamer when Transformer_class config is not set

2023-03-14 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-5933: Summary: Fix NullPointer Exception in MultiTableDeltaStreamer when Transformer_class config is not set Key: HUDI-5933 URL: https://issues.apache.org/jira/browse/HUDI-5933

[jira] [Commented] (HUDI-2761) IllegalArgException from timeline server when serving getLastestBaseFiles with multi-writer

2021-11-23 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17448275#comment-17448275 ] Balaji Varadarajan commented on HUDI-2761: -- [~shivnarayan]: Not sure if I understood why you

[jira] [Created] (HUDI-2166) Support Alter table drop column

2021-07-12 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-2166: Summary: Support Alter table drop column Key: HUDI-2166 URL: https://issues.apache.org/jira/browse/HUDI-2166 Project: Apache Hudi Issue Type:

[jira] [Created] (HUDI-1741) Row Level TTL Support for records stored in Hudi

2021-03-30 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1741: Summary: Row Level TTL Support for records stored in Hudi Key: HUDI-1741 URL: https://issues.apache.org/jira/browse/HUDI-1741 Project: Apache Hudi

[jira] [Commented] (HUDI-1741) Row Level TTL Support for records stored in Hudi

2021-03-30 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17311938#comment-17311938 ] Balaji Varadarajan commented on HUDI-1741: -- [~shivnarayan]: FYI > Row Level TTL Support for

[jira] [Commented] (HUDI-1724) run_sync_tool support for hive3.1.2 on hadoop3.1.4

2021-03-26 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17309272#comment-17309272 ] Balaji Varadarajan commented on HUDI-1724: -- [~shivnarayan]: Can you please triage this >

[jira] [Created] (HUDI-1724) run_sync_tool support for hive3.1.2 on hadoop3.1.4

2021-03-26 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1724: Summary: run_sync_tool support for hive3.1.2 on hadoop3.1.4 Key: HUDI-1724 URL: https://issues.apache.org/jira/browse/HUDI-1724 Project: Apache Hudi

[jira] [Commented] (HUDI-1711) Avro Schema Exception with Spark 3.0 in 0.7

2021-03-23 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17307095#comment-17307095 ] Balaji Varadarajan commented on HUDI-1711: -- [~shivnarayan]: Can you triage this issue when you

[jira] [Created] (HUDI-1711) Avro Schema Exception with Spark 3.0 in 0.7

2021-03-23 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1711: Summary: Avro Schema Exception with Spark 3.0 in 0.7 Key: HUDI-1711 URL: https://issues.apache.org/jira/browse/HUDI-1711 Project: Apache Hudi Issue

[jira] [Commented] (HUDI-1640) Implement Spark Datasource option to read hudi configs from properties file

2021-02-25 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17290922#comment-17290922 ] Balaji Varadarajan commented on HUDI-1640: -- [~shivnarayan]: Can you vet this and add to the work

[jira] [Created] (HUDI-1640) Implement Spark Datasource option to read hudi configs from properties file

2021-02-25 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1640: Summary: Implement Spark Datasource option to read hudi configs from properties file Key: HUDI-1640 URL: https://issues.apache.org/jira/browse/HUDI-1640

[jira] [Commented] (HUDI-1608) MOR fetches all records for read optimized query w/ spark sql

2021-02-10 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17282851#comment-17282851 ] Balaji Varadarajan commented on HUDI-1608: -- [~shivnarayan]: You need to set 

[jira] [Created] (HUDI-1523) Avoid excessive mkdir calls when creating new files

2021-01-11 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1523: Summary: Avoid excessive mkdir calls when creating new files Key: HUDI-1523 URL: https://issues.apache.org/jira/browse/HUDI-1523 Project: Apache Hudi

[jira] [Created] (HUDI-1505) Allow pluggable option to write error records to side table, queue

2021-01-04 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1505: Summary: Allow pluggable option to write error records to side table, queue Key: HUDI-1505 URL: https://issues.apache.org/jira/browse/HUDI-1505 Project:

[jira] [Created] (HUDI-1501) Explore providing ways to auto-tune input record size based on incoming payload

2020-12-31 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1501: Summary: Explore providing ways to auto-tune input record size based on incoming payload Key: HUDI-1501 URL: https://issues.apache.org/jira/browse/HUDI-1501

[jira] [Updated] (HUDI-1501) Explore providing ways to auto-tune input record size based on incoming payload

2020-12-31 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1501: - Status: Open (was: New) > Explore providing ways to auto-tune input record size based on

[jira] [Assigned] (HUDI-1499) Support configuration to let user override record-size estimate

2020-12-29 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan reassigned HUDI-1499: Assignee: sivabalan narayanan > Support configuration to let user override

[jira] [Created] (HUDI-1499) Support configuration to let user override record-size estimate

2020-12-29 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1499: Summary: Support configuration to let user override record-size estimate Key: HUDI-1499 URL: https://issues.apache.org/jira/browse/HUDI-1499 Project:

[jira] [Updated] (HUDI-1499) Support configuration to let user override record-size estimate

2020-12-29 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1499: - Status: Open (was: New) > Support configuration to let user override record-size

[jira] [Updated] (HUDI-1497) Timeout Exception during getFileStatus()

2020-12-28 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1497: - Status: Open (was: New) > Timeout Exception during getFileStatus() >

[jira] [Created] (HUDI-1497) Timeout Exception during getFileStatus()

2020-12-28 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1497: Summary: Timeout Exception during getFileStatus() Key: HUDI-1497 URL: https://issues.apache.org/jira/browse/HUDI-1497 Project: Apache Hudi Issue

[jira] [Assigned] (HUDI-1496) Seek Error when querying MOR tables in GCP

2020-12-28 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan reassigned HUDI-1496: Assignee: sivabalan narayanan > Seek Error when querying MOR tables in GCP >

[jira] [Updated] (HUDI-1496) Seek Error when querying MOR tables in GCP

2020-12-28 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1496: - Status: Open (was: New) > Seek Error when querying MOR tables in GCP >

[jira] [Created] (HUDI-1496) Seek Error when querying MOR tables in GCP

2020-12-28 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1496: Summary: Seek Error when querying MOR tables in GCP Key: HUDI-1496 URL: https://issues.apache.org/jira/browse/HUDI-1496 Project: Apache Hudi Issue

[jira] [Updated] (HUDI-1490) Incremental Query fails if there are partitions that have no incremental changes

2020-12-23 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1490: - Status: Open (was: New) > Incremental Query fails if there are partitions that have no

[jira] [Created] (HUDI-1490) Incremental Query fails if there are partitions that have no incremental changes

2020-12-23 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1490: Summary: Incremental Query fails if there are partitions that have no incremental changes Key: HUDI-1490 URL: https://issues.apache.org/jira/browse/HUDI-1490

[jira] [Commented] (HUDI-1475) Fix documentation of preCombine to clarify when this API is used by Hudi

2020-12-18 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17252023#comment-17252023 ] Balaji Varadarajan commented on HUDI-1475: -- Relevant Issue: 

[jira] [Created] (HUDI-1475) Fix documentation of preCombine to clarify when this API is used by Hudi

2020-12-18 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1475: Summary: Fix documentation of preCombine to clarify when this API is used by Hudi Key: HUDI-1475 URL: https://issues.apache.org/jira/browse/HUDI-1475

[jira] [Updated] (HUDI-1475) Fix documentation of preCombine to clarify when this API is used by Hudi

2020-12-18 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1475: - Status: Open (was: New) > Fix documentation of preCombine to clarify when this API is

[jira] [Updated] (HUDI-1452) RocksDB FileSystemView throwing NotSerializableError when embedded timeline server is turned off

2020-12-10 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1452: - Description: [https://github.com/apache/hudi/issues/2321]   We need to make

[jira] [Updated] (HUDI-1452) RocksDB FileSystemView throwing NotSerializableError when embedded timeline server is turned off

2020-12-10 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1452: - Description: [https://github.com/apache/hudi/issues/2321]   We need to make 

[jira] [Updated] (HUDI-1452) RocksDB FileSystemView throwing NotSerializableError when embedded timeline server is turned off

2020-12-10 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1452: - Status: Open (was: New) > RocksDB FileSystemView throwing NotSerializableError when

[jira] [Created] (HUDI-1452) RocksDB FileSystemView throwing NotSerializableError when embedded timeline server is turned off

2020-12-10 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1452: Summary: RocksDB FileSystemView throwing NotSerializableError when embedded timeline server is turned off Key: HUDI-1452 URL:

[jira] [Assigned] (HUDI-1452) RocksDB FileSystemView throwing NotSerializableError when embedded timeline server is turned off

2020-12-10 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan reassigned HUDI-1452: Assignee: Sreeram Ramji > RocksDB FileSystemView throwing NotSerializableError

[jira] [Updated] (HUDI-1440) Allow option to override schema when doing spark.write

2020-12-08 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1440: - Status: Open (was: New) > Allow option to override schema when doing spark.write >

[jira] [Created] (HUDI-1440) Allow option to override schema when doing spark.write

2020-12-08 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1440: Summary: Allow option to override schema when doing spark.write Key: HUDI-1440 URL: https://issues.apache.org/jira/browse/HUDI-1440 Project: Apache Hudi

[jira] [Created] (HUDI-1436) Provide Option to run auto clean every nth commit.

2020-12-07 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1436: Summary: Provide Option to run auto clean every nth commit. Key: HUDI-1436 URL: https://issues.apache.org/jira/browse/HUDI-1436 Project: Apache Hudi

[jira] [Updated] (HUDI-1436) Provide Option to run auto clean every nth commit.

2020-12-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1436: - Status: Open (was: New) > Provide Option to run auto clean every nth commit. >

[jira] [Assigned] (HUDI-1436) Provide Option to run auto clean every nth commit.

2020-12-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan reassigned HUDI-1436: Assignee: Sreeram Ramji > Provide Option to run auto clean every nth commit. >

[jira] [Updated] (HUDI-1435) Marker File Reconciliation failing for Non-Partitioned datasets when duplicate marker files present

2020-12-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1435: - Status: Patch Available (was: In Progress) > Marker File Reconciliation failing for

[jira] [Updated] (HUDI-1435) Marker File Reconciliation failing for Non-Partitioned datasets when duplicate marker files present

2020-12-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1435: - Status: Open (was: New) > Marker File Reconciliation failing for Non-Partitioned

[jira] [Updated] (HUDI-1435) Marker File Reconciliation failing for Non-Partitioned datasets when duplicate marker files present

2020-12-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1435: - Status: In Progress (was: Open) > Marker File Reconciliation failing for Non-Partitioned

[jira] [Updated] (HUDI-1435) Marker File Reconciliation failing for Non-Partitioned datasets when duplicate marker files present

2020-12-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1435: - Status: In Progress (was: Open) > Marker File Reconciliation failing for Non-Partitioned

[jira] [Updated] (HUDI-1435) Marker File Reconciliation failing for Non-Partitioned datasets when duplicate marker files present

2020-12-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1435: - Summary: Marker File Reconciliation failing for Non-Partitioned datasets when duplicate

[jira] [Assigned] (HUDI-1435) Marker File Reconciliation failing for Non-Partitioned Paths when duplicate marker files present

2020-12-07 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan reassigned HUDI-1435: Assignee: Balaji Varadarajan > Marker File Reconciliation failing for

[jira] [Created] (HUDI-1435) Marker File Reconciliation failing for Non-Partitioned Paths when duplicate marker files present

2020-12-07 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1435: Summary: Marker File Reconciliation failing for Non-Partitioned Paths when duplicate marker files present Key: HUDI-1435 URL:

[jira] [Commented] (HUDI-1329) Support async compaction in spark DF write()

2020-12-03 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17243675#comment-17243675 ] Balaji Varadarajan commented on HUDI-1329: -- [~309637554]: This API allows only running

[jira] [Updated] (HUDI-1413) Need binary release of Hudi to distribute tools like hudi-cli.sh and hudi-sync

2020-11-23 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1413: - Fix Version/s: 0.7.0 > Need binary release of Hudi to distribute tools like hudi-cli.sh

[jira] [Created] (HUDI-1413) Need binary release of Hudi to distribute tools like hudi-cli.sh and hudi-sync

2020-11-23 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1413: Summary: Need binary release of Hudi to distribute tools like hudi-cli.sh and hudi-sync Key: HUDI-1413 URL: https://issues.apache.org/jira/browse/HUDI-1413

[jira] [Updated] (HUDI-1413) Need binary release of Hudi to distribute tools like hudi-cli.sh and hudi-sync

2020-11-23 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1413: - Status: Open (was: New) > Need binary release of Hudi to distribute tools like

[jira] [Created] (HUDI-1395) HoodieSnapshotCopier not working on non-partitioned datasets

2020-11-12 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1395: Summary: HoodieSnapshotCopier not working on non-partitioned datasets Key: HUDI-1395 URL: https://issues.apache.org/jira/browse/HUDI-1395 Project: Apache

[jira] [Updated] (HUDI-1395) HoodieSnapshotCopier not working on non-partitioned datasets

2020-11-12 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1395?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1395: - Status: Open (was: New) > HoodieSnapshotCopier not working on non-partitioned datasets >

[jira] [Commented] (HUDI-1205) Serialization fail when log file is larger than 2GB

2020-11-11 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17230386#comment-17230386 ] Balaji Varadarajan commented on HUDI-1205: -- [~leehuynh] [~zuyanton] [~garyli1019] Please see the

[jira] [Commented] (HUDI-1205) Serialization fail when log file is larger than 2GB

2020-11-11 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17230385#comment-17230385 ] Balaji Varadarajan commented on HUDI-1205: -- This is likely fixed as part of

[jira] [Updated] (HUDI-1383) Incorrect partitions getting hive synced

2020-11-09 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1383: - Status: Open (was: New) > Incorrect partitions getting hive synced >

[jira] [Created] (HUDI-1383) Incorrect partitions getting hive synced

2020-11-09 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1383: Summary: Incorrect partitions getting hive synced Key: HUDI-1383 URL: https://issues.apache.org/jira/browse/HUDI-1383 Project: Apache Hudi Issue

[jira] [Created] (HUDI-1381) Schedule compaction based on time elapsed

2020-11-09 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1381: Summary: Schedule compaction based on time elapsed Key: HUDI-1381 URL: https://issues.apache.org/jira/browse/HUDI-1381 Project: Apache Hudi Issue

[jira] [Updated] (HUDI-1381) Schedule compaction based on time elapsed

2020-11-09 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1381: - Status: Open (was: New) > Schedule compaction based on time elapsed >

[jira] [Updated] (HUDI-1309) Listing Metadata unreadable in S3 as the log block is deemed corrupted

2020-11-05 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1309: - Status: Open (was: New) > Listing Metadata unreadable in S3 as the log block is deemed

[jira] [Commented] (HUDI-1365) Listing leaf files and directories is very Slow

2020-11-03 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17225477#comment-17225477 ] Balaji Varadarajan commented on HUDI-1365: --

[jira] [Commented] (HUDI-1365) Listing leaf files and directories is very Slow

2020-11-02 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17224750#comment-17224750 ] Balaji Varadarajan commented on HUDI-1365: -- [~Selvaraj.periyasamy1983]: 0.5.0 is a very old

[jira] [Created] (HUDI-1368) Merge On Read Snapshot Reader not working for Databricks on ADLS Gen2

2020-11-02 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1368: Summary: Merge On Read Snapshot Reader not working for Databricks on ADLS Gen2 Key: HUDI-1368 URL: https://issues.apache.org/jira/browse/HUDI-1368 Project:

[jira] [Updated] (HUDI-1368) Merge On Read Snapshot Reader not working for Databricks on ADLS Gen2

2020-11-02 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1368: - Status: Open (was: New) > Merge On Read Snapshot Reader not working for Databricks on

[jira] [Updated] (HUDI-1363) Provide Option to drop columns after they are used to generate partition or record keys

2020-10-30 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1363: - Status: Open (was: New) > Provide Option to drop columns after they are used to generate

[jira] [Created] (HUDI-1363) Provide Option to drop columns after they are used to generate partition or record keys

2020-10-30 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1363: Summary: Provide Option to drop columns after they are used to generate partition or record keys Key: HUDI-1363 URL: https://issues.apache.org/jira/browse/HUDI-1363

[jira] [Assigned] (HUDI-1358) Memory Leak in HoodieLogFormatWriter

2020-10-29 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan reassigned HUDI-1358: Assignee: Balaji Varadarajan > Memory Leak in HoodieLogFormatWriter >

[jira] [Created] (HUDI-1358) Memory Leak in HoodieLogFormatWriter

2020-10-29 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1358: Summary: Memory Leak in HoodieLogFormatWriter Key: HUDI-1358 URL: https://issues.apache.org/jira/browse/HUDI-1358 Project: Apache Hudi Issue Type:

[jira] [Updated] (HUDI-1358) Memory Leak in HoodieLogFormatWriter

2020-10-29 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1358: - Status: Open (was: New) > Memory Leak in HoodieLogFormatWriter >

[jira] [Commented] (HUDI-1350) Support Partition level delete API in HUDI on top on Insert Overwrite

2020-10-23 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17219950#comment-17219950 ] Balaji Varadarajan commented on HUDI-1350: -- Yes, [~309637554]: You can change the API to take in

[jira] [Commented] (HUDI-1340) Not able to query real time table when rows contains nested elements

2020-10-22 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17219370#comment-17219370 ] Balaji Varadarajan commented on HUDI-1340: -- This is likely related to parquet (serde and related

[jira] [Commented] (HUDI-1340) Not able to query real time table when rows contains nested elements

2020-10-19 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17216913#comment-17216913 ] Balaji Varadarajan commented on HUDI-1340: -- [~bdighe]: Did you use --conf

[jira] [Updated] (HUDI-1340) Not able to query real time table when rows contains nested elements

2020-10-19 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1340: - Status: Open (was: New) > Not able to query real time table when rows contains nested

[jira] [Commented] (HUDI-845) Allow parallel writing and move the pending rollback work into cleaner

2020-10-16 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17215226#comment-17215226 ] Balaji Varadarajan commented on HUDI-845: - Yes, [~309637554]. This ticket is for tracking general

[jira] [Updated] (HUDI-1343) Add standard schema postprocessor which would rewrite the schema using spark-avro conversion

2020-10-13 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1343: - Fix Version/s: 0.7.0 > Add standard schema postprocessor which would rewrite the schema

[jira] [Created] (HUDI-1343) Add standard schema postprocessor which would rewrite the schema using spark-avro conversion

2020-10-13 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1343: Summary: Add standard schema postprocessor which would rewrite the schema using spark-avro conversion Key: HUDI-1343 URL: https://issues.apache.org/jira/browse/HUDI-1343

[jira] [Updated] (HUDI-1343) Add standard schema postprocessor which would rewrite the schema using spark-avro conversion

2020-10-13 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1343: - Status: Open (was: New) > Add standard schema postprocessor which would rewrite the

[jira] [Updated] (HUDI-1329) Support async compaction in spark DF write()

2020-10-09 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1329: - Status: Open (was: New) > Support async compaction in spark DF write() >

[jira] [Created] (HUDI-1329) Support async compaction in spark DF write()

2020-10-09 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1329: Summary: Support async compaction in spark DF write() Key: HUDI-1329 URL: https://issues.apache.org/jira/browse/HUDI-1329 Project: Apache Hudi Issue

[jira] [Updated] (HUDI-898) Need to add Schema parameter to HoodieRecordPayload::preCombine

2020-10-02 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-898?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-898: Status: Open (was: New) > Need to add Schema parameter to HoodieRecordPayload::preCombine >

[jira] [Assigned] (HUDI-898) Need to add Schema parameter to HoodieRecordPayload::preCombine

2020-10-02 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-898?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan reassigned HUDI-898: --- Assignee: Balaji Varadarajan > Need to add Schema parameter to

[jira] [Commented] (HUDI-1308) Issues found during testing RFC-15

2020-10-01 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17205435#comment-17205435 ] Balaji Varadarajan commented on HUDI-1308: -- cc [~vinoth] > Issues found during testing RFC-15 >

[jira] [Created] (HUDI-1311) Writes creating/updating large number of files seeing errors when deleting marker files in S3

2020-10-01 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1311: Summary: Writes creating/updating large number of files seeing errors when deleting marker files in S3 Key: HUDI-1311 URL: https://issues.apache.org/jira/browse/HUDI-1311

[jira] [Created] (HUDI-1310) Corruption Block Handling too slow in S3

2020-10-01 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1310: Summary: Corruption Block Handling too slow in S3 Key: HUDI-1310 URL: https://issues.apache.org/jira/browse/HUDI-1310 Project: Apache Hudi Issue

[jira] [Assigned] (HUDI-1308) Issues found during testing RFC-15

2020-10-01 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan reassigned HUDI-1308: Assignee: Balaji Varadarajan (was: Prashant Wason) > Issues found during testing

[jira] [Assigned] (HUDI-1308) Issues found during testing RFC-15

2020-10-01 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan reassigned HUDI-1308: Assignee: Prashant Wason > Issues found during testing RFC-15 >

[jira] [Assigned] (HUDI-1309) Listing Metadata unreadable in S3 as the log block is deemed corrupted

2020-10-01 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan reassigned HUDI-1309: Assignee: Prashant Wason > Listing Metadata unreadable in S3 as the log block is

[jira] [Created] (HUDI-1309) Listing Metadata unreadable in S3 as the log block is deemed corrupted

2020-10-01 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1309: Summary: Listing Metadata unreadable in S3 as the log block is deemed corrupted Key: HUDI-1309 URL: https://issues.apache.org/jira/browse/HUDI-1309 Project:

[jira] [Created] (HUDI-1308) Issues found during testing RFC-15

2020-10-01 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1308: Summary: Issues found during testing RFC-15 Key: HUDI-1308 URL: https://issues.apache.org/jira/browse/HUDI-1308 Project: Apache Hudi Issue Type:

[jira] [Updated] (HUDI-1308) Issues found during testing RFC-15

2020-10-01 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1308: - Status: Open (was: New) > Issues found during testing RFC-15 >

[jira] [Commented] (HUDI-1257) Insert only write operations should preserve duplicate records

2020-09-24 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17201628#comment-17201628 ] Balaji Varadarajan commented on HUDI-1257: -- [~nicholasjiang]: Yes, they are same. You can dupe

[jira] [Updated] (HUDI-1290) Implement Debezium avro source for Delta Streamer

2020-09-21 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1290: - Status: Open (was: New) > Implement Debezium avro source for Delta Streamer >

[jira] [Assigned] (HUDI-1290) Implement Debezium avro source for Delta Streamer

2020-09-21 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan reassigned HUDI-1290: Assignee: Balaji Varadarajan > Implement Debezium avro source for Delta Streamer >

[jira] [Created] (HUDI-1290) Implement Debezium avro source for Delta Streamer

2020-09-21 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1290: Summary: Implement Debezium avro source for Delta Streamer Key: HUDI-1290 URL: https://issues.apache.org/jira/browse/HUDI-1290 Project: Apache Hudi

[jira] [Updated] (HUDI-1270) NoSuchMethod PartitionedFile on AWS EMR Spark 2.4.5

2020-09-13 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1270: - Status: Open (was: New) > NoSuchMethod PartitionedFile on AWS EMR Spark 2.4.5 >

[jira] [Commented] (HUDI-1270) NoSuchMethod PartitionedFile on AWS EMR Spark 2.4.5

2020-09-13 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17195158#comment-17195158 ] Balaji Varadarajan commented on HUDI-1270: -- [~uditme] : Pinging > NoSuchMethod PartitionedFile

[jira] [Created] (HUDI-1280) Add tool to capture earliest or latest offsets in kafka topics

2020-09-13 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1280: Summary: Add tool to capture earliest or latest offsets in kafka topics Key: HUDI-1280 URL: https://issues.apache.org/jira/browse/HUDI-1280 Project: Apache

[jira] [Updated] (HUDI-1280) Add tool to capture earliest or latest offsets in kafka topics

2020-09-13 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1280: - Status: Open (was: New) > Add tool to capture earliest or latest offsets in kafka topics
