[jira] [Assigned] (HUDI-6522) RFC for Hudi Reverse Streamer

2023-07-11 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma reassigned HUDI-6522: -- Assignee: Pratyaksh Sharma > RFC for Hudi Reverse Streamer >

[jira] [Updated] (HUDI-6522) RFC for Hudi Reverse Streamer

2023-07-11 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-6522: --- Status: In Progress (was: Open) > RFC for Hudi Reverse Streamer >

[jira] [Created] (HUDI-6522) RFC for Hudi Reverse Streamer

2023-07-11 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-6522: -- Summary: RFC for Hudi Reverse Streamer Key: HUDI-6522 URL: https://issues.apache.org/jira/browse/HUDI-6522 Project: Apache Hudi Issue Type: New Feature

[jira] [Created] (HUDI-6425) Hudi Reverse Streamer

2023-06-22 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-6425: -- Summary: Hudi Reverse Streamer Key: HUDI-6425 URL: https://issues.apache.org/jira/browse/HUDI-6425 Project: Apache Hudi Issue Type: Epic

[jira] [Created] (HUDI-6421) claim RFC for hudi reverse streamer

2023-06-22 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-6421: -- Summary: claim RFC for hudi reverse streamer Key: HUDI-6421 URL: https://issues.apache.org/jira/browse/HUDI-6421 Project: Apache Hudi Issue Type: Task

[jira] [Created] (HUDI-5946) Add glue sync configs to website

2023-03-16 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-5946: -- Summary: Add glue sync configs to website Key: HUDI-5946 URL: https://issues.apache.org/jira/browse/HUDI-5946 Project: Apache Hudi Issue Type:

[jira] [Updated] (HUDI-5903) Make number of max concurrent glue connections configurable

2023-03-16 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-5903: --- Status: In Progress (was: Open) > Make number of max concurrent glue connections

[jira] [Created] (HUDI-5903) Make number of max concurrent glue connections configurable

2023-03-07 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-5903: -- Summary: Make number of max concurrent glue connections configurable Key: HUDI-5903 URL: https://issues.apache.org/jira/browse/HUDI-5903 Project: Apache Hudi

[jira] [Created] (HUDI-5902) Parallelise glue sync calls

2023-03-07 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-5902: -- Summary: Parallelise glue sync calls Key: HUDI-5902 URL: https://issues.apache.org/jira/browse/HUDI-5902 Project: Apache Hudi Issue Type: Improvement

[jira] [Closed] (HUDI-5687) missing records when Delta streamer is run in continuous mode

2023-02-02 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma closed HUDI-5687. -- Resolution: Duplicate > missing records when Delta streamer is run in continuous mode >

[jira] [Created] (HUDI-5687) missing records when Delta streamer is run in continuous mode

2023-02-02 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-5687: -- Summary: missing records when Delta streamer is run in continuous mode Key: HUDI-5687 URL: https://issues.apache.org/jira/browse/HUDI-5687 Project: Apache Hudi

[jira] [Resolved] (HUDI-5527) Can't set keygen class in bootstrap

2023-01-11 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma resolved HUDI-5527. > Can't set keygen class in bootstrap > --- > >

[jira] [Commented] (HUDI-5527) Can't set keygen class in bootstrap

2023-01-11 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5527?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17672316#comment-17672316 ] Pratyaksh Sharma commented on HUDI-5527:

[jira] [Created] (HUDI-5497) Update KafkaOffsetGen configs on all configurations page

2023-01-04 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-5497: -- Summary: Update KafkaOffsetGen configs on all configurations page Key: HUDI-5497 URL: https://issues.apache.org/jira/browse/HUDI-5497 Project: Apache Hudi

[jira] [Updated] (HUDI-5015) Cleaner does not work properly when metadata table is enabled

2022-10-11 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-5015: --- Fix Version/s: 0.13.0 > Cleaner does not work properly when metadata table is enabled >

[jira] [Created] (HUDI-5015) Cleaner does not work properly when metadata table is enabled

2022-10-11 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-5015: -- Summary: Cleaner does not work properly when metadata table is enabled Key: HUDI-5015 URL: https://issues.apache.org/jira/browse/HUDI-5015 Project: Apache Hudi

[jira] [Commented] (HUDI-4974) Avoid creating Configuration copies in Hudi

2022-10-03 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17612377#comment-17612377 ] Pratyaksh Sharma commented on HUDI-4974: [https://github.com/prestodb/presto/pull/18441] fixes

[jira] [Updated] (HUDI-3676) Enhance tests for triggering clean every Nth commit

2022-08-17 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-3676: --- Status: In Progress (was: Open) > Enhance tests for triggering clean every Nth commit >

[jira] [Updated] (HUDI-4634) update schema provider configuration in MTDS blog

2022-08-17 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-4634: --- Status: In Progress (was: Open) > update schema provider configuration in MTDS blog >

[jira] [Created] (HUDI-4634) update schema provider configuration in MTDS blog

2022-08-17 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-4634: -- Summary: update schema provider configuration in MTDS blog Key: HUDI-4634 URL: https://issues.apache.org/jira/browse/HUDI-4634 Project: Apache Hudi

[jira] [Updated] (HUDI-4630) Allow different transformers for different tables getting ingested with HoodieMultiTableDeltaStreamer

2022-08-16 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-4630: --- Component/s: deltastreamer > Allow different transformers for different tables getting

[jira] [Updated] (HUDI-4630) Allow different transformers for different tables getting ingested with HoodieMultiTableDeltaStreamer

2022-08-16 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-4630: --- Labels: delta (was: ) > Allow different transformers for different tables getting ingested

[jira] [Updated] (HUDI-4630) Allow different transformers for different tables getting ingested with HoodieMultiTableDeltaStreamer

2022-08-16 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-4630: --- Labels: newbie (was: delta) > Allow different transformers for different tables getting

[jira] [Created] (HUDI-4630) Allow different transformers for different tables getting ingested with HoodieMultiTableDeltaStreamer

2022-08-16 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-4630: -- Summary: Allow different transformers for different tables getting ingested with HoodieMultiTableDeltaStreamer Key: HUDI-4630 URL:

[jira] [Created] (HUDI-4581) Claim RFC-58

2022-08-09 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-4581: -- Summary: Claim RFC-58 Key: HUDI-4581 URL: https://issues.apache.org/jira/browse/HUDI-4581 Project: Apache Hudi Issue Type: Task Reporter:

[jira] [Updated] (HUDI-4552) RFC-58: Integrate column stats index with query engines other than spark

2022-08-05 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-4552: --- Status: In Progress (was: Open) > RFC-58: Integrate column stats index with query engines

[jira] [Updated] (HUDI-4552) RFC-58: Integrate column stats index with query engines other than spark

2022-08-05 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-4552: --- Labels: hudi-umbrellas (was: ) > RFC-58: Integrate column stats index with query engines

[jira] [Created] (HUDI-4552) RFC-58: Integrate column stats index with query engines other than spark

2022-08-05 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-4552: -- Summary: RFC-58: Integrate column stats index with query engines other than spark Key: HUDI-4552 URL: https://issues.apache.org/jira/browse/HUDI-4552 Project:

[jira] [Assigned] (HUDI-4394) Metadata Indexes integration with Presto/Trino

2022-07-14 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma reassigned HUDI-4394: -- Assignee: Pratyaksh Sharma > Metadata Indexes integration with Presto/Trino >

[jira] [Updated] (HUDI-4364) integrate column stats index with presto engine

2022-07-05 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-4364: --- Status: In Progress (was: Open) > integrate column stats index with presto engine >

[jira] [Created] (HUDI-4364) integrate column stats index with presto engine

2022-07-05 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-4364: -- Summary: integrate column stats index with presto engine Key: HUDI-4364 URL: https://issues.apache.org/jira/browse/HUDI-4364 Project: Apache Hudi Issue

[jira] [Created] (HUDI-4131) investigate the difference between hive_sync.table and hoodie.table.name for flink engine.

2022-05-20 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-4131: -- Summary: investigate the difference between hive_sync.table and hoodie.table.name for flink engine. Key: HUDI-4131 URL: https://issues.apache.org/jira/browse/HUDI-4131

[jira] [Commented] (HUDI-3690) use all the coming records to update the existing

2022-05-09 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17533741#comment-17533741 ] Pratyaksh Sharma commented on HUDI-3690: Guess this is related to

[jira] [Commented] (HUDI-541) Replace variables/comments named "data files" to "base file"

2022-04-28 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529460#comment-17529460 ] Pratyaksh Sharma commented on HUDI-541: --- Sure. The PR has been long pending for review. Let me ping

[jira] [Commented] (HUDI-1588) Support multiple ordering fields via payload class config

2022-03-24 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17511618#comment-17511618 ] Pratyaksh Sharma commented on HUDI-1588: [~xushiyan] Let us push it to 0.12 > Support multiple

[jira] [Commented] (HUDI-3676) Enhance tests for triggering clean every Nth commit

2022-03-24 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17511617#comment-17511617 ] Pratyaksh Sharma commented on HUDI-3676: [~xushiyan] There is no PR right now, I will raise one

[jira] [Created] (HUDI-3671) Fix logic for deltastreamer consuming message based on kafka timestamp

2022-03-20 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-3671: -- Summary: Fix logic for deltastreamer consuming message based on kafka timestamp Key: HUDI-3671 URL: https://issues.apache.org/jira/browse/HUDI-3671 Project:

[jira] [Updated] (HUDI-1549) Programmatic way to fetch earliest commit retained

2022-03-19 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-1549: --- Status: In Progress (was: Open) > Programmatic way to fetch earliest commit retained >

[jira] [Assigned] (HUDI-1436) Provide Option to run auto clean every nth commit.

2022-03-19 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma reassigned HUDI-1436: -- Assignee: Pratyaksh Sharma (was: sivabalan narayanan) > Provide Option to run auto

[jira] [Assigned] (HUDI-2719) Add back support for copying over extra metadata from previous commit

2022-03-07 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma reassigned HUDI-2719: -- Assignee: Pratyaksh Sharma (was: Nishith Agarwal) > Add back support for copying over

[jira] [Assigned] (HUDI-1619) Add tests to Multitable delta streamer for more sources and mix of diff sources

2022-03-07 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma reassigned HUDI-1619: -- Assignee: Pratyaksh Sharma > Add tests to Multitable delta streamer for more sources

[jira] [Assigned] (HUDI-1564) Blog: Dfs -> Hudi followed by Kafka to Hudi

2022-03-07 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma reassigned HUDI-1564: -- Assignee: Pratyaksh Sharma > Blog: Dfs -> Hudi followed by Kafka to Hudi >

[jira] [Assigned] (HUDI-1562) Delta streamer checkpointing documentation

2022-03-07 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma reassigned HUDI-1562: -- Assignee: Pratyaksh Sharma (was: Nishith Agarwal) > Delta streamer checkpointing

[jira] [Assigned] (HUDI-2318) Enhance and stablize multi-table deltastreamer

2022-03-07 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma reassigned HUDI-2318: -- Assignee: Pratyaksh Sharma > Enhance and stablize multi-table deltastreamer >

[jira] [Assigned] (HUDI-1881) HoodieMultiTableDeltaStreamer does not ingest from all topics when using continuous mode

2022-03-07 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma reassigned HUDI-1881: -- Assignee: Pratyaksh Sharma (was: Nishith Agarwal) > HoodieMultiTableDeltaStreamer

[jira] [Assigned] (HUDI-1549) Programmatic way to fetch earliest commit retained

2022-03-07 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma reassigned HUDI-1549: -- Assignee: Pratyaksh Sharma > Programmatic way to fetch earliest commit retained >

[jira] [Updated] (HUDI-1258) Small file handling Merges can be handled without actual merging

2022-02-09 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-1258: --- Status: In Progress (was: Open) > Small file handling Merges can be handled without actual

[jira] [Assigned] (HUDI-1258) Small file handling Merges can be handled without actual merging

2022-02-09 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma reassigned HUDI-1258: -- Assignee: Pratyaksh Sharma (was: Raymond Xu) > Small file handling Merges can be

[jira] [Updated] (HUDI-3264) Make schema registry configs more flexible with MultiTableDeltaStreamer

2022-02-09 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-3264: --- Status: Patch Available (was: In Progress) > Make schema registry configs more flexible with

[jira] [Updated] (HUDI-349) Make cleaner retention based on time period to account for higher deviations in ingestion runs

2022-02-09 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-349: -- Fix Version/s: 0.11.0 > Make cleaner retention based on time period to account for higher

[jira] [Commented] (HUDI-3264) Make schema registry configs more flexible with MultiTableDeltaStreamer

2022-02-09 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17489796#comment-17489796 ] Pratyaksh Sharma commented on HUDI-3264: The best and the easiest possible fix is to allow users

[jira] [Updated] (HUDI-3264) Make schema registry configs more flexible with MultiTableDeltaStreamer

2022-02-08 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-3264: --- Status: In Progress (was: Open) > Make schema registry configs more flexible with

[jira] [Assigned] (HUDI-3264) Make schema registry configs more flexible with MultiTableDeltaStreamer

2022-02-08 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma reassigned HUDI-3264: -- Assignee: Pratyaksh Sharma > Make schema registry configs more flexible with

[jira] [Closed] (HUDI-382) Move TimestampBasedKeyGenerator to hudi-spark module from hudi-utilities

2022-02-08 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma closed HUDI-382. - Resolution: Fixed This key generator is a part of hudi-spark-client now. > Move

[jira] [Created] (HUDI-3390) Update cleaner blog with KEEP_LATEST_BY_HOURS policy

2022-02-08 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-3390: -- Summary: Update cleaner blog with KEEP_LATEST_BY_HOURS policy Key: HUDI-3390 URL: https://issues.apache.org/jira/browse/HUDI-3390 Project: Apache Hudi

[jira] [Updated] (HUDI-1588) Support multiple ordering fields via payload class config

2021-12-19 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-1588: --- Status: In Progress (was: Open) > Support multiple ordering fields via payload class config

[jira] [Assigned] (HUDI-1588) Support multiple ordering fields via payload class config

2021-12-19 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma reassigned HUDI-1588: -- Assignee: Pratyaksh Sharma > Support multiple ordering fields via payload class config

[jira] [Assigned] (HUDI-1436) Provide Option to run auto clean every nth commit.

2021-10-26 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma reassigned HUDI-1436: -- Assignee: Pratyaksh Sharma (was: Sreeram Ramji) > Provide Option to run auto clean

[jira] [Updated] (HUDI-1436) Provide Option to run auto clean every nth commit.

2021-10-26 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-1436: --- Status: In Progress (was: Open) > Provide Option to run auto clean every nth commit. >

[jira] [Closed] (HUDI-2496) Inserts are precombined even with dedup disabled

2021-10-26 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma closed HUDI-2496. -- > Inserts are precombined even with dedup disabled >

[jira] [Resolved] (HUDI-2496) Inserts are precombined even with dedup disabled

2021-10-26 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma resolved HUDI-2496. Resolution: Fixed > Inserts are precombined even with dedup disabled >

[jira] [Updated] (HUDI-1475) Fix documentation of preCombine to clarify when this API is used by Hudi

2021-10-26 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-1475: --- Status: In Progress (was: Open) > Fix documentation of preCombine to clarify when this API

[jira] [Updated] (HUDI-1475) Fix documentation of preCombine to clarify when this API is used by Hudi

2021-10-26 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-1475: --- Status: Patch Available (was: In Progress) > Fix documentation of preCombine to clarify when

[jira] [Assigned] (HUDI-1475) Fix documentation of preCombine to clarify when this API is used by Hudi

2021-10-26 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma reassigned HUDI-1475: -- Assignee: Pratyaksh Sharma > Fix documentation of preCombine to clarify when this API

[jira] [Commented] (HUDI-1549) Programmatic way to fetch earliest commit retained

2021-10-26 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1549?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17434178#comment-17434178 ] Pratyaksh Sharma commented on HUDI-1549: so I guess the requirement is as simple as exposing a

[jira] [Created] (HUDI-2543) Introduce guides section on website

2021-10-11 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-2543: -- Summary: Introduce guides section on website Key: HUDI-2543 URL: https://issues.apache.org/jira/browse/HUDI-2543 Project: Apache Hudi Issue Type: Task

[jira] [Updated] (HUDI-2543) Introduce guides section on website

2021-10-11 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-2543: --- Status: In Progress (was: Open) > Introduce guides section on website >

[jira] [Comment Edited] (HUDI-1362) Make deltastreamer support insert_overwrite

2021-09-13 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17414072#comment-17414072 ] Pratyaksh Sharma edited comment on HUDI-1362 at 9/13/21, 9:22 AM: -- No I

[jira] [Commented] (HUDI-1362) Make deltastreamer support insert_overwrite

2021-09-13 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17414072#comment-17414072 ] Pratyaksh Sharma commented on HUDI-1362: No I meant to say can you please elaborate the use case

[jira] [Commented] (HUDI-1362) Make deltastreamer support full overwrite

2021-09-13 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17414065#comment-17414065 ] Pratyaksh Sharma commented on HUDI-1362: [~liujinhui] insert_overwrite operation type is already

[jira] [Commented] (HUDI-2318) Enhance and stablize multi-table deltastreamer

2021-09-13 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17414060#comment-17414060 ] Pratyaksh Sharma commented on HUDI-2318: [~shivnarayan] Thank you for filing this issue. We can

[jira] [Commented] (HUDI-2312) Add support delete/delete_partition with deltastreamer

2021-09-13 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17414033#comment-17414033 ] Pratyaksh Sharma commented on HUDI-2312: I think this support is already available. Can we close

[jira] [Commented] (HUDI-1257) Insert only write operations should preserve duplicate records

2021-09-12 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17413787#comment-17413787 ] Pratyaksh Sharma commented on HUDI-1257: Guess we can close this given HUDI-1234 has already

[jira] [Closed] (HUDI-1257) Insert only write operations should preserve duplicate records

2021-09-12 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma closed HUDI-1257. -- Resolution: Duplicate HUDI-1234 is already merged in master.  > Insert only write operations

[jira] [Created] (HUDI-2419) Allow users to give timestamps for kafka offsets in custom formats

2021-09-12 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-2419: -- Summary: Allow users to give timestamps for kafka offsets in custom formats Key: HUDI-2419 URL: https://issues.apache.org/jira/browse/HUDI-2419 Project: Apache

[jira] [Updated] (HUDI-2416) Move FAQs to website

2021-09-11 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-2416: --- Status: Patch Available (was: In Progress) > Move FAQs to website > > >

[jira] [Updated] (HUDI-2416) Move FAQs to website

2021-09-11 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-2416: --- Status: In Progress (was: Open) > Move FAQs to website > > >

[jira] [Created] (HUDI-2416) Move FAQs to website

2021-09-11 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-2416: -- Summary: Move FAQs to website Key: HUDI-2416 URL: https://issues.apache.org/jira/browse/HUDI-2416 Project: Apache Hudi Issue Type: Improvement

[jira] [Updated] (HUDI-2397) Introduce --enable-sync parameter in HoodieMultiTableDeltaStreamer

2021-09-04 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-2397: --- Description: HoodieDeltaStreamer has introduced a new parameter enableMetaSync and

[jira] [Updated] (HUDI-2397) Introduce --enable-sync parameter in HoodieMultiTableDeltaStreamer

2021-09-04 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-2397: --- Description: HoodieDeltaStreamer has introduced a new parameter `--enable-sync` and

[jira] [Updated] (HUDI-2397) Introduce --enable-sync parameter in HoodieMultiTableDeltaStreamer

2021-09-04 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-2397: --- Description: HoodieDeltaStreamer has introduced a new parameter `---enable-sync` and

[jira] [Created] (HUDI-2397) Introduce --enable-sync parameter in HoodieMultiTableDeltaStreamer

2021-09-04 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-2397: -- Summary: Introduce --enable-sync parameter in HoodieMultiTableDeltaStreamer Key: HUDI-2397 URL: https://issues.apache.org/jira/browse/HUDI-2397 Project: Apache

[jira] [Resolved] (HUDI-2366) fix hudi generating too many logs

2021-09-04 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma resolved HUDI-2366. Resolution: Fixed > fix hudi generating too many logs > - >

[jira] [Commented] (HUDI-2370) Supports data encryption

2021-09-04 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17409997#comment-17409997 ] Pratyaksh Sharma commented on HUDI-2370: This is a very useful feature to have. > Supports data

[jira] [Created] (HUDI-1981) Introduce --enable-sync option in HoodieMultiTableDeltaStreamer

2021-06-05 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-1981: -- Summary: Introduce --enable-sync option in HoodieMultiTableDeltaStreamer Key: HUDI-1981 URL: https://issues.apache.org/jira/browse/HUDI-1981 Project: Apache Hudi

[jira] [Commented] (HUDI-349) Make cleaner retention based on time period to account for higher deviations in ingestion runs

2021-05-19 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17347813#comment-17347813 ] Pratyaksh Sharma commented on HUDI-349: --- Need to create a new Cleaning policy.  > Make cleaner

[jira] [Commented] (HUDI-1277) [DOC] Need documentation explaining how to write custom record payload class

2021-05-19 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17347715#comment-17347715 ] Pratyaksh Sharma commented on HUDI-1277: can we close this Jira then? > [DOC] Need documentation

[jira] [Created] (HUDI-1766) Write a detailed blog for HoodieCleaner

2021-04-05 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-1766: -- Summary: Write a detailed blog for HoodieCleaner Key: HUDI-1766 URL: https://issues.apache.org/jira/browse/HUDI-1766 Project: Apache Hudi Issue Type:

[jira] [Updated] (HUDI-1766) Write a detailed blog for HoodieCleaner

2021-04-05 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-1766: --- Status: In Progress (was: Open) > Write a detailed blog for HoodieCleaner >

[jira] [Comment Edited] (HUDI-73) Support vanilla Avro Kafka Source in HoodieDeltaStreamer

2021-04-05 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-73?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17315238#comment-17315238 ] Pratyaksh Sharma edited comment on HUDI-73 at 4/6/21, 5:01 AM: --- [~garyli] it

[jira] [Reopened] (HUDI-73) Support vanilla Avro Kafka Source in HoodieDeltaStreamer

2021-04-05 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-73?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma reopened HUDI-73: -- [~garyli1019] it is not resolved yet.  > Support vanilla Avro Kafka Source in HoodieDeltaStreamer >

[jira] [Updated] (HUDI-485) Check for where clause is wrong in HiveIncrementalPuller

2021-04-04 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-485: -- Status: Open (was: New) > Check for where clause is wrong in HiveIncrementalPuller >

[jira] [Updated] (HUDI-485) Check for where clause is wrong in HiveIncrementalPuller

2021-04-04 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-485: -- Status: In Progress (was: Open) > Check for where clause is wrong in HiveIncrementalPuller >

[jira] [Updated] (HUDI-485) Check for where clause is wrong in HiveIncrementalPuller

2021-04-04 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-485: -- Status: Patch Available (was: In Progress) > Check for where clause is wrong in

[jira] [Created] (HUDI-1760) Incorrect Documentation for HoodieWriteConfigs

2021-04-03 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-1760: -- Summary: Incorrect Documentation for HoodieWriteConfigs Key: HUDI-1760 URL: https://issues.apache.org/jira/browse/HUDI-1760 Project: Apache Hudi Issue

[jira] [Commented] (HUDI-1741) Row Level TTL Support for records stored in Hudi

2021-04-03 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17314297#comment-17314297 ] Pratyaksh Sharma commented on HUDI-1741: Guess the same can be handled with this Jira -

[jira] [Commented] (HUDI-1509) Major performance degradation due to rewriting records with default values

2021-01-11 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17262865#comment-17262865 ] Pratyaksh Sharma commented on HUDI-1509: [~nishith29] The PR was a generic one where point #2

[jira] [Updated] (HUDI-867) Graphite metrics are throwing IllegalArgumentException on continuous mode

2020-09-18 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-867: -- Status: Closed (was: Patch Available) > Graphite metrics are throwing IllegalArgumentException

[jira] [Commented] (HUDI-867) Graphite metrics are throwing IllegalArgumentException on continuous mode

2020-09-18 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17198296#comment-17198296 ] Pratyaksh Sharma commented on HUDI-867: --- Yep, closing.  > Graphite metrics are throwing

[jira] [Closed] (HUDI-1156) Remove unused dependencies from HoodieDeltaStreamerWrapper Class

2020-09-11 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma closed HUDI-1156. -- > Remove unused dependencies from HoodieDeltaStreamerWrapper Class >
