[jira] [Updated] (HUDI-3517) Unicode in partition path causes it to be resolved wrongly

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-3517: -- Sprint: 0.13.0 Final Sprint, 0.13.0 Final Sprint 2 (was: 0.13.0 Final Sprint, 0.13.0

[jira] [Updated] (HUDI-4700) RFC for primary key-less data model

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-4700: -- Sprint: 0.13.0 Final Sprint 2 (was: 0.13.0 Final Sprint 2, 0.13.0 Final Sprint 3) >

[jira] [Updated] (HUDI-5520) Fail MDT when list of log files grows unboundedly

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5520: -- Story Points: 2 (was: 1) > Fail MDT when list of log files grows unboundedly >

[jira] [Updated] (HUDI-3636) Clustering fails due to marker creation failure

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-3636: -- Story Points: 1 (was: 0) > Clustering fails due to marker creation failure >

[jira] [Updated] (HUDI-5520) Fail MDT when list of log files grows unboundedly

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5520: -- Story Points: 1 (was: 3) > Fail MDT when list of log files grows unboundedly >

[jira] [Assigned] (HUDI-5464) Fix instantiation of a new partition in MDT re-using the same instant time as a regular commit

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-5464: - Assignee: Raymond Xu (was: Alexey Kudinkin) > Fix instantiation of a new

[jira] [Updated] (HUDI-5570) Write tests for failed compaction retried w/ MDT able to serve just the required data

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5570: -- Sprint: 0.13.0 Final Sprint 3 > Write tests for failed compaction retried w/ MDT able

[jira] [Updated] (HUDI-5570) Write tests for failed compaction retried w/ MDT able to serve just the required data

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5570: -- Epic Link: HUDI-1292 > Write tests for failed compaction retried w/ MDT able to serve

[jira] [Updated] (HUDI-5570) Write tests for failed compaction retried w/ MDT able to serve just the required data

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5570: -- Story Points: 2 > Write tests for failed compaction retried w/ MDT able to serve just

[jira] [Updated] (HUDI-3775) Allow for offline compaction of MOR tables via spark streaming

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-3775: -- Story Points: 0 (was: 1) > Allow for offline compaction of MOR tables via spark

[jira] [Updated] (HUDI-5570) Write tests for failed compaction retried w/ MDT able to serve just the required data

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5570: -- Fix Version/s: 0.13.0 > Write tests for failed compaction retried w/ MDT able to serve

[jira] [Updated] (HUDI-5570) Write tests for failed compaction retried w/ MDT able to serve just the required data

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5570: -- Priority: Blocker (was: Major) > Write tests for failed compaction retried w/ MDT able

[jira] [Created] (HUDI-5570) Write tests for failed compaction retried w/ MDT able to serve just the required data

2023-01-17 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-5570: - Summary: Write tests for failed compaction retried w/ MDT able to serve just the required data Key: HUDI-5570 URL: https://issues.apache.org/jira/browse/HUDI-5570

[jira] [Assigned] (HUDI-5570) Write tests for failed compaction retried w/ MDT able to serve just the required data

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-5570: - Assignee: sivabalan narayanan > Write tests for failed compaction retried w/ MDT

[jira] [Updated] (HUDI-4911) Make sure LogRecordReader doesn't flush the cache before each lookup

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-4911: -- Story Points: 1 (was: 4) > Make sure LogRecordReader doesn't flush the cache before

[jira] [Updated] (HUDI-5408) Partially failed commits in MDT have to be rolled back in all cases

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5408: -- Story Points: 0 (was: 1) > Partially failed commits in MDT have to be rolled back in

[jira] [Updated] (HUDI-5407) Rollbacks in MDT is not effective

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5407: -- Story Points: 0 (was: 1) > Rollbacks in MDT is not effective >

[jira] [Updated] (HUDI-5433) Fix the way we deduce the pending instants for MDT writes

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5433: -- Story Points: 0 (was: 1) > Fix the way we deduce the pending instants for MDT writes >

[jira] [Updated] (HUDI-5463) Apply rollback commits from data table as rollbacks in MDT instead of Delta commit

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5463: -- Sprint: 0.13.0 Final Sprint (was: 0.13.0 Final Sprint, 0.13.0 Final Sprint 3) > Apply

[jira] [Updated] (HUDI-5463) Apply rollback commits from data table as rollbacks in MDT instead of Delta commit

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5463: -- Sprint: 0.13.0 Final Sprint (was: 0.13.0 Final Sprint, 0.13.0 Final Sprint 2) > Apply

[jira] [Updated] (HUDI-5463) Apply rollback commits from data table as rollbacks in MDT instead of Delta commit

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5463: -- Sprint: 0.13.0 Final Sprint, 0.13.0 Final Sprint 3 (was: 0.13.0 Final Sprint) > Apply

[jira] [Updated] (HUDI-5569) Files written by first commit/delta commit if it failed is detected as valid data files

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5569: -- Sprint: 0.13.0 Final Sprint 2 > Files written by first commit/delta commit if it failed

[jira] [Updated] (HUDI-5569) Files written by first commit/delta commit if it failed is detected as valid data files

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5569: -- Fix Version/s: 0.13.0 > Files written by first commit/delta commit if it failed is

[jira] [Assigned] (HUDI-5569) Files written by first commit/delta commit if it failed is detected as valid data files

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-5569: - Assignee: Jonathan Vexler > Files written by first commit/delta commit if it

[jira] [Updated] (HUDI-5569) Files written by first commit/delta commit if it failed is detected as valid data files

2023-01-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5569: -- Description: We have a method in HoodieFileGroup which detects whether a file group is

[jira] [Created] (HUDI-5569) Files written by first commit/delta commit if it failed is detected as valid data files

2023-01-17 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-5569: - Summary: Files written by first commit/delta commit if it failed is detected as valid data files Key: HUDI-5569 URL: https://issues.apache.org/jira/browse/HUDI-5569

[jira] [Created] (HUDI-5566) Add schema upgrade test w/ metadata payload

2023-01-16 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-5566: - Summary: Add schema upgrade test w/ metadata payload Key: HUDI-5566 URL: https://issues.apache.org/jira/browse/HUDI-5566 Project: Apache Hudi

[jira] [Updated] (HUDI-5520) Fail MDT when list of log files grows unboundedly

2023-01-12 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5520: -- Summary: Fail MDT when list of log files grows unboundedly (was: Fail MDT when list of

[jira] [Updated] (HUDI-5547) Add support to refresh FileSystem based schema provider for every batch w/ deltastreamer in continuous mode

2023-01-12 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5547: -- Fix Version/s: 0.13.0 > Add support to refresh FileSystem based schema provider for

[jira] [Created] (HUDI-5547) Add support to refresh FileSystem based schema provider for every batch w/ deltastreamer in continuous mode

2023-01-12 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-5547: - Summary: Add support to refresh FileSystem based schema provider for every batch w/ deltastreamer in continuous mode Key: HUDI-5547 URL:

[jira] [Updated] (HUDI-5537) Support partitionBy with dataframe apis

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5537: -- Epic Link: HUDI-4699 Story Points: 2 > Support partitionBy with dataframe apis >

[jira] [Updated] (HUDI-5535) Add support for keyless for all keygens(non partitioned, timestamp based key gen)

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5535: -- Fix Version/s: 0.13.0 > Add support for keyless for all keygens(non partitioned,

[jira] [Assigned] (HUDI-5537) Support partitionBy with dataframe apis

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-5537: - Assignee: Lokesh Jain > Support partitionBy with dataframe apis >

[jira] [Updated] (HUDI-5537) Support partitionBy with dataframe apis

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5537: -- Sprint: 0.13.0 Final Sprint 2 > Support partitionBy with dataframe apis >

[jira] [Created] (HUDI-5537) Support partitionBy with dataframe apis

2023-01-11 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-5537: - Summary: Support partitionBy with dataframe apis Key: HUDI-5537 URL: https://issues.apache.org/jira/browse/HUDI-5537 Project: Apache Hudi Issue

[jira] [Updated] (HUDI-5537) Support partitionBy with dataframe apis

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5537: -- Fix Version/s: 0.13.0 > Support partitionBy with dataframe apis >

[jira] [Created] (HUDI-5536) Support writing to hudi w/o any options

2023-01-11 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-5536: - Summary: Support writing to hudi w/o any options Key: HUDI-5536 URL: https://issues.apache.org/jira/browse/HUDI-5536 Project: Apache Hudi Issue

[jira] [Updated] (HUDI-5536) Support writing to hudi w/o any options

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5536: -- Epic Link: HUDI-4699 Story Points: 2 > Support writing to hudi w/o any options

[jira] [Assigned] (HUDI-5536) Support writing to hudi w/o any options

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-5536: - Assignee: Lokesh Jain > Support writing to hudi w/o any options >

[jira] [Updated] (HUDI-5536) Support writing to hudi w/o any options

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5536: -- Fix Version/s: 0.13.0 > Support writing to hudi w/o any options >

[jira] [Updated] (HUDI-5536) Support writing to hudi w/o any options

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5536: -- Sprint: 0.13.0 Final Sprint 2 > Support writing to hudi w/o any options >

[jira] [Assigned] (HUDI-5535) Add support for keyless for all keygens(non partitioned, timestamp based key gen)

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-5535: - Assignee: Lokesh Jain > Add support for keyless for all keygens(non partitioned,

[jira] [Updated] (HUDI-5535) Add support for keyless for all keygens(non partitioned, timestamp based key gen)

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5535: -- Sprint: 0.13.0 Final Sprint 2 > Add support for keyless for all keygens(non

[jira] [Created] (HUDI-5535) Add support for keyless for all keygens(non partitioned, timestamp based key gen)

2023-01-11 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-5535: - Summary: Add support for keyless for all keygens(non partitioned, timestamp based key gen) Key: HUDI-5535 URL: https://issues.apache.org/jira/browse/HUDI-5535

[jira] [Updated] (HUDI-5535) Add support for keyless for all keygens(non partitioned, timestamp based key gen)

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5535: -- Epic Link: HUDI-4699 Story Points: 3 > Add support for keyless for all

[jira] [Assigned] (HUDI-2681) Make hoodie record_key and preCombine_key optional

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-2681: - Assignee: Lokesh Jain (was: Yann Byron) > Make hoodie record_key and

[jira] [Assigned] (HUDI-4701) Support bulk insert without primary key and precombine field

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-4701: - Assignee: Lokesh Jain > Support bulk insert without primary key and precombine

[jira] [Commented] (HUDI-5523) Support force rollback to a history instant

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17675784#comment-17675784 ] sivabalan narayanan commented on HUDI-5523: --- It is feasible. We need to add savepoint and

[jira] [Updated] (HUDI-5433) Fix the way we deduce the pending instants for MDT writes

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5433: -- Reviewers: Ethan Guo (was: Raymond Xu) > Fix the way we deduce the pending instants

[jira] [Closed] (HUDI-5432) Fix adding back a log block w/ same commit time as previously rolled back one

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan closed HUDI-5432. - Resolution: Not A Problem > Fix adding back a log block w/ same commit time as previously

[jira] [Commented] (HUDI-5432) Fix adding back a log block w/ same commit time as previously rolled back one

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5432?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17675770#comment-17675770 ] sivabalan narayanan commented on HUDI-5432: --- not a problem. Verified using a test and manually

[jira] [Closed] (HUDI-5430) Fix multi-writer handling w/ rollback blocks in MOR table (log record reader)

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan closed HUDI-5430. - Resolution: Not A Problem > Fix multi-writer handling w/ rollback blocks in MOR table

[jira] [Commented] (HUDI-5430) Fix multi-writer handling w/ rollback blocks in MOR table (log record reader)

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17675769#comment-17675769 ] sivabalan narayanan commented on HUDI-5430: --- this is fixed with 5407 and 5408. There won't be

[jira] [Commented] (HUDI-5465) Fix compaction and rollback handling in MDT for multi-writer scenarios in DT

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17675767#comment-17675767 ] sivabalan narayanan commented on HUDI-5465: --- Will be fixed using 5433 patch. > Fix compaction

[jira] [Closed] (HUDI-5465) Fix compaction and rollback handling in MDT for multi-writer scenarios in DT

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan closed HUDI-5465. - Resolution: Duplicate > Fix compaction and rollback handling in MDT for multi-writer

[jira] [Closed] (HUDI-5169) Re-attempt failed rollback (regular commits, clustering) and get it to completion

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan closed HUDI-5169. - Resolution: Not A Problem > Re-attempt failed rollback (regular commits, clustering) and

[jira] [Commented] (HUDI-5465) Fix compaction and rollback handling in MDT for multi-writer scenarios in DT

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17675724#comment-17675724 ] sivabalan narayanan commented on HUDI-5465: --- if a compaction instant time is c50 and later we

[jira] [Closed] (HUDI-5532) Add a KeyGenerator to support a Keyless workflow

2023-01-11 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan closed HUDI-5532. - Resolution: Duplicate > Add a KeyGenerator to support a Keyless workflow >

[jira] [Created] (HUDI-5525) Test timestamp as of w/ archival beyond savepoint enabled

2023-01-10 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-5525: - Summary: Test timestamp as of w/ archival beyond savepoint enabled Key: HUDI-5525 URL: https://issues.apache.org/jira/browse/HUDI-5525 Project: Apache Hudi

[jira] [Updated] (HUDI-5434) Fix archival in MDT to not rely on rollbacks/clean in DT

2023-01-10 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5434: -- Reviewers: sivabalan narayanan > Fix archival in MDT to not rely on rollbacks/clean in

[jira] [Updated] (HUDI-4700) RFC for primary key-less data model

2023-01-10 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-4700: -- Story Points: 2 > RFC for primary key-less data model >

[jira] [Updated] (HUDI-4701) Support bulk insert without primary key and precombine field

2023-01-10 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-4701: -- Story Points: 2 > Support bulk insert without primary key and precombine field >

[jira] [Updated] (HUDI-2681) Make hoodie record_key and preCombine_key optional

2023-01-10 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2681: -- Story Points: 2 > Make hoodie record_key and preCombine_key optional >

[jira] [Updated] (HUDI-4700) RFC for primary key-less data model

2023-01-10 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-4700: -- Priority: Blocker (was: Major) > RFC for primary key-less data model >

[jira] [Updated] (HUDI-4701) Support bulk insert without primary key and precombine field

2023-01-10 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-4701: -- Priority: Blocker (was: Major) > Support bulk insert without primary key and

[jira] [Updated] (HUDI-2681) Make hoodie record_key and preCombine_key optional

2023-01-10 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2681: -- Priority: Blocker (was: Major) > Make hoodie record_key and preCombine_key optional >

[jira] [Updated] (HUDI-2681) Make hoodie record_key and preCombine_key optional

2023-01-10 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2681: -- Fix Version/s: 0.13.0 > Make hoodie record_key and preCombine_key optional >

[jira] [Updated] (HUDI-4700) RFC for primary key-less data model

2023-01-10 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-4700: -- Sprint: 0.13.0 Final Sprint 2 > RFC for primary key-less data model >

[jira] [Updated] (HUDI-4701) Support bulk insert without primary key and precombine field

2023-01-10 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-4701: -- Sprint: 0.13.0 Final Sprint 2 > Support bulk insert without primary key and precombine

[jira] [Updated] (HUDI-2681) Make hoodie record_key and preCombine_key optional

2023-01-10 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2681: -- Sprint: 0.13.0 Final Sprint 2 > Make hoodie record_key and preCombine_key optional >

[jira] [Updated] (HUDI-3775) Allow for offline compaction of MOR tables via spark streaming

2023-01-09 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-3775: -- Story Points: 1 (was: 2) > Allow for offline compaction of MOR tables via spark

[jira] [Updated] (HUDI-5349) Clean up partially failed restore if any

2023-01-09 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5349: -- Story Points: 1 > Clean up partially failed restore if any >

[jira] [Updated] (HUDI-5520) Fail MDT when list of log files grow > 1000

2023-01-09 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5520: -- Priority: Blocker (was: Critical) > Fail MDT when list of log files grow > 1000 >

[jira] [Updated] (HUDI-5408) Partially failed commits in MDT have to be rolled back in all cases

2023-01-09 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5408: -- Priority: Blocker (was: Critical) > Partially failed commits in MDT have to be rolled

[jira] [Updated] (HUDI-5408) Partially failed commits in MDT have to be rolled back in all cases

2023-01-09 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5408: -- Story Points: 1 (was: 2) > Partially failed commits in MDT have to be rolled back in

[jira] [Updated] (HUDI-5433) Fix the way we deduce the pending instants for MDT writes

2023-01-09 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5433: -- Story Points: 1 (was: 2) > Fix the way we deduce the pending instants for MDT writes >

[jira] [Updated] (HUDI-5075) Add support to rollback residual clustering after disabling clustering

2023-01-09 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5075: -- Story Points: 1 (was: 3) > Add support to rollback residual clustering after disabling

[jira] [Updated] (HUDI-3636) Clustering fails due to marker creation failure

2023-01-09 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-3636: -- Story Points: 1 (was: 2) > Clustering fails due to marker creation failure >

[jira] [Updated] (HUDI-5080) UnpersistRdds unpersist all rdds in the spark context

2023-01-09 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5080: -- Sprint: 2022/10/18, 2022/11/01, 2022/11/15, 2022/11/29, 2022/12/12 (was: 2022/10/18,

[jira] [Updated] (HUDI-5520) Fail MDT when list of log files grow > 1000

2023-01-09 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5520: -- Story Points: 3 > Fail MDT when list of log files grow > 1000 >

[jira] [Assigned] (HUDI-5520) Fail MDT when list of log files grow > 1000

2023-01-09 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-5520: - Assignee: Jonathan Vexler > Fail MDT when list of log files grow > 1000 >

[jira] [Updated] (HUDI-5520) Fail MDT when list of log files grow > 1000

2023-01-09 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5520: -- Epic Link: HUDI-1292 > Fail MDT when list of log files grow > 1000 >

[jira] [Updated] (HUDI-5520) Fail MDT when list of log files grow > 1000

2023-01-09 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5520: -- Sprint: 0.13.0 Final Sprint > Fail MDT when list of log files grow > 1000 >

[jira] [Created] (HUDI-5520) Fail MDT when list of log files grow > 1000

2023-01-09 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-5520: - Summary: Fail MDT when list of log files grow > 1000 Key: HUDI-5520 URL: https://issues.apache.org/jira/browse/HUDI-5520 Project: Apache Hudi

[jira] [Updated] (HUDI-5520) Fail MDT when list of log files grow > 1000

2023-01-09 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5520: -- Fix Version/s: 0.13.0 > Fail MDT when list of log files grow > 1000 >

[jira] [Updated] (HUDI-5451) Ensure switching "001" and "002" suffix for compaction and cleaning in MDT is backwards compatible

2023-01-09 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5451: -- Sprint: (was: 0.13.0 Final Sprint) > Ensure switching "001" and "002" suffix for

[jira] [Updated] (HUDI-5490) Investigate test failures w/ record level index for existing tests

2023-01-09 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5490: -- Sprint: (was: 0.13.0 Final Sprint) > Investigate test failures w/ record level index

[jira] [Updated] (HUDI-5298) Optimize WriteStatus storing HoodieRecord

2023-01-09 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5298: -- Sprint: (was: 0.13.0 Final Sprint) > Optimize WriteStatus storing HoodieRecord >

[jira] [Updated] (HUDI-5453) Ensure new fileId format is good across all code paths and backwards compatible

2023-01-09 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5453: -- Sprint: (was: 0.13.0 Final Sprint) > Ensure new fileId format is good across all code

[jira] [Updated] (HUDI-5446) Add support to write record level index to MDT

2023-01-09 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5446: -- Sprint: (was: 0.13.0 Final Sprint) > Add support to write record level index to MDT >

[jira] [Updated] (HUDI-5297) Deprecate InternalWriteStatus and re-use WriteStatus

2023-01-09 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5297: -- Sprint: (was: 0.13.0 Final Sprint) > Deprecate InternalWriteStatus and re-use

[jira] [Created] (HUDI-5514) Add support for auto generation of record keys for Hudi

2023-01-08 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-5514: - Summary: Add support for auto generation of record keys for Hudi Key: HUDI-5514 URL: https://issues.apache.org/jira/browse/HUDI-5514 Project: Apache Hudi

[jira] [Assigned] (HUDI-5514) Add support for auto generation of record keys for Hudi

2023-01-08 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-5514: - Assignee: sivabalan narayanan > Add support for auto generation of record keys

[jira] [Updated] (HUDI-5407) Rollbacks in MDT is not effective

2023-01-05 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5407: -- Status: Patch Available (was: In Progress) > Rollbacks in MDT is not effective >

[jira] [Updated] (HUDI-5407) Rollbacks in MDT is not effective

2023-01-05 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-5407: -- Status: In Progress (was: Open) > Rollbacks in MDT is not effective >

[jira] [Updated] (HUDI-4911) Make sure LogRecordReader doesn't flush the cache before each lookup

2023-01-05 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-4911: -- Reviewers: sivabalan narayanan (was: Sagar Sumit) > Make sure LogRecordReader doesn't

[jira] [Assigned] (HUDI-5293) Schema on read + reconcile schema fails w/ 0.12.1

2023-01-03 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-5293: - Assignee: Jonathan Vexler > Schema on read + reconcile schema fails w/ 0.12.1 >

[jira] [Assigned] (HUDI-5356) Call close on SparkRDDWriteClient several places

2023-01-03 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-5356: - Assignee: Jonathan Vexler > Call close on SparkRDDWriteClient several places >

[jira] [Assigned] (HUDI-5349) Clean up partially failed restore if any

2023-01-03 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-5349: - Assignee: Jonathan Vexler (was: sivabalan narayanan) > Clean up partially

[jira] [Closed] (HUDI-5370) Properly close file handles for Metadata writer

2023-01-03 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan closed HUDI-5370. - Resolution: Fixed > Properly close file handles for Metadata writer >
