[jira] [Updated] (HUDI-8221) [Umbrella] RFC-78: Concurrent schema evolution detection

2024-09-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8221: - Labels: pull-request-available (was: ) > [Umbrella] RFC-78: Concurrent schema evolution detection

[jira] [Updated] (HUDI-6891) Read Optimized Queries should not use RLI

2024-09-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6891: - Labels: pull-request-available (was: ) > Read Optimized Queries should not use RLI >

[jira] [Updated] (HUDI-8077) Fix the incremental cleaning to base on completion time

2024-09-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8077: - Labels: pull-request-available (was: ) > Fix the incremental cleaning to base on completion time

[jira] [Updated] (HUDI-8218) Changing the Properties to Load From Both Default Path and Enviorment

2024-09-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8218: - Labels: pull-request-available (was: ) > Changing the Properties to Load From Both Default Path a

[jira] [Updated] (HUDI-8216) Json to Row conversion failing for Timestamp fields during serialization

2024-09-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8216: - Labels: pull-request-available (was: ) > Json to Row conversion failing for Timestamp fields duri

[jira] [Updated] (HUDI-8179) Support Flink 1.20 in Hudi

2024-09-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8179: - Labels: pull-request-available (was: ) > Support Flink 1.20 in Hudi > --

[jira] [Updated] (HUDI-8215) Support composition compaction strategy

2024-09-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8215: - Labels: pull-request-available (was: ) > Support composition compaction strategy > --

[jira] [Updated] (HUDI-8214) Support specify partitions with regex for compaction

2024-09-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8214: - Labels: pull-request-available (was: ) > Support specify partitions with regex for compaction > -

[jira] [Updated] (HUDI-8213) Exclude jackson-databind from hudi-spark-bundle to fix CVE-2017-17485

2024-09-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8213: - Labels: pull-request-available (was: ) > Exclude jackson-databind from hudi-spark-bundle to fix C

[jira] [Updated] (HUDI-8212) Add option for billing project id for BIG query sync

2024-09-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8212: - Labels: pull-request-available (was: ) > Add option for billing project id for BIG query sync > -

[jira] [Updated] (HUDI-8191) Fix MockStateInitializationContext#getKeyedStateStore to use non-KeyedStateStore

2024-09-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8191: - Labels: pull-request-available (was: ) > Fix MockStateInitializationContext#getKeyedStateStore to

[jira] [Updated] (HUDI-7563) Implement DROP INDEX support

2024-09-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7563: - Labels: hudi-1.0.0-beta2 pull-request-available (was: hudi-1.0.0-beta2) > Implement DROP INDEX su

[jira] [Updated] (HUDI-8201) [Umbrella] RFC-81 : Hoodie stand alone catalog

2024-09-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8201: - Labels: pull-request-available (was: ) > [Umbrella] RFC-81 : Hoodie stand alone catalog > ---

[jira] [Updated] (HUDI-8203) Make record merge mode the primary merging config

2024-09-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8203: - Labels: pull-request-available (was: ) > Make record merge mode the primary merging config >

[jira] [Updated] (HUDI-8023) Add multi-writer tests for indexes

2024-09-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8023: - Labels: pull-request-available (was: ) > Add multi-writer tests for indexes > ---

[jira] [Updated] (HUDI-8141) Hudi incremental query and source should use completion time as the checkpoint

2024-09-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8141: - Labels: pull-request-available (was: ) > Hudi incremental query and source should use completion

[jira] [Updated] (HUDI-8200) Add support to configure StorageViewType with HoodieMetadataTableValidator

2024-09-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8200: - Labels: pull-request-available (was: ) > Add support to configure StorageViewType with HoodieMeta

[jira] [Updated] (HUDI-8102) Ensure secondary index readable using the native hfile reader

2024-09-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8102: - Labels: pull-request-available (was: ) > Ensure secondary index readable using the native hfile r

[jira] [Updated] (HUDI-7928) Remove shared HFile reader in HoodieNativeAvroHFileReader

2024-09-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7928: - Labels: pull-request-available (was: ) > Remove shared HFile reader in HoodieNativeAvroHFileReade

[jira] [Updated] (HUDI-8183) Record key value is null if the specified field does not exist

2024-09-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8183: - Labels: pull-request-available (was: ) > Record key value is null if the specified field does not

[jira] [Updated] (HUDI-8197) Get rid of disable vectorized reader in sql config in spark fg reader implementation

2024-09-11 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8197: - Labels: pull-request-available (was: ) > Get rid of disable vectorized reader in sql config in sp

[jira] [Updated] (HUDI-6909) Handle `_hoodie_operation` field in the new HoodieFileGroupReader

2024-09-11 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6909: - Labels: pull-request-available (was: ) > Handle `_hoodie_operation` field in the new HoodieFileGr

[jira] [Updated] (HUDI-7848) Fix the Comparable type of the ordering field value stored in delete record

2024-09-11 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7848: - Labels: pull-request-available (was: ) > Fix the Comparable type of the ordering field value stor

[jira] [Updated] (HUDI-8187) Hudi 1.0 reader should be able to read both 1.0 and 0.x tables with custom key generator

2024-09-11 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8187: - Labels: pull-request-available (was: ) > Hudi 1.0 reader should be able to read both 1.0 and 0.x

[jira] [Updated] (HUDI-8190) Implement efficient streaming reads for HoodieDataBlocks

2024-09-11 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8190: - Labels: pull-request-available (was: ) > Implement efficient streaming reads for HoodieDataBlocks

[jira] [Updated] (HUDI-8188) Add validation for partition stats index in HoodieMetadataTableValidator

2024-09-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8188: - Labels: pull-request-available (was: ) > Add validation for partition stats index in HoodieMetada

[jira] [Updated] (HUDI-8185) Fix colstats collection when record type is SPARK

2024-09-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8185: - Labels: pull-request-available (was: ) > Fix colstats collection when record type is SPARK >

[jira] [Updated] (HUDI-8186) Fix empty meta sync class name

2024-09-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8186: - Labels: pull-request-available (was: ) > Fix empty meta sync class name > ---

[jira] [Updated] (HUDI-8184) Fix Hudi CLI 'version' command to return Hudi version

2024-09-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8184: - Labels: pull-request-available (was: ) > Fix Hudi CLI 'version' command to return Hudi version >

[jira] [Updated] (HUDI-8095) Remove Hadoop dependencies in hudi-client-common module

2024-09-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8095: - Labels: pull-request-available (was: ) > Remove Hadoop dependencies in hudi-client-common module

[jira] [Updated] (HUDI-7902) Partition fields in Table config should store partition field types for custom key generator

2024-09-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7902: - Labels: pull-request-available (was: ) > Partition fields in Table config should store partition

[jira] [Updated] (HUDI-8175) Fix LongWritable cannot be cast to TimestampWritable for MOR table with timestamp column and schema evolution enabled

2024-09-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8175: - Labels: pull-request-available (was: ) > Fix LongWritable cannot be cast to TimestampWritable for

[jira] [Updated] (HUDI-8171) Rename HoodieFilegroupReader to just fileslice reader

2024-09-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8171: - Labels: pull-request-available (was: ) > Rename HoodieFilegroupReader to just fileslice reader >

[jira] [Updated] (HUDI-8170) Add reader state class to remove state from the reader context

2024-09-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8170: - Labels: pull-request-available (was: ) > Add reader state class to remove state from the reader c

[jira] [Updated] (HUDI-8161) Make spark-sql command 'desc' independent from schema evolution config

2024-09-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8161: - Labels: pull-request-available (was: ) > Make spark-sql command 'desc' independent from schema ev

[jira] [Updated] (HUDI-8160) Verify the consistency of the user-defined schema and the existing hoodie scheme when creating the hoodie table

2024-09-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8160: - Labels: pull-request-available (was: ) > Verify the consistency of the user-defined schema and th

[jira] [Updated] (HUDI-8159) Use SerializableConfiguration with Spark broadcast

2024-08-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8159: - Labels: pull-request-available (was: ) > Use SerializableConfiguration with Spark broadcast > ---

[jira] [Updated] (HUDI-8103) Introduce table version specific grouping of table configs

2024-08-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8103: - Labels: pull-request-available (was: ) > Introduce table version specific grouping of table confi

[jira] [Updated] (HUDI-8111) Add support for validating last N file slices with HoodieMetadataValidator

2024-08-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8111: - Labels: pull-request-available (was: ) > Add support for validating last N file slices with Hoodi

[jira] [Updated] (HUDI-8137) Fix Time travel query for spark datasource read of MDT

2024-08-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8137: - Labels: pull-request-available (was: ) > Fix Time travel query for spark datasource read of MDT >

[jira] [Updated] (HUDI-8018) Parameterize most SQL tests for both table types

2024-08-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8018: - Labels: pull-request-available (was: ) > Parameterize most SQL tests for both table types > -

[jira] [Updated] (HUDI-8135) Limit meta client initializations in StreamSync

2024-08-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8135: - Labels: pull-request-available (was: ) > Limit meta client initializations in StreamSync > --

[jira] [Updated] (HUDI-6791) Integrate FileGroupReader with NewHoodieParquetFileFormat for Spark CDC Query

2024-08-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6791: - Labels: pull-request-available (was: ) > Integrate FileGroupReader with NewHoodieParquetFileForma

[jira] [Updated] (HUDI-8133) No easy way to append classpath in hudi hive sync

2024-08-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8133: - Labels: pull-request-available (was: ) > No easy way to append classpath in hudi hive sync >

[jira] [Updated] (HUDI-8134) build is broken if no spark profile provided on m1 mac

2024-08-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8134: - Labels: pull-request-available (was: ) > build is broken if no spark profile provided on m1 mac >

[jira] [Updated] (HUDI-8126) Optimise error table write

2024-08-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8126: - Labels: pull-request-available (was: ) > Optimise error table write > --

[jira] [Updated] (HUDI-5829) Optimize conversion from json to row format when sanitizing field names

2024-08-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-5829: - Labels: pull-request-available (was: ) > Optimize conversion from json to row format when sanitiz

[jira] [Updated] (HUDI-8125) Avoid processing nested json in MercifulJsonConverter when not required

2024-08-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8125: - Labels: pull-request-available (was: ) > Avoid processing nested json in MercifulJsonConverter wh

[jira] [Updated] (HUDI-8124) Allow reuse of meta client in Meta Sync path

2024-08-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8124: - Labels: pull-request-available (was: ) > Allow reuse of meta client in Meta Sync path > -

[jira] [Updated] (HUDI-8123) Fix MDT file listing to exclude non-existent log files in marker-based rollback

2024-08-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8123: - Labels: pull-request-available (was: ) > Fix MDT file listing to exclude non-existent log files i

[jira] [Updated] (HUDI-7696) Consolidate convertFilesToPartitionStatsRecords and convertMetadataToPartitionStatsRecords

2024-08-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7696: - Labels: pull-request-available (was: ) > Consolidate convertFilesToPartitionStatsRecords and > c

[jira] [Updated] (HUDI-8110) Throw an error for time travel query on MDT

2024-08-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8110: - Labels: pull-request-available (was: ) > Throw an error for time travel query on MDT > --

[jira] [Updated] (HUDI-8116) Coalesce Row Source Aliases with Schema Fields in S3/GCS

2024-08-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8116: - Labels: pull-request-available (was: ) > Coalesce Row Source Aliases with Schema Fields in S3/GCS

[jira] [Updated] (HUDI-8034) Support custom key generator with HoodieCatalogTable

2024-08-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8034: - Labels: pull-request-available (was: ) > Support custom key generator with HoodieCatalogTable > -

[jira] [Updated] (HUDI-8113) Improve HoodieActiveTimeline#revertCompleteToInflight parameter check

2024-08-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8113: - Labels: pull-request-available (was: ) > Improve HoodieActiveTimeline#revertCompleteToInflight pa

[jira] [Updated] (HUDI-8112) Fix TestHoodieActiveTimeline unit test missing test LogCompaction

2024-08-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8112: - Labels: pull-request-available (was: ) > Fix TestHoodieActiveTimeline unit test missing test LogC

[jira] [Updated] (HUDI-8078) Persisting writestatus to optimize the writes

2024-08-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8078: - Labels: pull-request-available spark (was: spark) > Persisting writestatus to optimize the writes

[jira] [Updated] (HUDI-8109) Fix the error related to locking during clustering/compaction when calling the file system in OSS.

2024-08-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8109: - Labels: pull-request-available (was: ) > Fix the error related to locking during clustering/compa

[jira] [Updated] (HUDI-8092) Replace Hadoop FileSystem and related usages with HoodieStorage abstraction in hudi-client-common module

2024-08-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8092: - Labels: pull-request-available (was: ) > Replace Hadoop FileSystem and related usages with Hoodie

[jira] [Updated] (HUDI-7916) Add tests on the integration of new file group reader with Hive

2024-08-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7916: - Labels: pull-request-available (was: ) > Add tests on the integration of new file group reader wi

[jira] [Updated] (HUDI-8106) Use the new filegroup reader for the metadata table

2024-08-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8106: - Labels: pull-request-available (was: ) > Use the new filegroup reader for the metadata table > --

[jira] [Updated] (HUDI-8105) Move MercifulJsonConverter to hudi-utilities package

2024-08-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8105: - Labels: pull-request-available (was: ) > Move MercifulJsonConverter to hudi-utilities package > -

[jira] [Updated] (HUDI-8097) Schema evolution setting from hudi-defaults.conf is ignored while altering column in Spark

2024-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8097: - Labels: pull-request-available (was: ) > Schema evolution setting from hudi-defaults.conf is igno

[jira] [Updated] (HUDI-8096) Improve OverwriteNonDefaultsWithLatestAvroPayload Java Doc

2024-08-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8096: - Labels: pull-request-available (was: ) > Improve OverwriteNonDefaultsWithLatestAvroPayload Java D

[jira] [Updated] (HUDI-8093) Replace Hadoop Configuration with StorageConfiguration in hudi-client-common module

2024-08-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8093: - Labels: pull-request-available (was: ) > Replace Hadoop Configuration with StorageConfiguration i

[jira] [Updated] (HUDI-8090) New zookeeper based lock provider

2024-08-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8090: - Labels: pull-request-available (was: ) > New zookeeper based lock provider >

[jira] [Updated] (HUDI-8089) Remove support for spark 2

2024-08-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8089: - Labels: pull-request-available (was: ) > Remove support for spark 2 > --

[jira] [Updated] (HUDI-8088) Fix documentation: filename of Externalized Config file

2024-08-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8088: - Labels: pull-request-available (was: ) > Fix documentation: filename of Externalized Config file

[jira] [Updated] (HUDI-8087) Prevent docker from starting in the build phase of integration tests

2024-08-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8087: - Labels: pull-request-available (was: ) > Prevent docker from starting in the build phase of integ

[jira] [Updated] (HUDI-8066) Cherrypick Flink 1.18 into hudi 0.14 branch

2024-08-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8066: - Labels: pull-request-available (was: ) > Cherrypick Flink 1.18 into hudi 0.14 branch > --

[jira] [Updated] (HUDI-8070) Support Flink 1.19 in Hudi

2024-08-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8070: - Labels: pull-request-available (was: ) > Support Flink 1.19 in Hudi > --

[jira] [Updated] (HUDI-8084) Support Sort Merge Join Compaction

2024-08-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8084: - Labels: pull-request-available (was: ) > Support Sort Merge Join Compaction > ---

[jira] [Updated] (HUDI-8083) Hudi table created with dataframe API becomes unwritable to INSERT queries due to config conflict

2024-08-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8083: - Labels: pull-request-available (was: ) > Hudi table created with dataframe API becomes unwritable

[jira] [Updated] (HUDI-8080) Get rid of separate reader instance for cdc reader

2024-08-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8080: - Labels: pull-request-available (was: ) > Get rid of separate reader instance for cdc reader > ---

[jira] [Updated] (HUDI-8079) Get rid of base file reader usage for count in filegroup reader parquet file format

2024-08-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8079: - Labels: pull-request-available (was: ) > Get rid of base file reader usage for count in filegroup

[jira] [Updated] (HUDI-5807) HoodieSparkParquetReader is not appending partition-path values

2024-08-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-5807: - Labels: hudi-1.0.0-beta2 pull-request-available (was: hudi-1.0.0-beta2) > HoodieSparkParquetReade

[jira] [Updated] (HUDI-8073) Create abstraction to maintain files in the the engine native format

2024-08-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8073: - Labels: pull-request-available (was: ) > Create abstraction to maintain files in the the engine n

[jira] [Updated] (HUDI-8071) Handle skew for user defined sort columns in BULK_INSERT

2024-08-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8071: - Labels: pull-request-available (was: ) > Handle skew for user defined sort columns in BULK_INSERT

[jira] [Updated] (HUDI-8068) Hook up source partitions to s3 incr source

2024-08-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8068?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8068: - Labels: pull-request-available (was: ) > Hook up source partitions to s3 incr source > --

[jira] [Updated] (HUDI-8067) Docker Compose V1 removed from 6.5.0-1025-azure

2024-08-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8067: - Labels: pull-request-available (was: ) > Docker Compose V1 removed from 6.5.0-1025-azure > --

[jira] [Updated] (HUDI-7947) [Umbrella] RFC-80 : Support column families for wide tables

2024-08-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7947: - Labels: hudi-umbrellas pull-request-available (was: hudi-umbrellas) > [Umbrella] RFC-80 : Support

[jira] [Updated] (HUDI-6948) HoodieAvroParquetReader sets configs wrong

2024-08-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6948: - Labels: pull-request-available (was: ) > HoodieAvroParquetReader sets configs wrong > ---

[jira] [Updated] (HUDI-8043) Fix dynamo db lock provider bug

2024-08-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8043: - Labels: pull-request-available (was: ) > Fix dynamo db lock provider bug > --

[jira] [Updated] (HUDI-7930) After Compaction RuntimeException: Unsupported type in the list: optional binary xxx (STRING)

2024-08-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7930: - Labels: pull-request-available (was: ) > After Compaction RuntimeException: Unsupported type in t

[jira] [Updated] (HUDI-8041) Support projection push down for lookup join

2024-08-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8041: - Labels: pull-request-available (was: ) > Support projection push down for lookup join > -

[jira] [Updated] (HUDI-8040) Fix SimpleConcurrentFileWritesConflictResolutionStrategy get pending clustering wrong

2024-08-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8040: - Labels: pull-request-available (was: ) > Fix SimpleConcurrentFileWritesConflictResolutionStrategy

[jira] [Updated] (HUDI-8038) dummy jira

2024-08-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8038: - Labels: pull-request-available (was: ) > dummy jira > -- > > Key: HUDI-80

[jira] [Updated] (HUDI-8037) Partition query for transformed value incorrectly prunes valid partitions

2024-08-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8037: - Labels: pull-request-available (was: ) > Partition query for transformed value incorrectly prunes

[jira] [Updated] (HUDI-8035) Fetching commit metadata from timeline fails during upgrade

2024-07-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8035: - Labels: pull-request-available (was: ) > Fetching commit metadata from timeline fails during upgr

[jira] [Updated] (HUDI-8036) Handle partition schema for custom key gen in SparkHoodieTableFileIndex

2024-07-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8036: - Labels: pull-request-available (was: ) > Handle partition schema for custom key gen in SparkHoodi

[jira] [Updated] (HUDI-8024) Test index updates and rollback

2024-07-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8024: - Labels: pull-request-available (was: ) > Test index updates and rollback > -

[jira] [Updated] (HUDI-8033) RFC-81: Log Compaction with Merge Sort

2024-07-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8033: - Labels: pull-request-available (was: ) > RFC-81: Log Compaction with Merge Sort > ---

[jira] [Updated] (HUDI-8032) Fix MockStateInitializationContext#getKeyedStateStore to use non-KeyedStateStore

2024-07-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8032: - Labels: pull-request-available (was: ) > Fix MockStateInitializationContext#getKeyedStateStore to

[jira] [Updated] (HUDI-8030) Fix for add missing table configurations to payloadProps

2024-07-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8030: - Labels: pull-request-available (was: ) > Fix for add missing table configurations to payloadProps

[jira] [Updated] (HUDI-8029) Implicit partition key dynamo db lock provider should enforce s3 prefix over s3 uri

2024-07-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8029: - Labels: pull-request-available (was: ) > Implicit partition key dynamo db lock provider should en

[jira] [Updated] (HUDI-7918) Remove support of Spark 2, 3.0, 3.1, and 3.2

2024-07-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7918: - Labels: pull-request-available (was: ) > Remove support of Spark 2, 3.0, 3.1, and 3.2 > -

[jira] [Updated] (HUDI-8026) Test multiple indexes creation and updates together

2024-07-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8026: - Labels: pull-request-available (was: ) > Test multiple indexes creation and updates together > --

[jira] [Updated] (HUDI-6191) Improve passing the debezium checkpoint values to start job from offset

2024-07-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6191: - Labels: pull-request-available (was: ) > Improve passing the debezium checkpoint values to start

[jira] [Updated] (HUDI-8016) LastSyncedTime is not updated for Snapshot table in Glue Sync

2024-07-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-8016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-8016: - Labels: pull-request-available (was: ) > LastSyncedTime is not updated for Snapshot table in Glue

[jira] [Updated] (HUDI-7993) Support pruning and skipping with meta fields

2024-07-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-7993: - Labels: pull-request-available (was: ) > Support pruning and skipping with meta fields >

  1   2   3   4   5   6   7   8   9   10   >