[GitHub] [hudi] danny0405 commented on pull request #4486: [HUDI-3132] Minor fixes for HoodieCatalog

2022-01-05 Thread GitBox
danny0405 commented on pull request #4486: URL: https://github.com/apache/hudi/pull/4486#issuecomment-1005478227 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [hudi] Carl-Zhou-CN commented on issue #4082: [SUPPORT] How to write multiple HUDi tables simultaneously in a Spark Streaming task?

2022-01-05 Thread GitBox
Carl-Zhou-CN commented on issue #4082: URL: https://github.com/apache/hudi/issues/4082#issuecomment-1005481900 @nsivabalan I was very interested in the way hudi was written 'But this would mean your writes are going through spark datasource write and not as streaming write.' Which way

[jira] [Resolved] (HUDI-3171) Sync empty table to hive metastore

2022-01-05 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen resolved HUDI-3171. -- > Sync empty table to hive metastore > -- > > Key:

[GitHub] [hudi] prashantwason commented on pull request #4212: [HUDI-2925] Fix duplicate cleaning of same files when unfinished clean operations are present.

2022-01-05 Thread GitBox
prashantwason commented on pull request #4212: URL: https://github.com/apache/hudi/pull/4212#issuecomment-1005489938 Responded to the comment above. Are there specific objections to the way this patch is implemented or the issue itself? I am open to any way to fix this issue. -- This

[GitHub] [hudi] prashantwason commented on a change in pull request #4449: [HUDI-2763] Metadata table records - support for key deduplication based on hardcoded key field

2022-01-05 Thread GitBox
prashantwason commented on a change in pull request #4449: URL: https://github.com/apache/hudi/pull/4449#discussion_r778648919 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/storage/HoodieHFileWriter.java ## @@ -77,6 +81,8 @@ public

[GitHub] [hudi] liujinhui1994 commented on issue #4027: [SUPPORT] Structured streaming Async clustering IndexOutOfBoundsException

2022-01-05 Thread GitBox
liujinhui1994 commented on issue #4027: URL: https://github.com/apache/hudi/issues/4027#issuecomment-1005506439 > thanks. sure. I currently emptied the historical data directory and ran it with the same code and found that there is no problem. thanks -- This is an automated

[GitHub] [hudi] xuranyang commented on issue #4082: [SUPPORT] How to write multiple HUDi tables simultaneously in a Spark Streaming task?

2022-01-05 Thread GitBox
xuranyang commented on issue #4082: URL: https://github.com/apache/hudi/issues/4082#issuecomment-1005488249 > @xuranyang : without further info, we can't do much here. Can you please let us know what exactly you are looking for. I am not an expert in structured streaming, but if you are

[GitHub] [hudi] hudi-bot commented on pull request #4512: [HUDI-3170] Do not preserve filename when preserveCommitMetadata enabled

2022-01-05 Thread GitBox
hudi-bot commented on pull request #4512: URL: https://github.com/apache/hudi/pull/4512#issuecomment-1005490941 ## CI report: * 88fed889b20d81fa71c156a7b8e87c2c3651de2f Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4512: [HUDI-3170] Do not preserve filename when preserveCommitMetadata enabled

2022-01-05 Thread GitBox
hudi-bot removed a comment on pull request #4512: URL: https://github.com/apache/hudi/pull/4512#issuecomment-1005458302 ## CI report: * 88fed889b20d81fa71c156a7b8e87c2c3651de2f Azure:

[GitHub] [hudi] yanenze removed a comment on issue #4419: [SUPPORT] Not An Avro File (flink)

2022-01-05 Thread GitBox
yanenze removed a comment on issue #4419: URL: https://github.com/apache/hudi/issues/4419#issuecomment-1005513986 > i used #4016 on release-0.10.0 , the problem does not reappear -- This is an automated message from the Apache Git Service. To respond to the message, please log

[GitHub] [hudi] yanenze commented on issue #4419: [SUPPORT] Not An Avro File (flink)

2022-01-05 Thread GitBox
yanenze commented on issue #4419: URL: https://github.com/apache/hudi/issues/4419#issuecomment-1005513986 > i used #4016 on release-0.10.0 , the problem does not reappear -- This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [hudi] yanenze commented on issue #4419: [SUPPORT] Not An Avro File (flink)

2022-01-05 Thread GitBox
yanenze commented on issue #4419: URL: https://github.com/apache/hudi/issues/4419#issuecomment-1005514624 > Hey @danny0405 : is this something different from #4016 or is the same. If its the same, can we close this issue since its already triaged and fixed. i use #4016 on

[GitHub] [hudi] boneanxs commented on issue #4474: [SUPPORT] Should we shade all aws dependencies to avoid class conflicts?

2022-01-05 Thread GitBox
boneanxs commented on issue #4474: URL: https://github.com/apache/hudi/issues/4474#issuecomment-1005478353 Looks flink-bundle already remove this...[HUDI-2803](https://issues.apache.org/jira/browse/HUDI-2803) -- This is an automated message from the Apache Git Service. To respond to the

[GitHub] [hudi] hudi-bot commented on pull request #4486: [HUDI-3132] Minor fixes for HoodieCatalog

2022-01-05 Thread GitBox
hudi-bot commented on pull request #4486: URL: https://github.com/apache/hudi/pull/4486#issuecomment-1005479040 ## CI report: * d96f3e5662350471fd8ff14c47f3daf12e5f151f Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4486: [HUDI-3132] Minor fixes for HoodieCatalog

2022-01-05 Thread GitBox
hudi-bot removed a comment on pull request #4486: URL: https://github.com/apache/hudi/pull/4486#issuecomment-1005451852 ## CI report: * d96f3e5662350471fd8ff14c47f3daf12e5f151f Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4486: [HUDI-3132] Minor fixes for HoodieCatalog

2022-01-05 Thread GitBox
hudi-bot commented on pull request #4486: URL: https://github.com/apache/hudi/pull/4486#issuecomment-1005515691 ## CI report: * d96f3e5662350471fd8ff14c47f3daf12e5f151f Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4486: [HUDI-3132] Minor fixes for HoodieCatalog

2022-01-05 Thread GitBox
hudi-bot removed a comment on pull request #4486: URL: https://github.com/apache/hudi/pull/4486#issuecomment-1005479040 ## CI report: * d96f3e5662350471fd8ff14c47f3daf12e5f151f Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4511: [HUDI-3171] Sync empty table to hive metastore

2022-01-05 Thread GitBox
hudi-bot commented on pull request #4511: URL: https://github.com/apache/hudi/pull/4511#issuecomment-1005463733 ## CI report: * 33f1af47efe9185c591280ea30932cbe970116a6 UNKNOWN * c01354222d525bae737c2db0455a86af500dd2c6 Azure:

[GitHub] [hudi] zhedoubushishi commented on issue #4474: [SUPPORT] Should we shade all aws dependencies to avoid class conflicts?

2022-01-05 Thread GitBox
zhedoubushishi commented on issue #4474: URL: https://github.com/apache/hudi/issues/4474#issuecomment-1005463939 Thanks for bringing up this issue. My initial idea is to relocate the aws jars with a Hudi prefix to avoid jar conflicts. If we just directly remove the shading for aws

[GitHub] [hudi] hudi-bot removed a comment on pull request #4511: [HUDI-3171] Sync empty table to hive metastore

2022-01-05 Thread GitBox
hudi-bot removed a comment on pull request #4511: URL: https://github.com/apache/hudi/pull/4511#issuecomment-1005430617 ## CI report: * 33f1af47efe9185c591280ea30932cbe970116a6 UNKNOWN * c01354222d525bae737c2db0455a86af500dd2c6 Azure:

[GitHub] [hudi] danny0405 merged pull request #4511: [HUDI-3171] Sync empty table to hive metastore

2022-01-05 Thread GitBox
danny0405 merged pull request #4511: URL: https://github.com/apache/hudi/pull/4511 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[jira] [Commented] (HUDI-3171) Sync empty table to hive metastore

2022-01-05 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17469126#comment-17469126 ] Danny Chen commented on HUDI-3171: -- Fixed via master branch: 0e297c0c4ca590e4d6b3050647d207e3e0b50912 >

[hudi] branch master updated (a66212d -> 0e297c0)

2022-01-05 Thread danny0405
This is an automated email from the ASF dual-hosted git repository. danny0405 pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from a66212d [HUDI-2966] Closing LogRecordScanner in compactor (#4478) add 0e297c0 [HUDI-3171] Sync empty table

[GitHub] [hudi] prashantwason commented on a change in pull request #4212: [HUDI-2925] Fix duplicate cleaning of same files when unfinished clean operations are present.

2022-01-05 Thread GitBox
prashantwason commented on a change in pull request #4212: URL: https://github.com/apache/hudi/pull/4212#discussion_r778638096 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/AbstractHoodieWriteClient.java ## @@ -708,15 +708,25 @@ public

[GitHub] [hudi] Guanpx commented on issue #4510: [SUPPORT] Impala query error

2022-01-05 Thread GitBox
Guanpx commented on issue #4510: URL: https://github.com/apache/hudi/issues/4510#issuecomment-1005520954 In hudi-MOR table type, HDFS path have some log file, that leads to impala read error; **use COW will be fine** because that HDFS path only have parquet file -- This is

[GitHub] [hudi] Guanpx closed issue #4510: [SUPPORT] Impala query error

2022-01-05 Thread GitBox
Guanpx closed issue #4510: URL: https://github.com/apache/hudi/issues/4510 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] prashantwason commented on pull request #4067: [WIP][HUDI-2763] Metadata table records key deduplication

2022-01-05 Thread GitBox
prashantwason commented on pull request #4067: URL: https://github.com/apache/hudi/pull/4067#issuecomment-1005521754 I prefer https://github.com/apache/hudi/pull/4449 and have left some comments there. -- This is an automated message from the Apache Git Service. To respond to the

[GitHub] [hudi] waywtdcc commented on issue #4508: [SUPPORT]Duplicate Flink Hudi data

2022-01-05 Thread GitBox
waywtdcc commented on issue #4508: URL: https://github.com/apache/hudi/issues/4508#issuecomment-1005529617 > try to set **index.state.ttl=0**(that default in [hudi-0](https://issues.apache.org/jira/browse/HUDI-0).10.0)https://hudi.apache.org/docs/configurations/#indexstatettl I

[GitHub] [hudi] Guanpx edited a comment on issue #4508: [SUPPORT]Duplicate Flink Hudi data

2022-01-05 Thread GitBox
Guanpx edited a comment on issue #4508: URL: https://github.com/apache/hudi/issues/4508#issuecomment-1005527061 try to set **index.state.ttl=0**(that default in hudi-0.10.0)https://hudi.apache.org/docs/configurations/#indexstatettl -- This is an automated message from the Apache Git

[jira] [Updated] (HUDI-3172) Refactor hudi existing modules to make more code reuse in V2 implementation

2022-01-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-3172: - Labels: pull-request-available (was: ) > Refactor hudi existing modules to make more code reuse

[GitHub] [hudi] leesf opened a new pull request #4514: [HUDI-3172] Refactor hudi existing modules to make more code reuse in…

2022-01-05 Thread GitBox
leesf opened a new pull request #4514: URL: https://github.com/apache/hudi/pull/4514 … V2 implementation ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.*

[jira] [Updated] (HUDI-2488) Support async metadata index creation while regular writers and table services are in progress

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-2488: -- Issue Type: New Feature (was: Task) > Support async metadata index creation while regular writers and

[jira] [Created] (HUDI-3174) Implement metadata filesystem view changes to support INDEX action type

2022-01-05 Thread Sagar Sumit (Jira)
Sagar Sumit created HUDI-3174: - Summary: Implement metadata filesystem view changes to support INDEX action type Key: HUDI-3174 URL: https://issues.apache.org/jira/browse/HUDI-3174 Project: Apache Hudi

[jira] [Created] (HUDI-3177) Support CREATE INDEX statement

2022-01-05 Thread Sagar Sumit (Jira)
Sagar Sumit created HUDI-3177: - Summary: Support CREATE INDEX statement Key: HUDI-3177 URL: https://issues.apache.org/jira/browse/HUDI-3177 Project: Apache Hudi Issue Type: Sub-task

[jira] [Updated] (HUDI-3173) Introduce new INDEX action type

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3173: -- Priority: Blocker (was: Major) > Introduce new INDEX action type > --- > >

[jira] [Updated] (HUDI-3174) Implement metadata filesystem view changes to support INDEX action type

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3174: -- Priority: Blocker (was: Major) > Implement metadata filesystem view changes to support INDEX action

[jira] [Updated] (HUDI-3173) Introduce new INDEX action type

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3173: -- Fix Version/s: 0.11.0 > Introduce new INDEX action type > --- > >

[jira] [Updated] (HUDI-3174) Implement metadata filesystem view changes to support INDEX action type

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3174: -- Fix Version/s: 0.11.0 > Implement metadata filesystem view changes to support INDEX action type >

[GitHub] [hudi] hudi-bot commented on pull request #4350: [HUDI-3047] Basic Implementation of Spark Datasource V2

2022-01-05 Thread GitBox
hudi-bot commented on pull request #4350: URL: https://github.com/apache/hudi/pull/4350#issuecomment-1005615105 ## CI report: * 5f2bceb6f745b359ba7b5691ef1f2ab02eddde06 UNKNOWN * 3855884f4791a45fa3a973e1e540e6988e863223 UNKNOWN * 78e8080c9d530e1e54799afbef69edb67394bb29

[GitHub] [hudi] hudi-bot removed a comment on pull request #4350: [HUDI-3047] Basic Implementation of Spark Datasource V2

2022-01-05 Thread GitBox
hudi-bot removed a comment on pull request #4350: URL: https://github.com/apache/hudi/pull/4350#issuecomment-1005564970 ## CI report: * 5f2bceb6f745b359ba7b5691ef1f2ab02eddde06 UNKNOWN * 3855884f4791a45fa3a973e1e540e6988e863223 UNKNOWN *

[jira] [Updated] (HUDI-3175) Support INDEX action in write client

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3175: -- Story Points: 5 (was: 3) > Support INDEX action in write client >

[jira] [Updated] (HUDI-3175) Support INDEX action in write client

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3175: -- Description: Add a new WriteOperationType and handle conflicts with concurrent writer or any other

[jira] [Updated] (HUDI-3173) Introduce new INDEX action type

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3173: -- Story Points: 5 (was: 3) > Introduce new INDEX action type > --- > >

[jira] [Updated] (HUDI-3177) Support CREATE INDEX statement

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3177: -- Fix Version/s: 0.11.0 > Support CREATE INDEX statement > -- > >

[jira] [Updated] (HUDI-3176) Add index commit metadata

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3176: -- Description: We need index request metadata at the time of index planning and index commit metadata at

[jira] [Updated] (HUDI-3177) Support CREATE INDEX statement

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3177: -- Priority: Blocker (was: Major) > Support CREATE INDEX statement > -- > >

[GitHub] [hudi] prashantwason commented on a change in pull request #4449: [HUDI-2763] Metadata table records - support for key deduplication based on hardcoded key field

2022-01-05 Thread GitBox
prashantwason commented on a change in pull request #4449: URL: https://github.com/apache/hudi/pull/4449#discussion_r778656822 ## File path: hudi-client/hudi-spark-client/src/test/java/org/apache/hudi/client/functional/TestHoodieBackedMetadata.java ## @@ -507,6 +519,255 @@

[jira] [Created] (HUDI-3172) Refactor hudi existing modules to make more code reuse in V2 implementation

2022-01-05 Thread leesf (Jira)
leesf created HUDI-3172: --- Summary: Refactor hudi existing modules to make more code reuse in V2 implementation Key: HUDI-3172 URL: https://issues.apache.org/jira/browse/HUDI-3172 Project: Apache Hudi

[jira] [Updated] (HUDI-2584) Unit tests for bloom filter index based out of metadata table.

2022-01-05 Thread Manoj Govindassamy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Govindassamy updated HUDI-2584: - Status: Patch Available (was: In Progress) > Unit tests for bloom filter index based out

[jira] [Commented] (HUDI-2584) Unit tests for bloom filter index based out of metadata table.

2022-01-05 Thread Manoj Govindassamy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17469154#comment-17469154 ] Manoj Govindassamy commented on HUDI-2584: -- PR https://github.com/manojpec/hudi/pull/6 > Unit

[jira] [Commented] (HUDI-2714) Benchmark MetaIndex performance w/ bloom and column stat metadata

2022-01-05 Thread Manoj Govindassamy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17469155#comment-17469155 ] Manoj Govindassamy commented on HUDI-2714: -- PR https://github.com/manojpec/hudi/pull/5 >

[jira] [Updated] (HUDI-2714) Benchmark MetaIndex performance w/ bloom and column stat metadata

2022-01-05 Thread Manoj Govindassamy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Govindassamy updated HUDI-2714: - Status: Patch Available (was: In Progress) > Benchmark MetaIndex performance w/ bloom

[GitHub] [hudi] hudi-bot removed a comment on pull request #4350: [HUDI-3047] Basic Implementation of Spark Datasource V2

2022-01-05 Thread GitBox
hudi-bot removed a comment on pull request #4350: URL: https://github.com/apache/hudi/pull/4350#issuecomment-1003907464 ## CI report: * 5f2bceb6f745b359ba7b5691ef1f2ab02eddde06 UNKNOWN * 3855884f4791a45fa3a973e1e540e6988e863223 UNKNOWN *

[GitHub] [hudi] hudi-bot commented on pull request #4350: [HUDI-3047] Basic Implementation of Spark Datasource V2

2022-01-05 Thread GitBox
hudi-bot commented on pull request #4350: URL: https://github.com/apache/hudi/pull/4350#issuecomment-1005535200 ## CI report: * 5f2bceb6f745b359ba7b5691ef1f2ab02eddde06 UNKNOWN * 3855884f4791a45fa3a973e1e540e6988e863223 UNKNOWN * 78e8080c9d530e1e54799afbef69edb67394bb29

[jira] [Updated] (HUDI-3170) Clustering preserve commit metadata retains filegroup id despite writes going to new filegroup

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3170: -- Sprint: Hudi-Sprint-Jan-3 > Clustering preserve commit metadata retains filegroup id despite writes

[jira] [Updated] (HUDI-3170) Clustering preserve commit metadata retains filegroup id despite writes going to new filegroup

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3170: -- Status: In Progress (was: Open) > Clustering preserve commit metadata retains filegroup id despite

[GitHub] [hudi] hudi-bot commented on pull request #4350: [HUDI-3047] Basic Implementation of Spark Datasource V2

2022-01-05 Thread GitBox
hudi-bot commented on pull request #4350: URL: https://github.com/apache/hudi/pull/4350#issuecomment-1005564970 ## CI report: * 5f2bceb6f745b359ba7b5691ef1f2ab02eddde06 UNKNOWN * 3855884f4791a45fa3a973e1e540e6988e863223 UNKNOWN * 78e8080c9d530e1e54799afbef69edb67394bb29

[GitHub] [hudi] hudi-bot removed a comment on pull request #4350: [HUDI-3047] Basic Implementation of Spark Datasource V2

2022-01-05 Thread GitBox
hudi-bot removed a comment on pull request #4350: URL: https://github.com/apache/hudi/pull/4350#issuecomment-1005537465 ## CI report: * 5f2bceb6f745b359ba7b5691ef1f2ab02eddde06 UNKNOWN * 3855884f4791a45fa3a973e1e540e6988e863223 UNKNOWN *

[jira] [Created] (HUDI-3176) Implement async indexer service

2022-01-05 Thread Sagar Sumit (Jira)
Sagar Sumit created HUDI-3176: - Summary: Implement async indexer service Key: HUDI-3176 URL: https://issues.apache.org/jira/browse/HUDI-3176 Project: Apache Hudi Issue Type: Sub-task

[jira] [Updated] (HUDI-3175) Support INDEX action in write client

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3175: -- Description: Add a new WriteOperationType and handle conflicts with concurrent writer or any other

[jira] [Updated] (HUDI-3176) Add index commit metadata

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3176: -- Description: We need index request metadata at the time of index planning and index commit metadata at

[GitHub] [hudi] hudi-bot commented on pull request #4514: [HUDI-3172] Refactor hudi existing modules to make more code reuse in…

2022-01-05 Thread GitBox
hudi-bot commented on pull request #4514: URL: https://github.com/apache/hudi/pull/4514#issuecomment-1005639286 ## CI report: * 5c4150f9d022cf55b41a1815316aba6a4e95f010 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4514: [HUDI-3172] Refactor hudi existing modules to make more code reuse in…

2022-01-05 Thread GitBox
hudi-bot removed a comment on pull request #4514: URL: https://github.com/apache/hudi/pull/4514#issuecomment-1005628427 ## CI report: * 5c4150f9d022cf55b41a1815316aba6a4e95f010 Azure:

[jira] [Updated] (HUDI-3172) Refactor hudi existing modules to make more code reuse in V2 implementation

2022-01-05 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] leesf updated HUDI-3172: Issue Type: Improvement (was: Bug) > Refactor hudi existing modules to make more code reuse in V2 implementation >

[GitHub] [hudi] Guanpx commented on issue #4508: [SUPPORT]Duplicate Flink Hudi data

2022-01-05 Thread GitBox
Guanpx commented on issue #4508: URL: https://github.com/apache/hudi/issues/4508#issuecomment-1005527061 try to set **index.state.ttl=0**(that default in hudi-0.10.0) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use

[GitHub] [hudi] vinishjail97 opened a new pull request #4513: Fixing null schema with empty commit in incremental relation

2022-01-05 Thread GitBox
vinishjail97 opened a new pull request #4513: URL: https://github.com/apache/hudi/pull/4513 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.* ## What is the

[GitHub] [hudi] hudi-bot commented on pull request #4513: Fixing null schema with empty commit in incremental relation

2022-01-05 Thread GitBox
hudi-bot commented on pull request #4513: URL: https://github.com/apache/hudi/pull/4513#issuecomment-1005535484 ## CI report: * 830716055649f558feca345d42452611389dd284 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure`

[GitHub] [hudi] hudi-bot commented on pull request #4513: Fixing null schema with empty commit in incremental relation

2022-01-05 Thread GitBox
hudi-bot commented on pull request #4513: URL: https://github.com/apache/hudi/pull/4513#issuecomment-1005537798 ## CI report: * 830716055649f558feca345d42452611389dd284 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4350: [HUDI-3047] Basic Implementation of Spark Datasource V2

2022-01-05 Thread GitBox
hudi-bot removed a comment on pull request #4350: URL: https://github.com/apache/hudi/pull/4350#issuecomment-1005535200 ## CI report: * 5f2bceb6f745b359ba7b5691ef1f2ab02eddde06 UNKNOWN * 3855884f4791a45fa3a973e1e540e6988e863223 UNKNOWN *

[GitHub] [hudi] hudi-bot commented on pull request #4350: [HUDI-3047] Basic Implementation of Spark Datasource V2

2022-01-05 Thread GitBox
hudi-bot commented on pull request #4350: URL: https://github.com/apache/hudi/pull/4350#issuecomment-1005537465 ## CI report: * 5f2bceb6f745b359ba7b5691ef1f2ab02eddde06 UNKNOWN * 3855884f4791a45fa3a973e1e540e6988e863223 UNKNOWN * 78e8080c9d530e1e54799afbef69edb67394bb29

[GitHub] [hudi] hudi-bot removed a comment on pull request #4513: Fixing null schema with empty commit in incremental relation

2022-01-05 Thread GitBox
hudi-bot removed a comment on pull request #4513: URL: https://github.com/apache/hudi/pull/4513#issuecomment-1005535484 ## CI report: * 830716055649f558feca345d42452611389dd284 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot

[GitHub] [hudi] forest455 commented on issue #4200: spark-sql query timestamp partition error

2022-01-05 Thread GitBox
forest455 commented on issue #4200: URL: https://github.com/apache/hudi/issues/4200#issuecomment-1005582925 Thanks for all your efforts. it works well now. Get Outlook for Android From: Sivabalan Narayanan ***@***.***>

[GitHub] [hudi] hudi-bot commented on pull request #4514: [HUDI-3172] Refactor hudi existing modules to make more code reuse in…

2022-01-05 Thread GitBox
hudi-bot commented on pull request #4514: URL: https://github.com/apache/hudi/pull/4514#issuecomment-1005599153 ## CI report: * 5c4150f9d022cf55b41a1815316aba6a4e95f010 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4514: [HUDI-3172] Refactor hudi existing modules to make more code reuse in…

2022-01-05 Thread GitBox
hudi-bot removed a comment on pull request #4514: URL: https://github.com/apache/hudi/pull/4514#issuecomment-1005597155 ## CI report: * 5c4150f9d022cf55b41a1815316aba6a4e95f010 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot

[jira] [Updated] (HUDI-3173) Introduce new INDEX action type

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3173: -- Story Points: 3 > Introduce new INDEX action type > --- > >

[jira] [Created] (HUDI-3175) Support INDEX action in write client

2022-01-05 Thread Sagar Sumit (Jira)
Sagar Sumit created HUDI-3175: - Summary: Support INDEX action in write client Key: HUDI-3175 URL: https://issues.apache.org/jira/browse/HUDI-3175 Project: Apache Hudi Issue Type: Sub-task

[jira] [Updated] (HUDI-3176) Add index commit metadata

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3176: -- Summary: Add index commit metadata (was: Implement async indexer service) > Add index commit metadata

[jira] [Updated] (HUDI-3175) Support INDEX action in write client

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3175: -- Fix Version/s: 0.11.0 > Support INDEX action in write client > > >

[jira] [Updated] (HUDI-3176) Add index commit metadata

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3176: -- Fix Version/s: 0.11.0 > Add index commit metadata > - > > Key:

[jira] [Updated] (HUDI-3175) Support INDEX action in write client

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3175: -- Priority: Blocker (was: Major) > Support INDEX action in write client >

[jira] [Updated] (HUDI-3176) Add index commit metadata

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3176: -- Priority: Blocker (was: Major) > Add index commit metadata > - > >

[jira] [Updated] (HUDI-3177) Support CREATE INDEX statement

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3177: -- Story Points: 2 > Support CREATE INDEX statement > -- > >

[jira] [Updated] (HUDI-3177) Support CREATE INDEX statement

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3177: -- Labels: 2 (was: ) > Support CREATE INDEX statement > -- > >

[jira] [Updated] (HUDI-3177) Support CREATE INDEX statement

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3177: -- Description: Users should be able to trigger index creation using CREATE INDEX statement for one or

[jira] [Updated] (HUDI-3177) Support CREATE INDEX statement

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3177: -- Labels: (was: 2) > Support CREATE INDEX statement > -- > >

[GitHub] [hudi] danny0405 commented on issue #4508: [SUPPORT]Duplicate Flink Hudi data

2022-01-05 Thread GitBox
danny0405 commented on issue #4508: URL: https://github.com/apache/hudi/issues/4508#issuecomment-1005526167 > > Do you setup the state ttl already ? > > I didn't set TTL related parameters what version of hudi did you use ? In 0.9 it is 1.5 days as default. -- This is an

[GitHub] [hudi] hudi-bot commented on pull request #4514: [HUDI-3172] Refactor hudi existing modules to make more code reuse in…

2022-01-05 Thread GitBox
hudi-bot commented on pull request #4514: URL: https://github.com/apache/hudi/pull/4514#issuecomment-1005601132 ## CI report: * 5c4150f9d022cf55b41a1815316aba6a4e95f010 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4514: [HUDI-3172] Refactor hudi existing modules to make more code reuse in…

2022-01-05 Thread GitBox
hudi-bot removed a comment on pull request #4514: URL: https://github.com/apache/hudi/pull/4514#issuecomment-1005599153 ## CI report: * 5c4150f9d022cf55b41a1815316aba6a4e95f010 Azure:

[jira] [Created] (HUDI-3173) Introduce new INDEX action type

2022-01-05 Thread Sagar Sumit (Jira)
Sagar Sumit created HUDI-3173: - Summary: Introduce new INDEX action type Key: HUDI-3173 URL: https://issues.apache.org/jira/browse/HUDI-3173 Project: Apache Hudi Issue Type: Sub-task

[GitHub] [hudi] hudi-bot commented on pull request #4514: [HUDI-3172] Refactor hudi existing modules to make more code reuse in…

2022-01-05 Thread GitBox
hudi-bot commented on pull request #4514: URL: https://github.com/apache/hudi/pull/4514#issuecomment-1005622609 ## CI report: * 5c4150f9d022cf55b41a1815316aba6a4e95f010 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4514: [HUDI-3172] Refactor hudi existing modules to make more code reuse in…

2022-01-05 Thread GitBox
hudi-bot removed a comment on pull request #4514: URL: https://github.com/apache/hudi/pull/4514#issuecomment-1005601132 ## CI report: * 5c4150f9d022cf55b41a1815316aba6a4e95f010 Azure:

[jira] [Assigned] (HUDI-3172) Refactor hudi existing modules to make more code reuse in V2 implementation

2022-01-05 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] leesf reassigned HUDI-3172: --- Assignee: leesf > Refactor hudi existing modules to make more code reuse in V2 implementation >

[jira] [Assigned] (HUDI-3140) Fix bulk_insert failure on Spark 3.2.0

2022-01-05 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] leesf reassigned HUDI-3140: --- Assignee: leesf > Fix bulk_insert failure on Spark 3.2.0 > -- > >

[jira] [Updated] (HUDI-3140) Fix bulk_insert failure on Spark 3.2.0

2022-01-05 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] leesf updated HUDI-3140: Fix Version/s: 0.11.0 > Fix bulk_insert failure on Spark 3.2.0 > -- > >

[jira] [Resolved] (HUDI-3140) Fix bulk_insert failure on Spark 3.2.0

2022-01-05 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] leesf resolved HUDI-3140. - > Fix bulk_insert failure on Spark 3.2.0 > -- > > Key: HUDI-3140

[jira] [Closed] (HUDI-3140) Fix bulk_insert failure on Spark 3.2.0

2022-01-05 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] leesf closed HUDI-3140. --- > Fix bulk_insert failure on Spark 3.2.0 > -- > > Key: HUDI-3140 >

[GitHub] [hudi] hudi-bot removed a comment on pull request #4513: Fixing null schema with empty commit in incremental relation

2022-01-05 Thread GitBox
hudi-bot removed a comment on pull request #4513: URL: https://github.com/apache/hudi/pull/4513#issuecomment-1005537798 ## CI report: * 830716055649f558feca345d42452611389dd284 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4513: Fixing null schema with empty commit in incremental relation

2022-01-05 Thread GitBox
hudi-bot commented on pull request #4513: URL: https://github.com/apache/hudi/pull/4513#issuecomment-1005580582 ## CI report: * 830716055649f558feca345d42452611389dd284 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4514: [HUDI-3172] Refactor hudi existing modules to make more code reuse in…

2022-01-05 Thread GitBox
hudi-bot commented on pull request #4514: URL: https://github.com/apache/hudi/pull/4514#issuecomment-1005597155 ## CI report: * 5c4150f9d022cf55b41a1815316aba6a4e95f010 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure`

[jira] [Updated] (HUDI-3174) Implement metadata filesystem view changes to support INDEX action type

2022-01-05 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-3174: -- Description: Handle pending index action while listing partitions. > Implement metadata filesystem view

  1   2   3   4   >