[GitHub] [hudi] nmahmood630 commented on issue #2987: [SUPPORT] Only able to retrieve last _hoodie_commit_time

2021-05-25 Thread GitBox
nmahmood630 commented on issue #2987: URL: https://github.com/apache/hudi/issues/2987#issuecomment-848397902 I think the issue I am seeing is similar to: https://github.com/apache/hudi/issues/2002 where all the records commit time are getting updated since this table holds aggregation

[jira] [Created] (HUDI-1934) Update keyGenerator configuration docs

2021-05-25 Thread Xianghu Wang (Jira)
Xianghu Wang created HUDI-1934: -- Summary: Update keyGenerator configuration docs Key: HUDI-1934 URL: https://issues.apache.org/jira/browse/HUDI-1934 Project: Apache Hudi Issue Type: Sub-task

[GitHub] [hudi] codecov-commenter edited a comment on pull request #2993: [HUDI-1929] Make HoodieDeltaStreamer support configure KeyGenerator by type

2021-05-25 Thread GitBox
codecov-commenter edited a comment on pull request #2993: URL: https://github.com/apache/hudi/pull/2993#issuecomment-848384059 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [hudi] codecov-commenter edited a comment on pull request #2926: [HUDI-1879] Support Partition Prune For MergeOnRead Snapshot Table

2021-05-25 Thread GitBox
codecov-commenter edited a comment on pull request #2926: URL: https://github.com/apache/hudi/pull/2926#issuecomment-835303925 #

[jira] [Created] (HUDI-1935) Update Logger for FlatteningTransformer

2021-05-25 Thread Vinay (Jira)
Vinay created HUDI-1935: --- Summary: Update Logger for FlatteningTransformer Key: HUDI-1935 URL: https://issues.apache.org/jira/browse/HUDI-1935 Project: Apache Hudi Issue Type: Task

[jira] [Commented] (HUDI-1138) Re-implement marker files via timeline server

2021-05-25 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17351505#comment-17351505 ] Ethan Guo commented on HUDI-1138: - Here is my plan for improving the marker file mechanism: * Abstraction

[GitHub] [hudi] pengzhiwei2018 commented on a change in pull request #2926: [HUDI-1879] Support Partition Prune For MergeOnRead Snapshot Table

2021-05-25 Thread GitBox
pengzhiwei2018 commented on a change in pull request #2926: URL: https://github.com/apache/hudi/pull/2926#discussion_r639372337 ## File path: hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/hudi/MergeOnReadSnapshotRelation.scala ## @@ -131,15 +133,28 @@ class

[GitHub] [hudi] veenaypatil commented on pull request #2996: [HUDI-1935] Update Logger statement of FlatteningTransformer

2021-05-25 Thread GitBox
veenaypatil commented on pull request #2996: URL: https://github.com/apache/hudi/pull/2996#issuecomment-848469043 Unit tests for hudi-spark-client failed but this change should not be the cause of it. @wangxianghu @vinothchandar can you pls merge -- This is an automated message from

[GitHub] [hudi] codecov-commenter edited a comment on pull request #2996: [HUDI-1935] Update Logger statement of FlatteningTransformer

2021-05-25 Thread GitBox
codecov-commenter edited a comment on pull request #2996: URL: https://github.com/apache/hudi/pull/2996#issuecomment-848443240 #

[GitHub] [hudi] codecov-commenter edited a comment on pull request #2996: [HUDI-1935] Update Logger statement of FlatteningTransformer

2021-05-25 Thread GitBox
codecov-commenter edited a comment on pull request #2996: URL: https://github.com/apache/hudi/pull/2996#issuecomment-848443240 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[jira] [Updated] (HUDI-1935) Update Logger for FlatteningTransformer

2021-05-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-1935: - Labels: pull-request-available (was: ) > Update Logger for FlatteningTransformer >

[GitHub] [hudi] veenaypatil opened a new pull request #2996: [HUDI-1935] Update Logger statement of FlatteningTransformer

2021-05-25 Thread GitBox
veenaypatil opened a new pull request #2996: URL: https://github.com/apache/hudi/pull/2996 ## What is the purpose of the pull request This PR just updates the Logger statement as it was pointing to different class ## Brief change log Modify Logger statement of

[GitHub] [hudi] codecov-commenter edited a comment on pull request #2993: [HUDI-1929] Make HoodieDeltaStreamer support configure KeyGenerator by type

2021-05-25 Thread GitBox
codecov-commenter edited a comment on pull request #2993: URL: https://github.com/apache/hudi/pull/2993#issuecomment-848384059 #

[GitHub] [hudi] danny0405 commented on a change in pull request #2899: [HUDI-1865] Make embedded time line service singleton

2021-05-25 Thread GitBox
danny0405 commented on a change in pull request #2899: URL: https://github.com/apache/hudi/pull/2899#discussion_r639363503 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/embedded/EmbeddedTimelineServerHelper.java ## @@ -35,16 +35,22 @@

[GitHub] [hudi] codecov-commenter commented on pull request #2996: [HUDI-1935] Update Logger statement of FlatteningTransformer

2021-05-25 Thread GitBox
codecov-commenter commented on pull request #2996: URL: https://github.com/apache/hudi/pull/2996#issuecomment-848443240 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2996?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)

[GitHub] [hudi] codecov-commenter edited a comment on pull request #2993: [HUDI-1929] Make HoodieDeltaStreamer support configure KeyGenerator by type

2021-05-25 Thread GitBox
codecov-commenter edited a comment on pull request #2993: URL: https://github.com/apache/hudi/pull/2993#issuecomment-848384059 #

[GitHub] [hudi] codecov-commenter edited a comment on pull request #2926: [HUDI-1879] Support Partition Prune For MergeOnRead Snapshot Table

2021-05-25 Thread GitBox
codecov-commenter edited a comment on pull request #2926: URL: https://github.com/apache/hudi/pull/2926#issuecomment-835303925 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [hudi] codecov-commenter edited a comment on pull request #2993: [HUDI-1929] Make HoodieDeltaStreamer support configure KeyGenerator by type

2021-05-25 Thread GitBox
codecov-commenter edited a comment on pull request #2993: URL: https://github.com/apache/hudi/pull/2993#issuecomment-848384059 #

[GitHub] [hudi] pengzhiwei2018 commented on a change in pull request #2926: [HUDI-1879] Support Partition Prune For MergeOnRead Snapshot Table

2021-05-25 Thread GitBox
pengzhiwei2018 commented on a change in pull request #2926: URL: https://github.com/apache/hudi/pull/2926#discussion_r639362769 ## File path: hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/hudi/MergeOnReadSnapshotRelation.scala ## @@ -182,4 +197,98 @@ object

[GitHub] [hudi] pengzhiwei2018 commented on a change in pull request #2925: [HUDI-1879] Fix RO Tables Returning Snapshot Result

2021-05-25 Thread GitBox
pengzhiwei2018 commented on a change in pull request #2925: URL: https://github.com/apache/hudi/pull/2925#discussion_r639374959 ## File path: hudi-sync/hudi-hive-sync/src/main/java/org/apache/hudi/hive/HiveSyncTool.java ## @@ -194,11 +197,17 @@ private void syncSchema(String

[GitHub] [hudi] codecov-commenter edited a comment on pull request #2993: [HUDI-1929] Make HoodieDeltaStreamer support configure KeyGenerator by type

2021-05-25 Thread GitBox
codecov-commenter edited a comment on pull request #2993: URL: https://github.com/apache/hudi/pull/2993#issuecomment-848384059 #

[GitHub] [hudi] codecov-commenter edited a comment on pull request #2993: [HUDI-1929] Make HoodieDeltaStreamer support configure KeyGenerator by type

2021-05-25 Thread GitBox
codecov-commenter edited a comment on pull request #2993: URL: https://github.com/apache/hudi/pull/2993#issuecomment-848384059 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [hudi] nsivabalan merged pull request #2845: [HUDI-1723] Fix path selector listing files with the same mod date

2021-05-25 Thread GitBox
nsivabalan merged pull request #2845: URL: https://github.com/apache/hudi/pull/2845 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service,

[hudi] branch master updated: [HUDI-1723] Fix path selector listing files with the same mod date (#2845)

2021-05-25 Thread sivabalan
This is an automated email from the ASF dual-hosted git repository. sivabalan pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new afa6bc0 [HUDI-1723] Fix path selector listing

[GitHub] [hudi] sbernauer commented on pull request #2012: [HUDI-1129] Deltastreamer Add support for schema evolution

2021-05-25 Thread GitBox
sbernauer commented on pull request #2012: URL: https://github.com/apache/hudi/pull/2012#issuecomment-847796132 Sure @nsivabalan i will try out the fix in https://github.com/apache/hudi/pull/2927 and give feedback. Thanks for the invitation for slack, i appreciate! My memberId is

[jira] [Created] (HUDI-1931) BucketAssignFunction use wrong state

2021-05-25 Thread loukey_j (Jira)
loukey_j created HUDI-1931: -- Summary: BucketAssignFunction use wrong state Key: HUDI-1931 URL: https://issues.apache.org/jira/browse/HUDI-1931 Project: Apache Hudi Issue Type: Improvement

[GitHub] [hudi] nsivabalan edited a comment on issue #2992: [SUPPORT] Insert_Override Api not working as expected in Hudi 0.7.0

2021-05-25 Thread GitBox
nsivabalan edited a comment on issue #2992: URL: https://github.com/apache/hudi/issues/2992#issuecomment-847899578 @ayush71994 : 1. May I know which config you are referring to here "delete.duplicates"? Can you point me to full config from here

[GitHub] [hudi] nsivabalan commented on pull request #2012: [HUDI-1129] Deltastreamer Add support for schema evolution

2021-05-25 Thread GitBox
nsivabalan commented on pull request #2012: URL: https://github.com/apache/hudi/pull/2012#issuecomment-847846030 @sbernauer : sorry I might need your email to invite to apache hudi's slack workspace. -- This is an automated message from the Apache Git Service. To respond to the

[GitHub] [hudi] sbernauer commented on pull request #2012: [HUDI-1129] Deltastreamer Add support for schema evolution

2021-05-25 Thread GitBox
sbernauer commented on pull request #2012: URL: https://github.com/apache/hudi/pull/2012#issuecomment-847848890 No problem. It is bernaue...@web.de -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [hudi] nsivabalan commented on issue #2992: [SUPPORT] Insert_Override Api not working as expected in Hudi 0.7.0

2021-05-25 Thread GitBox
nsivabalan commented on issue #2992: URL: https://github.com/apache/hudi/issues/2992#issuecomment-847899578 @ayush71994 : May I know which config you are referring to here "delete.duplicates"? Can you point me to full config from here https://hudi.apache.org/docs/configurations.html

[GitHub] [hudi] wangxianghu opened a new pull request #2993: [HUDI-1929] Make HoodieDeltaStreamer support configure KeyGenerator by type

2021-05-25 Thread GitBox
wangxianghu opened a new pull request #2993: URL: https://github.com/apache/hudi/pull/2993 …y type ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.* ## What is

[jira] [Updated] (HUDI-1929) Make HoodieDeltaStreamer support configure KeyGenerator by type

2021-05-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-1929: - Labels: pull-request-available (was: ) > Make HoodieDeltaStreamer support configure KeyGenerator

[GitHub] [hudi] nsivabalan commented on pull request #2012: [HUDI-1129] Deltastreamer Add support for schema evolution

2021-05-25 Thread GitBox
nsivabalan commented on pull request #2012: URL: https://github.com/apache/hudi/pull/2012#issuecomment-847855344 @sbernauer @giaosudau @dirksan28 @sathyaprakashg : There are quite a few flows or use-cases in general wrt schema evolution. Would you mind helping us explain your use-case.

[GitHub] [hudi] nsivabalan edited a comment on issue #2992: [SUPPORT] Insert_Override Api not working as expected in Hudi 0.7.0

2021-05-25 Thread GitBox
nsivabalan edited a comment on issue #2992: URL: https://github.com/apache/hudi/issues/2992#issuecomment-847899578 @ayush71994 : May I know which config you are referring to here "delete.duplicates"? Can you point me to full config from here

[jira] [Commented] (HUDI-1668) GlobalSortPartitioner is getting called twice during bulk_insert.

2021-05-25 Thread Sugamber (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17351095#comment-17351095 ] Sugamber commented on HUDI-1668: [~nishith29] Yes, We can close this. Thank you!!! >

[GitHub] [hudi] nsivabalan edited a comment on pull request #2012: [HUDI-1129] Deltastreamer Add support for schema evolution

2021-05-25 Thread GitBox
nsivabalan edited a comment on pull request #2012: URL: https://github.com/apache/hudi/pull/2012#issuecomment-847855344 @sbernauer @giaosudau @dirksan28 @sathyaprakashg : There are quite a few flows or use-cases in general wrt schema evolution. Would you mind helping us explain your

[GitHub] [hudi] codecov-commenter edited a comment on pull request #2926: [HUDI-1879] Support Partition Prune For MergeOnRead Snapshot Table

2021-05-25 Thread GitBox
codecov-commenter edited a comment on pull request #2926: URL: https://github.com/apache/hudi/pull/2926#issuecomment-835303925 #

[jira] [Updated] (HUDI-1935) Update Logger for FlatteningTransformer

2021-05-25 Thread Vinay (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinay updated HUDI-1935: Status: In Progress (was: Open) > Update Logger for FlatteningTransformer >

[jira] [Created] (HUDI-1936) Introduce a optional property for conditional upsert

2021-05-25 Thread Biswajit mohapatra (Jira)
Biswajit mohapatra created HUDI-1936: Summary: Introduce a optional property for conditional upsert Key: HUDI-1936 URL: https://issues.apache.org/jira/browse/HUDI-1936 Project: Apache Hudi

[GitHub] [hudi] codecov-commenter edited a comment on pull request #2993: [HUDI-1929] Make HoodieDeltaStreamer support configure KeyGenerator by type

2021-05-25 Thread GitBox
codecov-commenter edited a comment on pull request #2993: URL: https://github.com/apache/hudi/pull/2993#issuecomment-848384059 #

[GitHub] [hudi] wangxianghu commented on pull request #2993: [HUDI-1929] Make HoodieDeltaStreamer support configure KeyGenerator by type

2021-05-25 Thread GitBox
wangxianghu commented on pull request #2993: URL: https://github.com/apache/hudi/pull/2993#issuecomment-848482449 @yanghua please take a look when free -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[jira] [Created] (HUDI-1930) Make Spark DataSource support configure KeyGenerator by type

2021-05-25 Thread Xianghu Wang (Jira)
Xianghu Wang created HUDI-1930: -- Summary: Make Spark DataSource support configure KeyGenerator by type Key: HUDI-1930 URL: https://issues.apache.org/jira/browse/HUDI-1930 Project: Apache Hudi

[GitHub] [hudi] ayush71994 opened a new issue #2992: [SUPPORT] Insert_Override Api not working as expected in Hudi 0.7.0

2021-05-25 Thread GitBox
ayush71994 opened a new issue #2992: URL: https://github.com/apache/hudi/issues/2992 **_Tips before filing an issue_** - Have you gone through our [FAQs](https://cwiki.apache.org/confluence/display/HUDI/FAQ)? - Yes. - Join the mailing list to engage in conversations and get

[GitHub] [hudi] yuzhaojing closed pull request #2988: support exactly once

2021-05-25 Thread GitBox
yuzhaojing closed pull request #2988: URL: https://github.com/apache/hudi/pull/2988 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service,

[GitHub] [hudi] danny0405 commented on pull request #2990: [HUDI-1923] Support restore state when commit failed

2021-05-25 Thread GitBox
danny0405 commented on pull request #2990: URL: https://github.com/apache/hudi/pull/2990#issuecomment-847623604 cc @garyli1019, i know that you are using the Flink hudi pipeline, maybe you should also take a review for this. I got an impression that we should add a config option for both

[GitHub] [hudi] codecov-commenter commented on pull request #2991: [HUDI-764] [HUDI-765] ORC reader and writer implementation

2021-05-25 Thread GitBox
codecov-commenter commented on pull request #2991: URL: https://github.com/apache/hudi/pull/2991#issuecomment-847672177 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2991?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)

[jira] [Created] (HUDI-1929) Make HoodieDeltaStreamer support configure KeyGenerator by type

2021-05-25 Thread Xianghu Wang (Jira)
Xianghu Wang created HUDI-1929: -- Summary: Make HoodieDeltaStreamer support configure KeyGenerator by type Key: HUDI-1929 URL: https://issues.apache.org/jira/browse/HUDI-1929 Project: Apache Hudi

[GitHub] [hudi] yuzhaojing opened a new pull request #2988: support exactly once

2021-05-25 Thread GitBox
yuzhaojing opened a new pull request #2988: URL: https://github.com/apache/hudi/pull/2988 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.* ## What is the purpose of the

[GitHub] [hudi] yuzhaojing opened a new pull request #2989: [HUDI-1441] Support restore state when commit failed

2021-05-25 Thread GitBox
yuzhaojing opened a new pull request #2989: URL: https://github.com/apache/hudi/pull/2989 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.* ## What is the purpose of the

[GitHub] [hudi] yuzhaojing closed pull request #2989: [HUDI-1441] Support restore state when commit failed

2021-05-25 Thread GitBox
yuzhaojing closed pull request #2989: URL: https://github.com/apache/hudi/pull/2989 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service,

[jira] [Updated] (HUDI-1923) Add state in StreamWriteFunction to restore

2021-05-25 Thread yuzhaojing (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuzhaojing updated HUDI-1923: - Description: In flink, notifyCheckpointComplete not in checkpoint life cycle. If a checkpoint is success

[GitHub] [hudi] am-cpp commented on issue #2992: [SUPPORT] Insert_Override Api not working as expected in Hudi 0.7.0

2021-05-25 Thread GitBox
am-cpp commented on issue #2992: URL: https://github.com/apache/hudi/issues/2992#issuecomment-847745284 The issue seems to be happening only when the **INSERT_DROP_DUPS_OPT_KEY** flag is set to **true**. Looks like this config is being used for both: 1. Pre-combining:

[GitHub] [hudi] jintaoguan opened a new pull request #2991: Hudi 57

2021-05-25 Thread GitBox
jintaoguan opened a new pull request #2991: URL: https://github.com/apache/hudi/pull/2991 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.* ## What is the purpose of the

[jira] [Updated] (HUDI-1920) Set "archived" as the default value of HOODIE_ARCHIVELOG_FOLDER_PROP_NAME

2021-05-25 Thread vinoyang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] vinoyang updated HUDI-1920: --- Fix Version/s: 0.9.0 > Set "archived" as the default value of HOODIE_ARCHIVELOG_FOLDER_PROP_NAME >

[hudi] branch master updated (aba1ead -> e702074)

2021-05-25 Thread vinoyang
This is an automated email from the ASF dual-hosted git repository. vinoyang pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from aba1ead [HUDI-1919] Type mismatch when streaming read copy_on_write table using flink (#2986) add e702074

[GitHub] [hudi] yanghua merged pull request #2978: [HUDI-1920] Set archived as the default value of HOODIE_ARCHIVELOG_FOLDER_PROP_NAME

2021-05-25 Thread GitBox
yanghua merged pull request #2978: URL: https://github.com/apache/hudi/pull/2978 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please

[jira] [Updated] (HUDI-1920) Set "archived" as the default value of HOODIE_ARCHIVELOG_FOLDER_PROP_NAME

2021-05-25 Thread vinoyang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] vinoyang updated HUDI-1920: --- Issue Type: Improvement (was: Task) > Set "archived" as the default value of

[GitHub] [hudi] jintaoguan closed pull request #2991: [HUDI-764] [HUDI-765] ORC reader and writer implementation

2021-05-25 Thread GitBox
jintaoguan closed pull request #2991: URL: https://github.com/apache/hudi/pull/2991 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service,

[GitHub] [hudi] codecov-commenter edited a comment on pull request #2902: [HUDI-1800] Exclude file slices in pending compaction when performing small file sizing

2021-05-25 Thread GitBox
codecov-commenter edited a comment on pull request #2902: URL: https://github.com/apache/hudi/pull/2902#issuecomment-829931431 #

[GitHub] [hudi] yuzhaojing opened a new pull request #2990: [HUDI-1923] Support restore state when commit failed

2021-05-25 Thread GitBox
yuzhaojing opened a new pull request #2990: URL: https://github.com/apache/hudi/pull/2990 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.* ## What is the purpose of the

[jira] [Updated] (HUDI-1923) Add state in StreamWriteFunction to restore

2021-05-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-1923: - Labels: pull-request-available (was: ) > Add state in StreamWriteFunction to restore >

[GitHub] [hudi] Tandoy commented on issue #143: Tracking ticket for folks to be added to slack group

2021-05-25 Thread GitBox
Tandoy commented on issue #143: URL: https://github.com/apache/hudi/issues/143#issuecomment-847629180 please add me to the slack group the email: tangzhi8...@gmail.com thanks a lot -- This is an automated message from the Apache Git Service. To respond to the message, please log on

[jira] [Closed] (HUDI-1920) Set "archived" as the default value of HOODIE_ARCHIVELOG_FOLDER_PROP_NAME

2021-05-25 Thread vinoyang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] vinoyang closed HUDI-1920. -- Resolution: Implemented e7020748b500e38838a8d84df64267a07b529aa7 > Set "archived" as the default value of

[GitHub] [hudi] nsivabalan commented on pull request #2923: [HUDI-1864] Added support for Date, Timestamp, LocalDate and LocalDateTime in TimestampBasedAvroKeyGenerator

2021-05-25 Thread GitBox
nsivabalan commented on pull request #2923: URL: https://github.com/apache/hudi/pull/2923#issuecomment-847933890 Can you check CI failure please. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [hudi] nsivabalan commented on pull request #2310: [HUDI-1444] fix rollback for emtpy partition table

2021-05-25 Thread GitBox
nsivabalan commented on pull request #2310: URL: https://github.com/apache/hudi/pull/2310#issuecomment-847950023 yeah, looks like it. closing it for now. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[GitHub] [hudi] nsivabalan edited a comment on pull request #2902: [HUDI-1800] Exclude file slices in pending compaction when performing small file sizing

2021-05-25 Thread GitBox
nsivabalan edited a comment on pull request #2902: URL: https://github.com/apache/hudi/pull/2902#issuecomment-847918561 Can you check why CI is failing. we can land once fixed. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub

[GitHub] [hudi] vinothchandar commented on pull request #2388: [HUDI-1353] add incremental timeline support for pending clustering ops

2021-05-25 Thread GitBox
vinothchandar commented on pull request #2388: URL: https://github.com/apache/hudi/pull/2388#issuecomment-848022526 @n3nash @satishkotha Any updates on this? generally love to get these follow ups from clustering over the fence if we can -- This is an automated message from the Apache

[GitHub] [hudi] vinothchandar merged pull request #2981: [HUDI-1922] Bulk insert with row writer supports mor table

2021-05-25 Thread GitBox
vinothchandar merged pull request #2981: URL: https://github.com/apache/hudi/pull/2981 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service,

[hudi] branch master updated (afa6bc0 -> 112732d)

2021-05-25 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from afa6bc0 [HUDI-1723] Fix path selector listing files with the same mod date (#2845) add 112732d [HUDI-1922]

[GitHub] [hudi] nsivabalan commented on a change in pull request #2923: [HUDI-1864] Added support for Date, Timestamp, LocalDate and LocalDateTime in TimestampBasedAvroKeyGenerator

2021-05-25 Thread GitBox
nsivabalan commented on a change in pull request #2923: URL: https://github.com/apache/hudi/pull/2923#discussion_r638867446 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/keygen/KeyGenUtils.java ## @@ -62,7 +62,7 @@ } else if

[GitHub] [hudi] vinothchandar commented on pull request #2378: [HUDI-1491] Support partition pruning for MOR snapshot query

2021-05-25 Thread GitBox
vinothchandar commented on pull request #2378: URL: https://github.com/apache/hudi/pull/2378#issuecomment-848021692 #2926 overlaps with this? @yui2010 , @pengzhiwei2018 any thoughts? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [hudi] vinothchandar commented on pull request #2899: [HUDI-1865] Make embedded time line service singleton

2021-05-25 Thread GitBox
vinothchandar commented on pull request #2899: URL: https://github.com/apache/hudi/pull/2899#issuecomment-848040019 @danny0405 we left this hanging a bit. Let me re-review this and get it landing in some form. -- This is an automated message from the Apache Git Service. To respond to

[GitHub] [hudi] nsivabalan commented on pull request #2902: [HUDI-1800] Exclude file slices in pending compaction when performing small file sizing

2021-05-25 Thread GitBox
nsivabalan commented on pull request #2902: URL: https://github.com/apache/hudi/pull/2902#issuecomment-847918561 Can you check why CI is failing. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [hudi] sbernauer edited a comment on pull request #2012: [HUDI-1129] Deltastreamer Add support for schema evolution

2021-05-25 Thread GitBox
sbernauer edited a comment on pull request #2012: URL: https://github.com/apache/hudi/pull/2012#issuecomment-847940734 Hi @nsivabalan, we have multiple schema versions of the events we consume. We use kafka and Confluent Schema Registry. I think all the events in kafka are written

[jira] [Reopened] (HUDI-1723) DFSPathSelector skips files with the same modify date when read up to source limit

2021-05-25 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reopened HUDI-1723: --- > DFSPathSelector skips files with the same modify date when read up to source > limit >

[jira] [Updated] (HUDI-1763) DefaultHoodieRecordPayload does not honor ordering value when records within multiple log files are merged

2021-05-25 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1763: -- Status: In Progress (was: Open) > DefaultHoodieRecordPayload does not honor ordering

[jira] [Updated] (HUDI-1763) DefaultHoodieRecordPayload does not honor ordering value when records within multiple log files are merged

2021-05-25 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1763: -- Status: Patch Available (was: In Progress) > DefaultHoodieRecordPayload does not honor

[GitHub] [hudi] n3nash commented on issue #2975: [SUPPORT] Read record using index

2021-05-25 Thread GitBox
n3nash commented on issue #2975: URL: https://github.com/apache/hudi/issues/2975#issuecomment-847968028 @fanaticjo Can you help @calleo since you recently implemented a custom recordpayload while using pyspark ? -- This is an automated message from the Apache Git Service. To respond to

[GitHub] [hudi] loukey-lj opened a new pull request #2994: Hudi 1931

2021-05-25 Thread GitBox
loukey-lj opened a new pull request #2994: URL: https://github.com/apache/hudi/pull/2994 org.apache.hudi.sink.partitioner.BucketAssignFunction#partitionLoadState and org.apache.hudi.sink.partitioner.BucketAssignFunction#indexState use wrong state, RowDataToHoodieFunction was keyby

[GitHub] [hudi] vinothchandar commented on a change in pull request #2926: [HUDI-1879] Support Partition Prune For MergeOnRead Snapshot Table

2021-05-25 Thread GitBox
vinothchandar commented on a change in pull request #2926: URL: https://github.com/apache/hudi/pull/2926#discussion_r638929143 ## File path: hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/hudi/MergeOnReadSnapshotRelation.scala ## @@ -131,15 +133,28 @@ class

[GitHub] [hudi] vinothchandar commented on a change in pull request #2903: [HUDI-1850] Fixing read of a empty table but with failed write

2021-05-25 Thread GitBox
vinothchandar commented on a change in pull request #2903: URL: https://github.com/apache/hudi/pull/2903#discussion_r638949244 ## File path: hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/hudi/DefaultSource.scala ## @@ -105,7 +105,9 @@ class DefaultSource extends

[jira] [Created] (HUDI-1932) Hive Sync should not always update last_commit_time_sync

2021-05-25 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-1932: Summary: Hive Sync should not always update last_commit_time_sync Key: HUDI-1932 URL: https://issues.apache.org/jira/browse/HUDI-1932 Project: Apache Hudi Issue

[GitHub] [hudi] vinothchandar commented on pull request #2496: [HUDI-1554] Introduced buffering for streams in HUDI.

2021-05-25 Thread GitBox
vinothchandar commented on pull request #2496: URL: https://github.com/apache/hudi/pull/2496#issuecomment-848039122 I have not been able to test this on S3. let me pick it up later next week. -- This is an automated message from the Apache Git Service. To respond to the message, please

[GitHub] [hudi] sbernauer commented on pull request #2012: [HUDI-1129] Deltastreamer Add support for schema evolution

2021-05-25 Thread GitBox
sbernauer commented on pull request #2012: URL: https://github.com/apache/hudi/pull/2012#issuecomment-847940734 Hi @nsivabalan, we have multiple schema versions of the events we consume. We use kafka and Confluent Schema Registry. I think all the events in kafka are written with

[jira] [Updated] (HUDI-1723) DFSPathSelector skips files with the same modify date when read up to source limit

2021-05-25 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1723: -- Status: Closed (was: Patch Available) > DFSPathSelector skips files with the same

[jira] [Resolved] (HUDI-1723) DFSPathSelector skips files with the same modify date when read up to source limit

2021-05-25 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-1723. --- Resolution: Fixed > DFSPathSelector skips files with the same modify date when read

[GitHub] [hudi] codecov-commenter edited a comment on pull request #2977: [HUDI-1763] Fixing honoring of Ordering val in DefaultHoodieRecordPayload.preCombine

2021-05-25 Thread GitBox
codecov-commenter edited a comment on pull request #2977: URL: https://github.com/apache/hudi/pull/2977#issuecomment-846016572 #

[GitHub] [hudi] codecov-commenter commented on pull request #2994: Hudi 1931

2021-05-25 Thread GitBox
codecov-commenter commented on pull request #2994: URL: https://github.com/apache/hudi/pull/2994#issuecomment-848024173 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2994?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)

[GitHub] [hudi] vinothchandar commented on a change in pull request #2899: [HUDI-1865] Make embedded time line service singleton

2021-05-25 Thread GitBox
vinothchandar commented on a change in pull request #2899: URL: https://github.com/apache/hudi/pull/2899#discussion_r638986316 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/embedded/EmbeddedTimelineServerHelper.java ## @@ -35,16 +35,22 @@

[GitHub] [hudi] codecov-commenter edited a comment on pull request #2899: [HUDI-1865] Make embedded time line service singleton

2021-05-25 Thread GitBox
codecov-commenter edited a comment on pull request #2899: URL: https://github.com/apache/hudi/pull/2899#issuecomment-829195458 #

[GitHub] [hudi] vaibhav-sinha commented on a change in pull request #2923: [HUDI-1864] Added support for Date, Timestamp, LocalDate and LocalDateTime in TimestampBasedAvroKeyGenerator

2021-05-25 Thread GitBox
vaibhav-sinha commented on a change in pull request #2923: URL: https://github.com/apache/hudi/pull/2923#discussion_r639042519 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/keygen/KeyGenUtils.java ## @@ -62,7 +62,7 @@ } else if

[GitHub] [hudi] vaibhav-sinha commented on pull request #2923: [HUDI-1864] Added support for Date, Timestamp, LocalDate and LocalDateTime in TimestampBasedAvroKeyGenerator

2021-05-25 Thread GitBox
vaibhav-sinha commented on pull request #2923: URL: https://github.com/apache/hudi/pull/2923#issuecomment-848098918 The tests were clean except for one test case failing before which I had fixed. But after merging the latest changes from master, I see a lot of tests failing and the errors

[GitHub] [hudi] rshanmugam1 opened a new issue #2609: [SUPPORT] Presto hudi query slow when compared to parquet

2021-05-25 Thread GitBox
rshanmugam1 opened a new issue #2609: URL: https://github.com/apache/hudi/issues/2609 **Describe the problem you faced** Presto query performance with hudi table takes ~2x extra time when compared to parquet for simple query . data stored in s3. hudi metadata store enabled. note,
