[GitHub] [hudi] MINCWANG commented on pull request #2828: Remove the com.google.guave jar from hudi-flink-bundle to avoid conflicts.

2021-04-14 Thread GitBox
MINCWANG commented on pull request #2828: URL: https://github.com/apache/hudi/pull/2828#issuecomment-820136034 > @MINCWANG Thanks for your contribution. Would you please file a jira issue to track this change? @yanghua OK, This jira address:

[jira] [Closed] (HUDI-1798) Flink streaming reader should always monitor the delta commits files

2021-04-14 Thread vinoyang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] vinoyang closed HUDI-1798. -- Resolution: Fixed 6d1aec604fd854dc0fb27a4c6aa6113ae771c7e1 > Flink streaming reader should always monitor the

[hudi] branch master updated (62bb9e1 -> 6d1aec6)

2021-04-14 Thread vinoyang
This is an automated email from the ASF dual-hosted git repository. vinoyang pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from 62bb9e1 [Hotfix][utilities] Optimized codes (#2821) add 6d1aec6 [HUDI-1798] Flink streaming reader should

[GitHub] [hudi] yanghua merged pull request #2825: [HUDI-1798] Flink streaming reader should always monitor the delta commits files

2021-04-14 Thread GitBox
yanghua merged pull request #2825: URL: https://github.com/apache/hudi/pull/2825 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please

[GitHub] [hudi] yanghua commented on pull request #2828: Remove the com.google.guave jar from hudi-flink-bundle to avoid conflicts.

2021-04-14 Thread GitBox
yanghua commented on pull request #2828: URL: https://github.com/apache/hudi/pull/2828#issuecomment-820132727 @MINCWANG Thanks for your contribution. Would you please file a jira issue to track this change? -- This is an automated message from the Apache Git Service. To respond to the

[GitHub] [hudi] codecov-io edited a comment on pull request #2645: [HUDI-1659] Basic Implementation Of Spark Sql Support

2021-04-14 Thread GitBox
codecov-io edited a comment on pull request #2645: URL: https://github.com/apache/hudi/pull/2645#issuecomment-792430670 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2645?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)

[GitHub] [hudi] codecov-io edited a comment on pull request #2645: [HUDI-1659] Basic Implementation Of Spark Sql Support

2021-04-14 Thread GitBox
codecov-io edited a comment on pull request #2645: URL: https://github.com/apache/hudi/pull/2645#issuecomment-792430670 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2645?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)

[GitHub] [hudi] codecov-io commented on pull request #2828: Remove the com.google.guave jar from hudi-flink-bundle to avoid conflicts.

2021-04-14 Thread GitBox
codecov-io commented on pull request #2828: URL: https://github.com/apache/hudi/pull/2828#issuecomment-820108781 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2828?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation) Report

[GitHub] [hudi] codecov-io edited a comment on pull request #2645: [HUDI-1659] Basic Implementation Of Spark Sql Support

2021-04-14 Thread GitBox
codecov-io edited a comment on pull request #2645: URL: https://github.com/apache/hudi/pull/2645#issuecomment-792430670 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2645?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)

[GitHub] [hudi] MINCWANG opened a new pull request #2828: Remove the com.google.guave jar from hudi-flink-bundle to avoid conflicts.

2021-04-14 Thread GitBox
MINCWANG opened a new pull request #2828: URL: https://github.com/apache/hudi/pull/2828 Remove the com.google.guava jar from hudi-flink-bundle to avoid conflicts. ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review

[jira] [Commented] (HUDI-1797) Shade google guava for hudi-flink-bundle jar

2021-04-14 Thread WangMinChao (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17321906#comment-17321906 ] WangMinChao commented on HUDI-1797: --- _Remove the com.google.guava jar from hudi-flink-bundle to avoid

[GitHub] [hudi] vinothchandar commented on pull request #2821: [Hotfix][utilities] Optimized codes

2021-04-14 Thread GitBox
vinothchandar commented on pull request #2821: URL: https://github.com/apache/hudi/pull/2821#issuecomment-820091027 Can we do a `[MINOR]` tag consistent with our current processes? Happy to discuss a `HOTFIX` label on the dev list and add to docs, if one of you are interested in making

[jira] [Updated] (HUDI-1797) Shade google guava for hudi-flink-bundle jar

2021-04-14 Thread WangMinChao (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] WangMinChao updated HUDI-1797: -- Attachment: screenshot-1.png > Shade google guava for hudi-flink-bundle jar >

[GitHub] [hudi] QiAnXinCodeSafe opened a new issue #2827: There is a vulnerability in hive 2.3.1 ,upgrade recommended

2021-04-14 Thread GitBox
QiAnXinCodeSafe opened a new issue #2827: URL: https://github.com/apache/hudi/issues/2827 https://github.com/apache/hudi/blob/62bb9e10d9d2f2a9807ee46b0ed094ef2fcc89e5/pom.xml#L103 CVE-2018-1282 CVE-2018-11777 CVE-2018-1314 CVE-2020-1926 Recommended upgrade version:2.3.8

[GitHub] [hudi] QiAnXinCodeSafe opened a new issue #2826: There is a vulnerability in jackson.databind 2.6.7.3,upgrade recommended

2021-04-14 Thread GitBox
QiAnXinCodeSafe opened a new issue #2826: URL: https://github.com/apache/hudi/issues/2826 https://github.com/apache/hudi/blob/62bb9e10d9d2f2a9807ee46b0ed094ef2fcc89e5/pom.xml#L87 CVE-2020-9547 CVE-2019-20330 CVE-2019-16942 CVE-2020-8840 Recommended upgrade version:2.9.10.8

[jira] [Updated] (HUDI-1798) Flink streaming reader should always monitor the delta commits files

2021-04-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-1798: - Labels: pull-request-available (was: ) > Flink streaming reader should always monitor the delta

[GitHub] [hudi] MyLanPangzi opened a new pull request #2825: [HUDI-1798] Flink streaming reader should always monitor the delta commits files

2021-04-14 Thread GitBox
MyLanPangzi opened a new pull request #2825: URL: https://github.com/apache/hudi/pull/2825 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.* ## What is the purpose of

[GitHub] [hudi] QiAnXinCodeSafe opened a new issue #2824: There is a vulnerability in hadoop 2.7.3 ,upgrade recommended

2021-04-14 Thread GitBox
QiAnXinCodeSafe opened a new issue #2824: URL: https://github.com/apache/hudi/issues/2824 https://github.com/apache/hudi/blob/62bb9e10d9d2f2a9807ee46b0ed094ef2fcc89e5/pom.xml#L101 CVE-2017-15718 CVE-2020-9492 CVE-2018-8009 CVE-2016-6811 CVE-2018-8029 Recommended upgrade

[jira] [Assigned] (HUDI-1798) Flink streaming reader should always monitor the delta commits files

2021-04-14 Thread Jira
[ https://issues.apache.org/jira/browse/HUDI-1798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] 谢波 reassigned HUDI-1798: Assignee: 谢波 > Flink streaming reader should always monitor the delta commits files >

[jira] [Created] (HUDI-1798) Flink streaming reader should always monitor the delta commits files

2021-04-14 Thread Danny Chen (Jira)
Danny Chen created HUDI-1798: Summary: Flink streaming reader should always monitor the delta commits files Key: HUDI-1798 URL: https://issues.apache.org/jira/browse/HUDI-1798 Project: Apache Hudi

[GitHub] [hudi] QiAnXinCodeSafe opened a new issue #2823: There is a vulnerability in log4j 1.2.17 ,upgrade recommended

2021-04-14 Thread GitBox
QiAnXinCodeSafe opened a new issue #2823: URL: https://github.com/apache/hudi/issues/2823 https://github.com/apache/hudi/blob/62bb9e10d9d2f2a9807ee46b0ed094ef2fcc89e5/pom.xml#L98 CVE-2019-17571 CVE-2020-9488 Recommended upgrade version:2.13.2 -- This is an

[GitHub] [hudi] vinothchandar commented on pull request #2136: [HUDI-37][WIP] Persist the HoodieIndex type in the hoodie.properties file

2021-04-14 Thread GitBox
vinothchandar commented on pull request #2136: URL: https://github.com/apache/hudi/pull/2136#issuecomment-820013737 Closing this PR, in favor of new approach. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

[GitHub] [hudi] vinothchandar closed pull request #2136: [HUDI-37][WIP] Persist the HoodieIndex type in the hoodie.properties file

2021-04-14 Thread GitBox
vinothchandar closed pull request #2136: URL: https://github.com/apache/hudi/pull/2136 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service,

[jira] [Closed] (HUDI-1795) allow ExternalSpillMap use accurate payload size rather than estimated

2021-04-14 Thread ZiyueGuan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ZiyueGuan closed HUDI-1795. --- Resolution: Duplicate > allow ExternalSpillMap use accurate payload size rather than estimated >

[GitHub] [hudi] pengzhiwei2018 commented on pull request #2645: [HUDI-1659] Basic Implementation Of Spark Sql Support

2021-04-14 Thread GitBox
pengzhiwei2018 commented on pull request #2645: URL: https://github.com/apache/hudi/pull/2645#issuecomment-820006638 > @pengzhiwei2018 are the tests passing? Hi @vinothchandar , It seems that there is a leak of `KafkaTestUtils` in `TestHoodieDeltaStreamer`. This can happen by

[GitHub] [hudi] NickYoungPeng commented on a change in pull request #2744: [HUDI-1742]improve table level config priority

2021-04-14 Thread GitBox
NickYoungPeng commented on a change in pull request #2744: URL: https://github.com/apache/hudi/pull/2744#discussion_r612885072 ## File path: hudi-utilities/src/test/java/org/apache/hudi/utilities/functional/TestHoodieDeltaStreamer.java ## @@ -1624,6 +1624,13 @@ public

[GitHub] [hudi] wangxianghu merged pull request #2821: [Hotfix][utilities] Optimized codes

2021-04-14 Thread GitBox
wangxianghu merged pull request #2821: URL: https://github.com/apache/hudi/pull/2821 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service,

[hudi] branch master updated: [Hotfix][utilities] Optimized codes (#2821)

2021-04-14 Thread wangxianghu
This is an automated email from the ASF dual-hosted git repository. wangxianghu pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 62bb9e1 [Hotfix][utilities] Optimized codes

[GitHub] [hudi] RocMarshal commented on a change in pull request #2821: [Hotfix][utilities] Optimized codes

2021-04-14 Thread GitBox
RocMarshal commented on a change in pull request #2821: URL: https://github.com/apache/hudi/pull/2821#discussion_r613698358 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/checkpointing/InitialCheckpointFromAnotherHoodieTimelineProvider.java ## @@ -35,7

[GitHub] [hudi] njalan commented on issue #2609: [SUPPORT] Presto hudi query slow when compared to parquet

2021-04-14 Thread GitBox
njalan commented on issue #2609: URL: https://github.com/apache/hudi/issues/2609#issuecomment-819953659 @tooptoop4 Thanks for your reply -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [hudi] ssdong commented on a change in pull request #2819: [HUDI-1794] Moved static COMMIT_FORMATTER to thread local variable as SimpleDateFormat is not thread safe.

2021-04-14 Thread GitBox
ssdong commented on a change in pull request #2819: URL: https://github.com/apache/hudi/pull/2819#discussion_r613692854 ## File path: hudi-common/src/main/java/org/apache/hudi/common/table/timeline/HoodieActiveTimeline.java ## @@ -73,6 +71,16 @@ private static final Logger

[GitHub] [hudi] ssdong commented on a change in pull request #2821: [Hotfix][utilities] Optimized codes

2021-04-14 Thread GitBox
ssdong commented on a change in pull request #2821: URL: https://github.com/apache/hudi/pull/2821#discussion_r613676406 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/checkpointing/InitialCheckpointFromAnotherHoodieTimelineProvider.java ## @@ -35,7 +35,7

[hudi] branch asf-site updated: Travis CI build asf-site

2021-04-14 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new 3d9cd59 Travis CI build asf-site 3d9cd59 is

[GitHub] [hudi] satishkotha commented on a change in pull request #2678: [HUDI-1746] Added support for replace commits in commit showpartitions, commit show_write_stats, commit showfiles

2021-04-14 Thread GitBox
satishkotha commented on a change in pull request #2678: URL: https://github.com/apache/hudi/pull/2678#discussion_r613635722 ## File path: hudi-cli/src/main/java/org/apache/hudi/cli/commands/CommitsCommand.java ## @@ -431,4 +429,23 @@ public String syncCommits(@CliOption(key

[GitHub] [hudi] dszakallas commented on issue #1751: [SUPPORT] Hudi not working with Spark 3.0.0

2021-04-14 Thread GitBox
dszakallas commented on issue #1751: URL: https://github.com/apache/hudi/issues/1751#issuecomment-819901171 I am getting the same `java.lang.NoClassDefFoundError: org/apache/calcite/rel/type/RelDataTypeSystem` when trying to use Hive sync. It looks like Spark 3 is using a custom class

[GitHub] [hudi] satishkotha commented on pull request #2678: [HUDI-1746] Added support for replace commits in commit showpartitions, commit show_write_stats, commit showfiles

2021-04-14 Thread GitBox
satishkotha commented on pull request #2678: URL: https://github.com/apache/hudi/pull/2678#issuecomment-819896794 @jsbali looks like there are test failures? Can you please fix them? I can review after that. -- This is an automated message from the Apache Git Service. To respond to the

[GitHub] [hudi] satishkotha edited a comment on pull request #2677: [HUDI-1714] Added tests to TestHoodieTimelineArchiveLog for the archival of compl…

2021-04-14 Thread GitBox
satishkotha edited a comment on pull request #2677: URL: https://github.com/apache/hudi/pull/2677#issuecomment-819896025 > @satishkotha I have made the changes as requested. PTAL @jsbali LGTM. thanks for improving test coverage. Can you resolve git conflicts below? I can merge after

[GitHub] [hudi] satishkotha commented on pull request #2677: [HUDI-1714] Added tests to TestHoodieTimelineArchiveLog for the archival of compl…

2021-04-14 Thread GitBox
satishkotha commented on pull request #2677: URL: https://github.com/apache/hudi/pull/2677#issuecomment-819896025 > @satishkotha I have made the changes as requested. PTAL @jsbali LGTM. thanks for improving test coverage. Can you resolve conflicts above? I can merge after that --

[hudi] branch asf-site updated: [DOCS] Add Trino and Pulsar to homepage (#2815)

2021-04-14 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new fd7f2bc [DOCS] Add Trino and Pulsar to

[GitHub] [hudi] vinothchandar merged pull request #2815: [DOCS] Add Trino and Pulsar to homepage

2021-04-14 Thread GitBox
vinothchandar merged pull request #2815: URL: https://github.com/apache/hudi/pull/2815 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service,

[GitHub] [hudi] n3nash commented on issue #2820: getting ClassNotFoundException: com.esotericsoftware.shaded.org.objenesis.strategy.InstantiatorStrategy

2021-04-14 Thread GitBox
n3nash commented on issue #2820: URL: https://github.com/apache/hudi/issues/2820#issuecomment-819792823 @ismailsimsek Can you paste the command you are using to start your spark-shell (assuming that's what you are using here to ingest to a hudi table) ? It should be something like

[GitHub] [hudi] codecov-io commented on pull request #2822: [Hotfix][hudi-sync] Refactor method up to parent-class

2021-04-14 Thread GitBox
codecov-io commented on pull request #2822: URL: https://github.com/apache/hudi/pull/2822#issuecomment-819770749 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2822?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation) Report

[jira] [Updated] (HUDI-1714) Improve code coverage of TestHoodieTimelineArchiveLog

2021-04-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-1714: - Labels: pull-request-available sev:normal (was: sev:normal) > Improve code coverage of

[GitHub] [hudi] jsbali commented on pull request #2677: [HUDI-1714] Added tests to TestHoodieTimelineArchiveLog for the archival of compl…

2021-04-14 Thread GitBox
jsbali commented on pull request #2677: URL: https://github.com/apache/hudi/pull/2677#issuecomment-819764635 @satishkotha I have made the changes as requested. PTAL -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

[GitHub] [hudi] jsbali commented on pull request #2678: [HUDI-1746] Added support for replace commits in commit showpartitions, commit show_write_stats, commit showfiles

2021-04-14 Thread GitBox
jsbali commented on pull request #2678: URL: https://github.com/apache/hudi/pull/2678#issuecomment-819764118 @satishkotha I have made the changes as requested. PTAL -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

[GitHub] [hudi] vinothchandar commented on pull request #2645: [HUDI-1659] Basic Implementation Of Spark Sql Support

2021-04-14 Thread GitBox
vinothchandar commented on pull request #2645: URL: https://github.com/apache/hudi/pull/2645#issuecomment-819721513 @pengzhiwei2018 are the tests passing? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

[GitHub] [hudi] vinothchandar commented on pull request #2815: [DOCS] Add Trino and Pulsar to homepage

2021-04-14 Thread GitBox
vinothchandar commented on pull request #2815: URL: https://github.com/apache/hudi/pull/2815#issuecomment-819693141 Added! Will land once CI passes. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [hudi] RocMarshal opened a new pull request #2822: [Hotfix][hudi-sync] Refactor method up to parent-class

2021-04-14 Thread GitBox
RocMarshal opened a new pull request #2822: URL: https://github.com/apache/hudi/pull/2822 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.* ## What is the purpose of the

[GitHub] [hudi] aditiwari01 edited a comment on issue #2802: Hive read issues when different partition have different schemas.

2021-04-14 Thread GitBox
aditiwari01 edited a comment on issue #2802: URL: https://github.com/apache/hudi/issues/2802#issuecomment-819561416 Hi @n3nash The issue I am facing is in the case when commit gets succeeded. Let me explain the issue with an example: Commit1: insert key1 in partition1 and

[GitHub] [hudi] aditiwari01 commented on issue #2802: Hive read issues when different partition have different schemas.

2021-04-14 Thread GitBox
aditiwari01 commented on issue #2802: URL: https://github.com/apache/hudi/issues/2802#issuecomment-819561416 Hi @n3nash The issue I am facing is in the case when commit gets succeeded. Let me explain the issue with an example: Commit1: insert key1 in partition1 and key2 in

[GitHub] [hudi] ismailsimsek commented on issue #1909: [SUPPORT] "Failed to get update last commit time synced to 20200804071144"

2021-04-14 Thread GitBox
ismailsimsek commented on issue #1909: URL: https://github.com/apache/hudi/issues/1909#issuecomment-819535304 probably related to https://github.com/apache/hudi/issues/2797#issuecomment-819532968 -- This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [hudi] ismailsimsek commented on issue #2797: [SUPPORT] Can not create a Path from an empty string on unpartitioned table

2021-04-14 Thread GitBox
ismailsimsek commented on issue #2797: URL: https://github.com/apache/hudi/issues/2797#issuecomment-819532968 its might be related to missing Glue database s3 path, the field is named "Amazon S3 path"(lakeformation) or "Location"(glue) in aws console as far as i see at one point in

[GitHub] [hudi] RocMarshal opened a new pull request #2821: [Hotfix][utilities] Optimized codes

2021-04-14 Thread GitBox
RocMarshal opened a new pull request #2821: URL: https://github.com/apache/hudi/pull/2821 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.* ## What is the purpose of the

[GitHub] [hudi] pengzhiwei2018 edited a comment on pull request #2645: [HUDI-1659] Basic Implementation Of Spark Sql Support

2021-04-14 Thread GitBox
pengzhiwei2018 edited a comment on pull request #2645: URL: https://github.com/apache/hudi/pull/2645#issuecomment-815583131 Hi @vinothchandar @kwondw , Thanks for the review on this feature. The code has updated. Main changes: - Sql support spark3 based on the same `HoodieAnalysis`

[jira] [Created] (HUDI-1797) Shade google guava for hudi-flink-bundle jar

2021-04-14 Thread Danny Chen (Jira)
Danny Chen created HUDI-1797: Summary: Shade google guava for hudi-flink-bundle jar Key: HUDI-1797 URL: https://issues.apache.org/jira/browse/HUDI-1797 Project: Apache Hudi Issue Type: Task

[GitHub] [hudi] ismailsimsek opened a new issue #2820: getting ClassNotFoundException: com.esotericsoftware.shaded.org.objenesis.strategy.InstantiatorStrategy

2021-04-14 Thread GitBox
ismailsimsek opened a new issue #2820: URL: https://github.com/apache/hudi/issues/2820 getting `Caused by: java.lang.ClassNotFoundException: com.esotericsoftware.shaded.org.objenesis.strategy.InstantiatorStrategy` following dependencies loaded by spark ``` :: loading

[GitHub] [hudi] qianjiangbing commented on issue #2813: [SUPPORT] HoodieRealtimeRecordReader can only work on RealtimeSplit and not with hdfs://111.parquet:0+4

2021-04-14 Thread GitBox
qianjiangbing commented on issue #2813: URL: https://github.com/apache/hudi/issues/2813#issuecomment-819369835 I use hive2.3.8 to test, it's ok! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [hudi] ismailsimsek commented on issue #1679: [HUDI-1609] How to disable Hive JDBC and enable metastore

2021-04-14 Thread GitBox
ismailsimsek commented on issue #1679: URL: https://github.com/apache/hudi/issues/1679#issuecomment-819335030 this comment might help for the second error #1751 (comment) `java.lang.NoClassDefFoundError: org/apache/calcite/rel/type/RelDataTypeSystem` > org.apache.calcite >

[GitHub] [hudi] ismailsimsek commented on issue #2688: [SUPPORT] Sync to Hive using Metastore

2021-04-14 Thread GitBox
ismailsimsek commented on issue #2688: URL: https://github.com/apache/hudi/issues/2688#issuecomment-819333455 this comment might help https://github.com/apache/hudi/issues/1751#issuecomment-648481221 adding this jars solved the error for me > org.apache.calcite > calcite-core

[hudi] branch master updated (ab4a7b0 -> 8d29863)

2021-04-14 Thread garyli
This is an automated email from the ASF dual-hosted git repository. garyli pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from ab4a7b0 [HUDI-1788] Insert overwrite (table) for Flink writer (#2808) add 8d29863 [HUDI-1615] Fixing usage of

[GitHub] [hudi] garyli1019 merged pull request #2777: [HUDI-1615] Fixing usage of NULL schema for delete operation in HoodieSparkSqlWriter

2021-04-14 Thread GitBox
garyli1019 merged pull request #2777: URL: https://github.com/apache/hudi/pull/2777 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service,

[GitHub] [hudi] codecov-io edited a comment on pull request #2819: [HUDI-1794] Moved static COMMIT_FORMATTER to thread local variable as SimpleDateFormat is not thread safe.

2021-04-14 Thread GitBox
codecov-io edited a comment on pull request #2819: URL: https://github.com/apache/hudi/pull/2819#issuecomment-819291374 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For

[jira] [Created] (HUDI-1796) allow ExternalSpillMap use accurate payload size rather than estimated

2021-04-14 Thread ZiyueGuan (Jira)
ZiyueGuan created HUDI-1796: --- Summary: allow ExternalSpillMap use accurate payload size rather than estimated Key: HUDI-1796 URL: https://issues.apache.org/jira/browse/HUDI-1796 Project: Apache Hudi

[GitHub] [hudi] BigWrite closed issue #2817: [SUPPORT]how can i query old snapshot by flink-sql

2021-04-14 Thread GitBox
BigWrite closed issue #2817: URL: https://github.com/apache/hudi/issues/2817 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please

[jira] [Created] (HUDI-1795) allow ExternalSpillMap use accurate payload size rather than estimated

2021-04-14 Thread ZiyueGuan (Jira)
ZiyueGuan created HUDI-1795: --- Summary: allow ExternalSpillMap use accurate payload size rather than estimated Key: HUDI-1795 URL: https://issues.apache.org/jira/browse/HUDI-1795 Project: Apache Hudi

[GitHub] [hudi] codecov-io commented on pull request #2819: [HUDI-1794] Moved static COMMIT_FORMATTER to thread local variable as SimpleDateFormat is not thread safe.

2021-04-14 Thread GitBox
codecov-io commented on pull request #2819: URL: https://github.com/apache/hudi/pull/2819#issuecomment-819291374 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2819?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation) Report

[GitHub] [hudi] BigWrite commented on issue #2817: [SUPPORT]how can i query old snapshot by flink-sql

2021-04-14 Thread GitBox
BigWrite commented on issue #2817: URL: https://github.com/apache/hudi/issues/2817#issuecomment-819289708 > BTW, i suggest you to use the latest master code which is more stable for production now ~ thanks for your help. we are research how to build our datalake,your help means a

[GitHub] [hudi] danny0405 commented on issue #2817: [SUPPORT]how can i query old snapshot by flink-sql

2021-04-14 Thread GitBox
danny0405 commented on issue #2817: URL: https://github.com/apache/hudi/issues/2817#issuecomment-819282430 BTW, i suggest you to use the latest master code which is more stable for production now ~ -- This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [hudi] danny0405 commented on issue #2817: [SUPPORT]how can i query old snapshot by flink-sql

2021-04-14 Thread GitBox
danny0405 commented on issue #2817: URL: https://github.com/apache/hudi/issues/2817#issuecomment-819282116 > @danny0405 tks for your resp, can i manage history snapshot? such as clean log from one year ago ,to save space, only save 1 snapshot at first day of each month You can do

[GitHub] [hudi] BigWrite commented on issue #2817: [SUPPORT]how can i query old snapshot by flink-sql

2021-04-14 Thread GitBox
BigWrite commented on issue #2817: URL: https://github.com/apache/hudi/issues/2817#issuecomment-819280589 @danny0405 tks for your resp, can i manage history snapshot? such as clean log from one year ago ,to save space, only save 1 snapshot at first day of each month -- This is an

[GitHub] [hudi] xiarixiaoyao commented on pull request #2815: [DOCS] Add Trino and Pulsar to homepage

2021-04-14 Thread GitBox
xiarixiaoyao commented on pull request #2815: URL: https://github.com/apache/hudi/pull/2815#issuecomment-819279331 ![image](https://user-images.githubusercontent.com/13514703/114667285-de17eb80-9d31-11eb-97ae-bcef248b6fbe.png) -- This is an automated message from the Apache Git

[jira] [Updated] (HUDI-1794) Generating a new instant time in HoodieActiveTimeline is not thread safe

2021-04-14 Thread Prashant Wason (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prashant Wason updated HUDI-1794: - Status: Patch Available (was: In Progress) > Generating a new instant time in

[jira] [Updated] (HUDI-1794) Generating a new instant time in HoodieActiveTimeline is not thread safe

2021-04-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-1794: - Labels: pull-request-available (was: ) > Generating a new instant time in HoodieActiveTimeline

[GitHub] [hudi] prashantwason opened a new pull request #2819: [HUDI-1794] Moved static COMMIT_FORMATTER to thread local variable as SimpleDateFormat is not thread safe.

2021-04-14 Thread GitBox
prashantwason opened a new pull request #2819: URL: https://github.com/apache/hudi/pull/2819 Added unit test to ensure new instant time can be generated in multiple threads correctly. ## What is the purpose of the pull request When generating a new instant time in

[GitHub] [hudi] codecov-io edited a comment on pull request #2793: [HUDI-57] Support ORC Storage

2021-04-14 Thread GitBox
codecov-io edited a comment on pull request #2793: URL: https://github.com/apache/hudi/pull/2793#issuecomment-819265116 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2793?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)

[GitHub] [hudi] codecov-io commented on pull request #2793: [HUDI-57] Support ORC Storage

2021-04-14 Thread GitBox
codecov-io commented on pull request #2793: URL: https://github.com/apache/hudi/pull/2793#issuecomment-819265116 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2793?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation) Report

[GitHub] [hudi] codecov-io edited a comment on pull request #2793: [HUDI-57] Support ORC Storage

2021-04-14 Thread GitBox
codecov-io edited a comment on pull request #2793: URL: https://github.com/apache/hudi/pull/2793#issuecomment-819265116 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2793?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)

[GitHub] [hudi] danny0405 commented on issue #2817: [SUPPORT]how can i query old snapshot by flink-sql

2021-04-14 Thread GitBox
danny0405 commented on issue #2817: URL: https://github.com/apache/hudi/issues/2817#issuecomment-819258466 @n3nash Sorry i have no permission to edit the tags. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

[GitHub] [hudi] danny0405 commented on issue #2817: [SUPPORT]how can i query old snapshot by flink-sql

2021-04-14 Thread GitBox
danny0405 commented on issue #2817: URL: https://github.com/apache/hudi/issues/2817#issuecomment-819258164 Can not do that now really, for batch query, Flink SQL always query the latest snapshot, would support query the history snapshot soon ~ -- This is an automated message from the

[GitHub] [hudi] xglv1985 edited a comment on issue #2812: [SUPPORT]Got a parquet related error when incremental querying MOR table, using Spark 2.4

2021-04-14 Thread GitBox
xglv1985 edited a comment on issue #2812: URL: https://github.com/apache/hudi/issues/2812#issuecomment-819257340 > @xglv1985 It looks like you hit this issue -> https://issues.apache.org/jira/browse/SPARK-11844. > > The ticket was created as a MINOR and is already resolved by

[GitHub] [hudi] xglv1985 commented on issue #2812: [SUPPORT]Got a parquet related error when incremental querying MOR table, using Spark 2.4

2021-04-14 Thread GitBox
xglv1985 commented on issue #2812: URL: https://github.com/apache/hudi/issues/2812#issuecomment-819257340 > @xglv1985 It looks like you hit this issue -> https://issues.apache.org/jira/browse/SPARK-11844. > > The ticket was created as a MINOR and is already resolved by customers

[jira] [Updated] (HUDI-1794) Generating a new instant time in HoodieActiveTimeline is not thread safe

2021-04-14 Thread Prashant Wason (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prashant Wason updated HUDI-1794: - Status: In Progress (was: Open) > Generating a new instant time in HoodieActiveTimeline is not

[jira] [Created] (HUDI-1794) Generating a new instant time in HoodieActiveTimeline is not thread safe

2021-04-14 Thread Prashant Wason (Jira)
Prashant Wason created HUDI-1794: Summary: Generating a new instant time in HoodieActiveTimeline is not thread safe Key: HUDI-1794 URL: https://issues.apache.org/jira/browse/HUDI-1794 Project: Apache