[GitHub] [hudi] Liulietong commented on a change in pull request #2584: [Hudi-1583]: Fix bug that Hudi will skip remaining log files if there is logFile with zero size in logFileList when merge on rea

2021-02-25 Thread GitBox
Liulietong commented on a change in pull request #2584: URL: https://github.com/apache/hudi/pull/2584#discussion_r583434679 ## File path: hudi-common/src/main/java/org/apache/hudi/common/table/log/HoodieLogFormatReader.java ## @@ -104,7 +104,7 @@ public boolean hasNext() {

[GitHub] [hudi] Liulietong commented on a change in pull request #2584: [Hudi-1583]: Fix bug that Hudi will skip remaining log files if there is logFile with zero size in logFileList when merge on rea

2021-02-25 Thread GitBox
Liulietong commented on a change in pull request #2584: URL: https://github.com/apache/hudi/pull/2584#discussion_r583434679 ## File path: hudi-common/src/main/java/org/apache/hudi/common/table/log/HoodieLogFormatReader.java ## @@ -104,7 +104,7 @@ public boolean hasNext() {

[GitHub] [hudi] codecov-io edited a comment on pull request #2607: [HUDI-1643] Hudi observability - framework to report stats from execu…

2021-02-25 Thread GitBox
codecov-io edited a comment on pull request #2607: URL: https://github.com/apache/hudi/pull/2607#issuecomment-786454326 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2607?src=pr=h1) Report > Merging [#2607](https://codecov.io/gh/apache/hudi/pull/2607?src=pr=desc) (ab93c26) into

[GitHub] [hudi] codecov-io commented on pull request #2607: [HUDI-1643] Hudi observability - framework to report stats from execu…

2021-02-25 Thread GitBox
codecov-io commented on pull request #2607: URL: https://github.com/apache/hudi/pull/2607#issuecomment-786454326 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2607?src=pr=h1) Report > Merging [#2607](https://codecov.io/gh/apache/hudi/pull/2607?src=pr=desc) (ab93c26) into

[GitHub] [hudi] t0il3ts0ap edited a comment on issue #2589: [SUPPORT] Issue with adding column while running deltastreamer with kafka source.

2021-02-25 Thread GitBox
t0il3ts0ap edited a comment on issue #2589: URL: https://github.com/apache/hudi/issues/2589#issuecomment-786029402 @satishkotha Ran again on fresh table, still same issue. SparkSubmit: ``` spark-submit --master yarn --packages

[GitHub] [hudi] liujinhui1994 commented on pull request #2438: [HUDI-1447] DeltaStreamer kafka source supports consuming from specified timestamp

2021-02-25 Thread GitBox
liujinhui1994 commented on pull request #2438: URL: https://github.com/apache/hudi/pull/2438#issuecomment-786452099 I will add the unit test, and then please review This is an automated message from the Apache Git Service.

[jira] [Updated] (HUDI-1643) [Hudi Observability] Framework for reporting stats from executors

2021-02-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-1643: - Labels: pull-request-available (was: ) > [Hudi Observability] Framework for reporting stats from

[GitHub] [hudi] yanghua commented on a change in pull request #2596: [HUDI-1636] Support Builder Pattern To Build Table Properties For Hoo…

2021-02-25 Thread GitBox
yanghua commented on a change in pull request #2596: URL: https://github.com/apache/hudi/pull/2596#discussion_r583407438 ## File path: hudi-common/src/main/java/org/apache/hudi/common/table/HoodieTableConfig.java ## @@ -258,4 +260,142 @@ public String getArchivelogFolder() {

[GitHub] [hudi] nbalajee opened a new pull request #2607: [HUDI-1643] Hudi observability - framework to report stats from execu…

2021-02-25 Thread GitBox
nbalajee opened a new pull request #2607: URL: https://github.com/apache/hudi/pull/2607 …tors ## What is the purpose of the pull request Frame work for collecting Hudi Observability stats from the executors. ## Brief change log - Using distributed registry, report

[GitHub] [hudi] xushiyan commented on a change in pull request #2541: [HUDI-1587] Add latency and freshness support

2021-02-25 Thread GitBox
xushiyan commented on a change in pull request #2541: URL: https://github.com/apache/hudi/pull/2541#discussion_r583403910 ## File path: hudi-common/src/main/java/org/apache/hudi/common/util/DateTimeUtils.java ## @@ -0,0 +1,41 @@ +/* + * Licensed to the Apache Software

[hudi] branch asf-site updated: Travis CI build asf-site

2021-02-25 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new c5d50f0 Travis CI build asf-site c5d50f0 is

[jira] [Created] (HUDI-1643) [Hudi Observability] Framework for reporting stats from executors

2021-02-25 Thread Balajee Nagasubramaniam (Jira)
Balajee Nagasubramaniam created HUDI-1643: - Summary: [Hudi Observability] Framework for reporting stats from executors Key: HUDI-1643 URL: https://issues.apache.org/jira/browse/HUDI-1643

[hudi] branch asf-site updated: [MINOR] Fixing slideshare link (#2606)

2021-02-25 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new 1dd7a41 [MINOR] Fixing slideshare link

[GitHub] [hudi] vinothchandar merged pull request #2606: [MINOR] Fixing slideshare link

2021-02-25 Thread GitBox
vinothchandar merged pull request #2606: URL: https://github.com/apache/hudi/pull/2606 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [hudi] satishkotha commented on a change in pull request #2584: [Hudi-1583]: Fix bug that Hudi will skip remaining log files if there is logFile with zero size in logFileList when merge on re

2021-02-25 Thread GitBox
satishkotha commented on a change in pull request #2584: URL: https://github.com/apache/hudi/pull/2584#discussion_r583398027 ## File path: hudi-common/src/main/java/org/apache/hudi/common/table/log/HoodieLogFormatReader.java ## @@ -104,7 +104,7 @@ public boolean hasNext() {

[GitHub] [hudi] pengzhiwei2018 commented on a change in pull request #2596: [HUDI-1636] Support Builder Pattern To Build Table Properties For Hoo…

2021-02-25 Thread GitBox
pengzhiwei2018 commented on a change in pull request #2596: URL: https://github.com/apache/hudi/pull/2596#discussion_r583385132 ## File path: hudi-cli/src/main/java/org/apache/hudi/cli/commands/TableCommand.java ## @@ -106,10 +106,13 @@ public String createTable( throw

[GitHub] [hudi] n3nash commented on pull request #2606: [MINOR] Fixing slideshare link

2021-02-25 Thread GitBox
n3nash commented on pull request #2606: URL: https://github.com/apache/hudi/pull/2606#issuecomment-786408716 Verified locally that it works. This is an automated message from the Apache Git Service. To respond to the

[GitHub] [hudi] n3nash opened a new pull request #2606: [MINOR] Fixing slideshare link

2021-02-25 Thread GitBox
n3nash opened a new pull request #2606: URL: https://github.com/apache/hudi/pull/2606 Fixing broken link ## Committer checklist - [ ] Has a corresponding JIRA in PR title & commit - [ ] Commit message is descriptive of the change - [ ] CI is green -

[GitHub] [hudi] danny0405 commented on a change in pull request #2600: [HUDI-1638] Some improvements to BucketAssignFunction

2021-02-25 Thread GitBox
danny0405 commented on a change in pull request #2600: URL: https://github.com/apache/hudi/pull/2600#discussion_r583369329 ## File path: hudi-flink/src/main/java/org/apache/hudi/operator/partitioner/BucketAssignFunction.java ## @@ -136,15 +137,10 @@ public void

[GitHub] [hudi] hk-lrzy commented on a change in pull request #2600: [HUDI-1638] Some improvements to BucketAssignFunction

2021-02-25 Thread GitBox
hk-lrzy commented on a change in pull request #2600: URL: https://github.com/apache/hudi/pull/2600#discussion_r583366344 ## File path: hudi-flink/src/main/java/org/apache/hudi/operator/partitioner/BucketAssignFunction.java ## @@ -136,15 +137,10 @@ public void

[GitHub] [hudi] hk-lrzy closed pull request #2604: [hudi-1639][hudi-flink] fix BucketAssigner npe

2021-02-25 Thread GitBox
hk-lrzy closed pull request #2604: URL: https://github.com/apache/hudi/pull/2604 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[hudi] branch asf-site updated: Travis CI build asf-site

2021-02-25 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new 1f101fe Travis CI build asf-site 1f101fe is

[GitHub] [hudi] codejoyan commented on issue #2592: [SUPPORT] Does latest versions of Hudi (0.7.0, 0.6.0) work with Spark 2.3.0 when reading orc files?

2021-02-25 Thread GitBox
codejoyan commented on issue #2592: URL: https://github.com/apache/hudi/issues/2592#issuecomment-786379571 This is the spark version of the cluster being used at work so I will have to use Spark 2.3 until there is an upgrade. Since the documentation says: **Hudi works with Spark-2.x**, I

[hudi] branch asf-site updated: [HUDI 1642] Adding Hudi Learning series presentation & Uber eng blog links (#2602)

2021-02-25 Thread lamberken
This is an automated email from the ASF dual-hosted git repository. lamberken pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new eeb146a [HUDI 1642] Adding Hudi Learning

[GitHub] [hudi] lamber-ken merged pull request #2602: [HUDI 1642] Adding Hudi Learning series presentation & Uber eng blog links

2021-02-25 Thread GitBox
lamber-ken merged pull request #2602: URL: https://github.com/apache/hudi/pull/2602 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [hudi] lamber-ken edited a comment on pull request #2602: [HUDI 1642] Adding Hudi Learning series presentation & Uber eng blog links

2021-02-25 Thread GitBox
lamber-ken edited a comment on pull request #2602: URL: https://github.com/apache/hudi/pull/2602#issuecomment-786378892 Thanks @n3nash  LGTM This is an automated message from the Apache Git Service. To respond to the

[GitHub] [hudi] lamber-ken commented on pull request #2602: [HUDI 1642] Adding Hudi Learning series presentation & Uber eng blog links

2021-02-25 Thread GitBox
lamber-ken commented on pull request #2602: URL: https://github.com/apache/hudi/pull/2602#issuecomment-786378892 Thanks @n3nash  This is an automated message from the Apache Git Service. To respond to the message, please

[GitHub] [hudi] lamber-ken commented on a change in pull request #2602: [HUDI 1642] Adding Hudi Learning series presentation & Uber eng blog links

2021-02-25 Thread GitBox
lamber-ken commented on a change in pull request #2602: URL: https://github.com/apache/hudi/pull/2602#discussion_r583352619 ## File path: content/docs/0.5.3-powered_by.html ## @@ -462,13 +462,17 @@ Talks Presentations

[GitHub] [hudi] codecov-io edited a comment on pull request #2596: [HUDI-1636] Support Builder Pattern To Build Table Properties For Hoo…

2021-02-25 Thread GitBox
codecov-io edited a comment on pull request #2596: URL: https://github.com/apache/hudi/pull/2596#issuecomment-784717451 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2596?src=pr=h1) Report > Merging [#2596](https://codecov.io/gh/apache/hudi/pull/2596?src=pr=desc) (c71fe74) into

[GitHub] [hudi] garyli1019 commented on issue #2498: [SUPPORT] Hudi MERGE_ON_READ load to dataframe fails for the versions [0.6.0],[0.7.0] and runs for [0.5.3]

2021-02-25 Thread GitBox
garyli1019 commented on issue #2498: URL: https://github.com/apache/hudi/issues/2498#issuecomment-786364265 I am seeing the same problem when the compiled spark distribution is different from the runtime spark distribution. Compile hudi jar against the runtime spark distribution should

[GitHub] [hudi] garyli1019 commented on pull request #2584: [Hudi-1583]: Fix bug that Hudi will skip remaining log files if there is logFile with zero size in logFileList when merge on read.

2021-02-25 Thread GitBox
garyli1019 commented on pull request #2584: URL: https://github.com/apache/hudi/pull/2584#issuecomment-786361769 hi @satishkotha , this PR seems related to #2583 , would you take a look? This is an automated message from the

[GitHub] [hudi] pengzhiwei2018 commented on a change in pull request #2596: [HUDI-1636] Support Builder Pattern To Build Table Properties For Hoo…

2021-02-25 Thread GitBox
pengzhiwei2018 commented on a change in pull request #2596: URL: https://github.com/apache/hudi/pull/2596#discussion_r583334728 ## File path: hudi-cli/src/main/java/org/apache/hudi/cli/commands/TableCommand.java ## @@ -106,10 +106,13 @@ public String createTable( throw

[GitHub] [hudi] yanghua commented on a change in pull request #2596: [HUDI-1636] Support Builder Pattern To Build Table Properties For Hoo…

2021-02-25 Thread GitBox
yanghua commented on a change in pull request #2596: URL: https://github.com/apache/hudi/pull/2596#discussion_r583313615 ## File path: hudi-cli/src/main/java/org/apache/hudi/cli/commands/TableCommand.java ## @@ -106,10 +106,13 @@ public String createTable( throw new

[GitHub] [hudi] n3nash merged pull request #2565: [HUDI-1611] Added a configuration to allow specific directories to be filtered out during Metadata Table bootstrap.

2021-02-25 Thread GitBox
n3nash merged pull request #2565: URL: https://github.com/apache/hudi/pull/2565 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[hudi] branch master updated: [HUDI-1611] Added a configuration to allow specific directories to be filtered out during Metadata Table bootstrap. (#2565)

2021-02-25 Thread nagarwal
This is an automated email from the ASF dual-hosted git repository. nagarwal pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 022df0d [HUDI-1611] Added a configuration to

[hudi] branch master updated: Fixing README for hudi test suite long running job (#2578)

2021-02-25 Thread nagarwal
This is an automated email from the ASF dual-hosted git repository. nagarwal pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 9f5e8cc Fixing README for hudi test suite long

[GitHub] [hudi] n3nash merged pull request #2578: [MINOR] Fixing Hudi Test suite readme for long running job

2021-02-25 Thread GitBox
n3nash merged pull request #2578: URL: https://github.com/apache/hudi/pull/2578 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[jira] [Created] (HUDI-1642) Add Links to Uber engineering blog and meet up slides

2021-02-25 Thread Nishith Agarwal (Jira)
Nishith Agarwal created HUDI-1642: - Summary: Add Links to Uber engineering blog and meet up slides Key: HUDI-1642 URL: https://issues.apache.org/jira/browse/HUDI-1642 Project: Apache Hudi

[GitHub] [hudi] n3nash commented on a change in pull request #2602: Adding Hudi Learning series presentation & Uber eng blog links

2021-02-25 Thread GitBox
n3nash commented on a change in pull request #2602: URL: https://github.com/apache/hudi/pull/2602#discussion_r583286829 ## File path: content/docs/0.5.3-powered_by.html ## @@ -462,13 +462,17 @@ Talks Presentations

[GitHub] [hudi] afeldman1 closed issue #2399: [SUPPORT] Hudi deletes not being properly commited

2021-02-25 Thread GitBox
afeldman1 closed issue #2399: URL: https://github.com/apache/hudi/issues/2399 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [hudi] afeldman1 commented on issue #2399: [SUPPORT] Hudi deletes not being properly commited

2021-02-25 Thread GitBox
afeldman1 commented on issue #2399: URL: https://github.com/apache/hudi/issues/2399#issuecomment-786204481 Apologies for the delayed response. And thank you to @bvaradar for the initial hint. The issue turned out to be caused not by the keys but another one of the configuration

[GitHub] [hudi] kpurella commented on issue #2240: [SUPPORT] Performance Issue : HUDI MOR ,UPSERT Job running forever

2021-02-25 Thread GitBox
kpurella commented on issue #2240: URL: https://github.com/apache/hudi/issues/2240#issuecomment-786183675 @vinothchandar Sure ,i will !! This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [hudi] toninis commented on issue #2149: Help with Reading Kafka topic written using Debezium Connector - Deltastreamer

2021-02-25 Thread GitBox
toninis commented on issue #2149: URL: https://github.com/apache/hudi/issues/2149#issuecomment-786183618 @vinothchandar Sorry I took so long to respond . It had worked and compiled successfully . I probably had missed something at the time . Thanks for your response at the time

[jira] [Updated] (HUDI-1641) Issue for Integrating Hudi with Kafka using Avro Schema

2021-02-25 Thread PRASHANT BHOSALE (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] PRASHANT BHOSALE updated HUDI-1641: --- Description: I am trying to integrate Hudi with Kafka topic. teps followed : # Created Kafka

[GitHub] [hudi] t0il3ts0ap edited a comment on issue #2589: [SUPPORT] Issue with adding column while running deltastreamer with kafka source.

2021-02-25 Thread GitBox
t0il3ts0ap edited a comment on issue #2589: URL: https://github.com/apache/hudi/issues/2589#issuecomment-786029402 @satishkotha Ran again on fresh table, still same issue. SparkSubmit: ``` spark-submit --master yarn --packages

[GitHub] [hudi] t0il3ts0ap commented on issue #2589: [SUPPORT] Issue with adding column while running deltastreamer with kafka source.

2021-02-25 Thread GitBox
t0il3ts0ap commented on issue #2589: URL: https://github.com/apache/hudi/issues/2589#issuecomment-786029402 Ran again on fresh table, still same issue. SparkSubmit: ``` spark-submit --master yarn --packages

[jira] [Resolved] (HUDI-1269) Make whether the failure of connect hive affects hudi ingest process configurable

2021-02-25 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-1269. --- Fix Version/s: 0.8.0 Resolution: Fixed > Make whether the failure of connect

[hudi] branch master updated: [HUDI-1269] Make whether the failure of connect hive affects hudi ingest process configurable (#2443)

2021-02-25 Thread sivabalan
This is an automated email from the ASF dual-hosted git repository. sivabalan pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 8c2197a [HUDI-1269] Make whether the failure

[jira] [Updated] (HUDI-1269) Make whether the failure of connect hive affects hudi ingest process configurable

2021-02-25 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1269: -- Status: Open (was: New) > Make whether the failure of connect hive affects hudi ingest

[jira] [Updated] (HUDI-1269) Make whether the failure of connect hive affects hudi ingest process configurable

2021-02-25 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1269: -- Status: In Progress (was: Open) > Make whether the failure of connect hive affects

[GitHub] [hudi] nsivabalan merged pull request #2443: [HUDI-1269] Make whether the failure of connect hive affects hudi ingest process configurable

2021-02-25 Thread GitBox
nsivabalan merged pull request #2443: URL: https://github.com/apache/hudi/pull/2443 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [hudi] Rap70r commented on issue #2586: [SUPPORT] - How to guarantee snapshot isolation when reading Hudi tables in S3?

2021-02-25 Thread GitBox
Rap70r commented on issue #2586: URL: https://github.com/apache/hudi/issues/2586#issuecomment-785941366 Hi nsivabalan, Thank you for your reply. * Incremental updates include both inserts and updates. Mostly updates. * We can try increasing retention version to a higher

[GitHub] [hudi] rubenssoto commented on issue #2588: [SUPPORT] Cannot create hive connection

2021-02-25 Thread GitBox
rubenssoto commented on issue #2588: URL: https://github.com/apache/hudi/issues/2588#issuecomment-785933427 @bvaradar I think it is a hive issue, I'm trying to increase hive heap size, I hope it helps. I process the tables in threads, so I have almost 20 hive connections open.

[GitHub] [hudi] bvaradar commented on issue #2448: [SUPPORT] deltacommit for client 172.16.116.102 already exists

2021-02-25 Thread GitBox
bvaradar commented on issue #2448: URL: https://github.com/apache/hudi/issues/2448#issuecomment-785932137 @root18039532923 : Please look at https://hudi.apache.org/blog/async-compaction-deployment-model/ for running async compactions

[GitHub] [hudi] bvaradar commented on issue #2555: [SUPPORT] Trying and Understanding Clustering

2021-02-25 Thread GitBox
bvaradar commented on issue #2555: URL: https://github.com/apache/hudi/issues/2555#issuecomment-785930358 For bulkinsert, you need to size the parallelism to control. Please see https://cwiki.apache.org/confluence/display/HUDI/FAQ#FAQ-Whatperformance/ingestlatencycanIexpectforHudiwriting

[GitHub] [hudi] bvaradar commented on issue #2588: [SUPPORT] Cannot create hive connection

2021-02-25 Thread GitBox
bvaradar commented on issue #2588: URL: https://github.com/apache/hudi/issues/2588#issuecomment-785928870 @rubenssoto : The stack-trace does not contain Hudi in it. So, I dont know how to help in this regard. Regarding high cpu load on hive server, Are you also running hive queries apart

[jira] [Created] (HUDI-1641) Issue for Integrating Hudi with Kafka using Avro Schema

2021-02-25 Thread PRASHANT BHOSALE (Jira)
PRASHANT BHOSALE created HUDI-1641: -- Summary: Issue for Integrating Hudi with Kafka using Avro Schema Key: HUDI-1641 URL: https://issues.apache.org/jira/browse/HUDI-1641 Project: Apache Hudi

[GitHub] [hudi] bvaradar commented on a change in pull request #2520: [HUDI-1446] Support skip bootstrapIndex's init in abstract fs view init

2021-02-25 Thread GitBox
bvaradar commented on a change in pull request #2520: URL: https://github.com/apache/hudi/pull/2520#discussion_r582853801 ## File path: hudi-common/src/main/java/org/apache/hudi/common/bootstrap/index/DefaultBootstrapIndex.java ## @@ -0,0 +1,82 @@ +/* + * Licensed to the

[GitHub] [hudi] bvaradar commented on issue #2605: [SUPPORT] How to reload a writeConfig from a existed hudi path ?

2021-02-25 Thread GitBox
bvaradar commented on issue #2605: URL: https://github.com/apache/hudi/issues/2605#issuecomment-785906565 Ideally, spark data-source should provide that option (like optionFromFile(...). Not sure if there is anything like that. Created :

[jira] [Commented] (HUDI-1640) Implement Spark Datasource option to read hudi configs from properties file

2021-02-25 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17290922#comment-17290922 ] Balaji Varadarajan commented on HUDI-1640: -- [~shivnarayan]: Can you vet this and add to the work

[jira] [Created] (HUDI-1640) Implement Spark Datasource option to read hudi configs from properties file

2021-02-25 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1640: Summary: Implement Spark Datasource option to read hudi configs from properties file Key: HUDI-1640 URL: https://issues.apache.org/jira/browse/HUDI-1640

[GitHub] [hudi] bvaradar commented on issue #2592: [SUPPORT] Does latest versions of Hudi (0.7.0, 0.6.0) work with Spark 2.3.0 when reading orc files?

2021-02-25 Thread GitBox
bvaradar commented on issue #2592: URL: https://github.com/apache/hudi/issues/2592#issuecomment-785896778 I was unable to setup spark-2.3.0 in my setup. But,with spark-2.4.4, this works fine as below. Can you use spark-2.4.x version. spark-2.3 seems too old though ? `21/02/25

[GitHub] [hudi] codecov-io edited a comment on pull request #2443: [HUDI-1269] Make whether the failure of connect hive affects hudi ingest process configurable

2021-02-25 Thread GitBox
codecov-io edited a comment on pull request #2443: URL: https://github.com/apache/hudi/pull/2443#issuecomment-760147630 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2443?src=pr=h1) Report > Merging [#2443](https://codecov.io/gh/apache/hudi/pull/2443?src=pr=desc) (7baf5de) into

[GitHub] [hudi] codecov-io edited a comment on pull request #2378: [HUDI-1491] Support partition pruning for MOR snapshot query

2021-02-25 Thread GitBox
codecov-io edited a comment on pull request #2378: URL: https://github.com/apache/hudi/pull/2378#issuecomment-751218636 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2378?src=pr=h1) Report > Merging [#2378](https://codecov.io/gh/apache/hudi/pull/2378?src=pr=desc) (ef2107f) into

[jira] [Resolved] (HUDI-1367) Make delastreamer transition from dfsSouce to kafkasouce

2021-02-25 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-1367. --- Fix Version/s: 0.8.0 Resolution: Fixed > Make delastreamer transition from

[jira] [Reopened] (HUDI-1367) Make delastreamer transition from dfsSouce to kafkasouce

2021-02-25 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reopened HUDI-1367: --- > Make delastreamer transition from dfsSouce to kafkasouce >

[jira] [Updated] (HUDI-1367) Make delastreamer transition from dfsSouce to kafkasouce

2021-02-25 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1367: -- Status: Patch Available (was: In Progress) > Make delastreamer transition from

[jira] [Updated] (HUDI-1367) Make delastreamer transition from dfsSouce to kafkasouce

2021-02-25 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1367: -- Status: Closed (was: Patch Available) > Make delastreamer transition from dfsSouce to

[hudi] branch master updated: [HUDI-1367] Make deltaStreamer transition from dfsSouce to kafkasouce (#2227)

2021-02-25 Thread sivabalan
This is an automated email from the ASF dual-hosted git repository. sivabalan pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 617cc24 [HUDI-1367] Make deltaStreamer

[GitHub] [hudi] nsivabalan merged pull request #2227: [HUDI-1367] Make deltaStreamer transition from dfsSouce to kafkasouce

2021-02-25 Thread GitBox
nsivabalan merged pull request #2227: URL: https://github.com/apache/hudi/pull/2227 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [hudi] codecov-io edited a comment on pull request #2519: [HUDI-1573] Spark Sql Writer support Multi preCmp Field

2021-02-25 Thread GitBox
codecov-io edited a comment on pull request #2519: URL: https://github.com/apache/hudi/pull/2519#issuecomment-771782258 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2519?src=pr=h1) Report > Merging [#2519](https://codecov.io/gh/apache/hudi/pull/2519?src=pr=desc) (6f0fd84) into

[GitHub] [hudi] rakeshramakrishnan edited a comment on issue #2439: [SUPPORT] Unable to sync with external hive metastore via metastore uris in the thrift protocol

2021-02-25 Thread GitBox
rakeshramakrishnan edited a comment on issue #2439: URL: https://github.com/apache/hudi/issues/2439#issuecomment-785795722 @nsivabalan : There are no errors, however through hudi, the connection is made to the local hive metastore (from spark). It doesn't connect to the external hive

[GitHub] [hudi] rakeshramakrishnan commented on issue #2439: [SUPPORT] Unable to sync with external hive metastore via metastore uris in the thrift protocol

2021-02-25 Thread GitBox
rakeshramakrishnan commented on issue #2439: URL: https://github.com/apache/hudi/issues/2439#issuecomment-785795722 @nsivabalan : There are no errors, however through hudi, the connection is made to the local hive metastore (from spark). It doesn't connect to the external hive metastore.

[GitHub] [hudi] Xoln commented on a change in pull request #2520: [HUDI-1446] Support skip bootstrapIndex's init in abstract fs view init

2021-02-25 Thread GitBox
Xoln commented on a change in pull request #2520: URL: https://github.com/apache/hudi/pull/2520#discussion_r582717002 ## File path: hudi-common/src/main/java/org/apache/hudi/common/bootstrap/index/DefaultBootstrapIndex.java ## @@ -0,0 +1,61 @@ +package

[GitHub] [hudi] codecov-io commented on pull request #2604: [hudi-1639][hudi-flink] fix BucketAssigner npe

2021-02-25 Thread GitBox
codecov-io commented on pull request #2604: URL: https://github.com/apache/hudi/pull/2604#issuecomment-785704660 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2604?src=pr=h1) Report > Merging [#2604](https://codecov.io/gh/apache/hudi/pull/2604?src=pr=desc) (fed6575) into