[incubator-hudi] branch master updated: [MINOR] Fix resource cleanup in TestTableSchemaEvolution (#1640)

2020-05-20 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git The following commit(s) were added to refs/heads/master by this push: new f802d44 [MINOR] Fix resource cleanup

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1640: [MINOR] Fix resource cleanup in TestTableSchemaEvolution

2020-05-20 Thread GitBox
vinothchandar commented on a change in pull request #1640: URL: https://github.com/apache/incubator-hudi/pull/1640#discussion_r427956293 ## File path: pom.xml ## @@ -245,7 +245,8 @@ ${maven-surefire-plugin.version} ${skipUTs} - -Xms256m

[jira] [Updated] (HUDI-889) Writer supports useJdbc configuration when hive synchronization is enabled

2020-05-20 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-889: - Fix Version/s: (was: 0.5.3) > Writer supports useJdbc configuration when hive

[jira] [Reopened] (HUDI-889) Writer supports useJdbc configuration when hive synchronization is enabled

2020-05-20 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reopened HUDI-889: -- > Writer supports useJdbc configuration when hive synchronization is enabled >

[GitHub] [incubator-hudi] vinothchandar commented on pull request #1538: [HUDI-803]: added more test cases in TestHoodieAvroUtils.class

2020-05-20 Thread GitBox
vinothchandar commented on pull request #1538: URL: https://github.com/apache/incubator-hudi/pull/1538#issuecomment-631444156 @pratyakshsharma by close, you mean final review and merge right? :) This is an automated message

[GitHub] [incubator-hudi] pratyakshsharma opened a new pull request #1647: [HUDI-867]: fixed IllegalArgumentException from graphite metrics in deltaStreamer continuous mode

2020-05-20 Thread GitBox
pratyakshsharma opened a new pull request #1647: URL: https://github.com/apache/incubator-hudi/pull/1647 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.* ## What is

[jira] [Updated] (HUDI-867) Graphite metrics are throwing IllegalArgumentException on continuous mode

2020-05-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-867: Labels: bug-bash-0.6.0 pull-request-available (was: bug-bash-0.6.0) > Graphite metrics are throwing

[GitHub] [incubator-hudi] bvaradar commented on a change in pull request #1566: [HUDI-603]: DeltaStreamer can now fetch schema before every run in continuous mode

2020-05-20 Thread GitBox
bvaradar commented on a change in pull request #1566: URL: https://github.com/apache/incubator-hudi/pull/1566#discussion_r428209317 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/schema/SchemaRegistryProvider.java ## @@ -81,11 +66,22 @@ private static

[jira] [Commented] (HUDI-890) Prepare for 0.5.3 patch release

2020-05-20 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112436#comment-17112436 ] Balaji Varadarajan commented on HUDI-890: - [~shivnarayan]: HUDI-846 is merged to master (as part of

[jira] [Updated] (HUDI-848) Turn on embedded timeline server by default for all writes

2020-05-20 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-848: Status: In Progress (was: Open) > Turn on embedded timeline server by default for all

[GitHub] [incubator-hudi] vinothchandar merged pull request #1538: [HUDI-803]: added more test cases in TestHoodieAvroUtils.class

2020-05-20 Thread GitBox
vinothchandar merged pull request #1538: URL: https://github.com/apache/incubator-hudi/pull/1538 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

[incubator-hudi] branch master updated: [HUDI-803] Replaced used of NullNode with JsonProperties.NULL_VALUE in HoodieAvroUtils (#1538)

2020-05-20 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git The following commit(s) were added to refs/heads/master by this push: new 6a0aa9a [HUDI-803] Replaced used of

[GitHub] [incubator-hudi] xushiyan commented on pull request #1644: [HUDI-811] Restructure test packages in hudi-common

2020-05-20 Thread GitBox
xushiyan commented on pull request #1644: URL: https://github.com/apache/incubator-hudi/pull/1644#issuecomment-631577878 ![Screen Shot 2020-05-20 at 8 57 05 AM](https://user-images.githubusercontent.com/2701446/82468795-15294280-9a78-11ea-909a-bf09da83d7a4.png) Test classes under

[GitHub] [incubator-hudi] bvaradar commented on pull request #1645: [HUDI-707]Add unit test for StatsCommand

2020-05-20 Thread GitBox
bvaradar commented on pull request #1645: URL: https://github.com/apache/incubator-hudi/pull/1645#issuecomment-631624593 @yanghua @leesf : Would you be interested in shepherding this PR when it is ready ? This is an

[jira] [Commented] (HUDI-846) Turn on incremental cleaning bu default in 0.6.0

2020-05-20 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112434#comment-17112434 ] Balaji Varadarajan commented on HUDI-846: - Incremental cleaning is enabled by default as part of 

[jira] [Resolved] (HUDI-846) Turn on incremental cleaning bu default in 0.6.0

2020-05-20 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan resolved HUDI-846. - Resolution: Fixed > Turn on incremental cleaning bu default in 0.6.0 >

[jira] [Updated] (HUDI-846) Turn on incremental cleaning bu default in 0.6.0

2020-05-20 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-846: Status: In Progress (was: Open) > Turn on incremental cleaning bu default in 0.6.0 >

[GitHub] [incubator-hudi] bvaradar commented on a change in pull request #1566: [HUDI-603]: DeltaStreamer can now fetch schema before every run in continuous mode

2020-05-20 Thread GitBox
bvaradar commented on a change in pull request #1566: URL: https://github.com/apache/incubator-hudi/pull/1566#discussion_r428206400 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/schema/SchemaSet.java ## @@ -0,0 +1,44 @@ +/* + * Licensed to the Apache

[GitHub] [incubator-hudi] garyli1019 commented on pull request #1643: [HUDI-110] Spark Datasource Auto Partition Extractor

2020-05-20 Thread GitBox
garyli1019 commented on pull request #1643: URL: https://github.com/apache/incubator-hudi/pull/1643#issuecomment-631589860 @vinothchandar Yes this feature was already supported. Maybe I misunderstand this ticket https://issues.apache.org/jira/browse/HUDI-110. Need @bvaradar 's input here.

[incubator-hudi] branch hudi_test_suite_refactor updated (6472886 -> 2773fe9)

2020-05-20 Thread nagarwal
This is an automated email from the ASF dual-hosted git repository. nagarwal pushed a change to branch hudi_test_suite_refactor in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git. discard 6472886 [HUDI-394] Provide a basic implementation of test suite add 2773fe9

[GitHub] [incubator-hudi] bvaradar commented on pull request #1643: [HUDI-110] Spark Datasource Auto Partition Extractor

2020-05-20 Thread GitBox
bvaradar commented on pull request #1643: URL: https://github.com/apache/incubator-hudi/pull/1643#issuecomment-631630091 @garyli1019 : There are 2 parts to it : The ticket was originally created to track making hive-style partitioning scheme as default in Hudi. Spark supports this same

[GitHub] [incubator-hudi] bvaradar commented on a change in pull request #1566: [HUDI-603]: DeltaStreamer can now fetch schema before every run in continuous mode

2020-05-20 Thread GitBox
bvaradar commented on a change in pull request #1566: URL: https://github.com/apache/incubator-hudi/pull/1566#discussion_r428208406 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/schema/SchemaRegistryProvider.java ## @@ -81,11 +66,22 @@ private static

[GitHub] [incubator-hudi] bvaradar commented on a change in pull request #1566: [HUDI-603]: DeltaStreamer can now fetch schema before every run in continuous mode

2020-05-20 Thread GitBox
bvaradar commented on a change in pull request #1566: URL: https://github.com/apache/incubator-hudi/pull/1566#discussion_r428210549 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/schema/SchemaSet.java ## @@ -0,0 +1,44 @@ +/* + * Licensed to the Apache

[jira] [Resolved] (HUDI-848) Turn on embedded timeline server by default for all writes

2020-05-20 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan resolved HUDI-848. - Resolution: Fixed > Turn on embedded timeline server by default for all writes >

[incubator-hudi] branch hudi_test_suite_refactor updated (2773fe9 -> 7781692)

2020-05-20 Thread nagarwal
This is an automated email from the ASF dual-hosted git repository. nagarwal pushed a change to branch hudi_test_suite_refactor in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git. discard 2773fe9 [HUDI-394] Provide a basic implementation of test suite add 7781692

[GitHub] [incubator-hudi] vinothchandar commented on pull request #1638: HUDI-515 Resolve API conflict for Hive 2 & Hive 3

2020-05-20 Thread GitBox
vinothchandar commented on pull request #1638: URL: https://github.com/apache/incubator-hudi/pull/1638#issuecomment-631440488 FWIW using reflection is probably the only way, if the API between Hive 2 and 3 has broken (IIUC it has)

[jira] [Updated] (HUDI-916) Add support for multiple date/time formats in TimestampBasedKeyGenerator

2020-05-20 Thread Pratyaksh Sharma (Jira)
[ https://issues.apache.org/jira/browse/HUDI-916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratyaksh Sharma updated HUDI-916: -- Status: Open (was: New) > Add support for multiple date/time formats in

[jira] [Created] (HUDI-916) Add support for multiple date/time formats in TimestampBasedKeyGenerator

2020-05-20 Thread Pratyaksh Sharma (Jira)
Pratyaksh Sharma created HUDI-916: - Summary: Add support for multiple date/time formats in TimestampBasedKeyGenerator Key: HUDI-916 URL: https://issues.apache.org/jira/browse/HUDI-916 Project: Apache

[GitHub] [incubator-hudi] vinothchandar commented on pull request #1484: [HUDI-316] : Hbase qps repartition writestatus

2020-05-20 Thread GitBox
vinothchandar commented on pull request #1484: URL: https://github.com/apache/incubator-hudi/pull/1484#issuecomment-631426244 @v3nkatesh allow me to give some context around why we are strict around guava.. Hudi as you know has to be dropped under many different services

[GitHub] [incubator-hudi] vinothchandar commented on issue #1641: [SUPPORT] Failed to merge old record into new file for key xxx from old file 123.parquet to new file 456.parquet

2020-05-20 Thread GitBox
vinothchandar commented on issue #1641: URL: https://github.com/apache/incubator-hudi/issues/1641#issuecomment-631438361 Looks like a schema mismatch.. did you change a number to a string for .eg? This is an automated

[GitHub] [incubator-hudi] vinothchandar commented on issue #1641: [SUPPORT] Failed to merge old record into new file for key xxx from old file 123.parquet to new file 456.parquet

2020-05-20 Thread GitBox
vinothchandar commented on issue #1641: URL: https://github.com/apache/incubator-hudi/issues/1641#issuecomment-631438538 cc @lamber-ken @leesf any of you , interested in helping here? :) This is an automated message from

[jira] [Commented] (HUDI-914) support different target data clusters

2020-05-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112128#comment-17112128 ] Vinoth Chandar commented on HUDI-914: - For my understanding, whats a specific scenario where you cannot

[GitHub] [incubator-hudi] vinothchandar commented on pull request #1538: [HUDI-803]: added more test cases in TestHoodieAvroUtils.class

2020-05-20 Thread GitBox
vinothchandar commented on pull request #1538: URL: https://github.com/apache/incubator-hudi/pull/1538#issuecomment-631445518 @pratyakshsharma could you rebase again and repush .. codecov seems to need that to work.. This

[GitHub] [incubator-hudi] pratyakshsharma commented on pull request #1538: [HUDI-803]: added more test cases in TestHoodieAvroUtils.class

2020-05-20 Thread GitBox
pratyakshsharma commented on pull request #1538: URL: https://github.com/apache/incubator-hudi/pull/1538#issuecomment-631472874 > @pratyakshsharma by close, you mean final review and merge right? :) Yes :) Rebased and pushed again.

[GitHub] [incubator-hudi] codecov-commenter commented on pull request #1647: [HUDI-867]: fixed IllegalArgumentException from graphite metrics in deltaStreamer continuous mode

2020-05-20 Thread GitBox
codecov-commenter commented on pull request #1647: URL: https://github.com/apache/incubator-hudi/pull/1647#issuecomment-631511240 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1647?src=pr=h1) Report > Merging

[GitHub] [incubator-hudi] codecov-commenter edited a comment on pull request #1538: [HUDI-803]: added more test cases in TestHoodieAvroUtils.class

2020-05-20 Thread GitBox
codecov-commenter edited a comment on pull request #1538: URL: https://github.com/apache/incubator-hudi/pull/1538#issuecomment-631510907 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1538?src=pr=h1) Report > Merging

[GitHub] [incubator-hudi] hddong commented on a change in pull request #1558: [HUDI-796]: added deduping logic for upserts case

2020-05-20 Thread GitBox
hddong commented on a change in pull request #1558: URL: https://github.com/apache/incubator-hudi/pull/1558#discussion_r428056297 ## File path: hudi-cli/src/main/java/org/apache/hudi/cli/commands/SparkMain.java ## @@ -263,13 +265,26 @@ private static int

[GitHub] [incubator-hudi] codecov-commenter edited a comment on pull request #1433: [HUDI-728]: Implement custom key generator

2020-05-20 Thread GitBox
codecov-commenter edited a comment on pull request #1433: URL: https://github.com/apache/incubator-hudi/pull/1433#issuecomment-631535136 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1433?src=pr=h1) Report > Merging

[GitHub] [incubator-hudi] vinothchandar merged pull request #1640: [MINOR] Fix resource cleanup in TestTableSchemaEvolution

2020-05-20 Thread GitBox
vinothchandar merged pull request #1640: URL: https://github.com/apache/incubator-hudi/pull/1640 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

[GitHub] [incubator-hudi] codecov-commenter edited a comment on pull request #1647: [HUDI-867]: fixed IllegalArgumentException from graphite metrics in deltaStreamer continuous mode

2020-05-20 Thread GitBox
codecov-commenter edited a comment on pull request #1647: URL: https://github.com/apache/incubator-hudi/pull/1647#issuecomment-631511240 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1647?src=pr=h1) Report > Merging

[GitHub] [incubator-hudi] codecov-commenter commented on pull request #1433: [HUDI-728]: Implement custom key generator

2020-05-20 Thread GitBox
codecov-commenter commented on pull request #1433: URL: https://github.com/apache/incubator-hudi/pull/1433#issuecomment-631535136 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1433?src=pr=h1) Report > Merging

[GitHub] [incubator-hudi] vinothchandar edited a comment on pull request #1484: [HUDI-316] : Hbase qps repartition writestatus

2020-05-20 Thread GitBox
vinothchandar edited a comment on pull request #1484: URL: https://github.com/apache/incubator-hudi/pull/1484#issuecomment-631426244 @v3nkatesh allow me to give some context around why we are strict around guava.. Hudi as you know has to be dropped under many different services

[jira] [Commented] (HUDI-890) Prepare for 0.5.3 patch release

2020-05-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112120#comment-17112120 ] Vinoth Chandar commented on HUDI-890: - [~vbalaji] can make the call on HUDI-846 (I am okay turning them

[GitHub] [incubator-hudi] vinothchandar commented on pull request #1643: [HUDI-110] Spark Datasource Auto Partition Extractor

2020-05-20 Thread GitBox
vinothchandar commented on pull request #1643: URL: https://github.com/apache/incubator-hudi/pull/1643#issuecomment-631431498 IIRC we already support generating partition path in the `/partitionKey=partitionValue` folder strucutre.. Not sure what this PR is adding

[jira] [Commented] (HUDI-914) support different target data clusters

2020-05-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112126#comment-17112126 ] Vinoth Chandar commented on HUDI-914: - >Although specifying the namenode IP address of the target

[incubator-hudi] branch master updated: [HUDI-846][HUDI-848] Enable Incremental cleaning and embedded timeline-server by default (#1634)

2020-05-20 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git The following commit(s) were added to refs/heads/master by this push: new 74ecc27 [HUDI-846][HUDI-848] Enable

[jira] [Commented] (HUDI-890) Prepare for 0.5.3 patch release

2020-05-20 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112136#comment-17112136 ] sivabalan narayanan commented on HUDI-890: -- sure. Will wait to hear from [~vbalaji] > Prepare for

[jira] [Comment Edited] (HUDI-890) Prepare for 0.5.3 patch release

2020-05-20 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112136#comment-17112136 ] sivabalan narayanan edited comment on HUDI-890 at 5/20/20, 12:30 PM: -

[GitHub] [incubator-hudi] pratyakshsharma opened a new pull request #1648: [HUDI-916]: added support for multiple input formats in TimestampBasedKeyGenerator

2020-05-20 Thread GitBox
pratyakshsharma opened a new pull request #1648: URL: https://github.com/apache/incubator-hudi/pull/1648 …dKeyGenerator ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.*

[jira] [Updated] (HUDI-916) Add support for multiple date/time formats in TimestampBasedKeyGenerator

2020-05-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-916: Labels: pull-request-available (was: ) > Add support for multiple date/time formats in

[GitHub] [incubator-hudi] pratyakshsharma commented on pull request #1648: [HUDI-916]: added support for multiple input formats in TimestampBasedKeyGenerator

2020-05-20 Thread GitBox
pratyakshsharma commented on pull request #1648: URL: https://github.com/apache/incubator-hudi/pull/1648#issuecomment-631488125 @nsivabalan Raised a separate PR for https://github.com/apache/incubator-hudi/pull/1597. Please take a look.

[jira] [Updated] (HUDI-889) Writer supports useJdbc configuration when hive synchronization is enabled

2020-05-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-889: Fix Version/s: (was: 0.5.3) 0.6.0 > Writer supports useJdbc configuration

[GitHub] [incubator-hudi] vinothchandar merged pull request #1634: [HUDI-846][HUDI-848] Enable Incremental cleaning and embedded timeline-server by default

2020-05-20 Thread GitBox
vinothchandar merged pull request #1634: URL: https://github.com/apache/incubator-hudi/pull/1634 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

[jira] [Updated] (HUDI-889) Writer supports useJdbc configuration when hive synchronization is enabled

2020-05-20 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-889: Fix Version/s: 0.5.3 > Writer supports useJdbc configuration when hive synchronization is enabled >

[GitHub] [incubator-hudi] pratyakshsharma commented on pull request #1538: [HUDI-803]: added more test cases in TestHoodieAvroUtils.class

2020-05-20 Thread GitBox
pratyakshsharma commented on pull request #1538: URL: https://github.com/apache/incubator-hudi/pull/1538#issuecomment-631453314 > @pratyakshsharma could you rebase again and repush .. codecov seems to need that to work.. Ack.

[jira] [Resolved] (HUDI-889) Writer supports useJdbc configuration when hive synchronization is enabled

2020-05-20 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-889. -- Fix Version/s: (was: 0.6.0) 0.5.3 Resolution: Fixed

[GitHub] [incubator-hudi] codecov-commenter commented on pull request #1538: [HUDI-803]: added more test cases in TestHoodieAvroUtils.class

2020-05-20 Thread GitBox
codecov-commenter commented on pull request #1538: URL: https://github.com/apache/incubator-hudi/pull/1538#issuecomment-631510907 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1538?src=pr=h1) Report > Merging

[incubator-hudi] branch hudi_test_suite_refactor updated (681cce9 -> 6472886)

2020-05-20 Thread nagarwal
This is an automated email from the ASF dual-hosted git repository. nagarwal pushed a change to branch hudi_test_suite_refactor in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git. discard 681cce9 [HUDI-394] Provide a basic implementation of test suite add 6472886

[GitHub] [incubator-hudi] xushiyan commented on a change in pull request #1640: [MINOR] Fix resource cleanup in TestTableSchemaEvolution

2020-05-20 Thread GitBox
xushiyan commented on a change in pull request #1640: URL: https://github.com/apache/incubator-hudi/pull/1640#discussion_r428110614 ## File path: pom.xml ## @@ -245,7 +245,8 @@ ${maven-surefire-plugin.version} ${skipUTs} - -Xms256m -Xmx2g

[GitHub] [incubator-hudi] sathyaprakashg commented on issue #143: Tracking ticket for folks to be added to slack group

2020-05-20 Thread GitBox
sathyaprakashg commented on issue #143: URL: https://github.com/apache/incubator-hudi/issues/143#issuecomment-631715287 please add sathyapraka...@zillowgroup.com This is an automated message from the Apache Git Service. To

[incubator-hudi] branch hudi_test_suite_refactor updated (51048f6 -> 894ab75)

2020-05-20 Thread nagarwal
This is an automated email from the ASF dual-hosted git repository. nagarwal pushed a change to branch hudi_test_suite_refactor in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git. discard 51048f6 [HUDI-394] Provide a basic implementation of test suite add 894ab75

[GitHub] [incubator-hudi] garyli1019 commented on pull request #1643: [HUDI-110] Spark Datasource Auto Partition Extractor

2020-05-20 Thread GitBox
garyli1019 commented on pull request #1643: URL: https://github.com/apache/incubator-hudi/pull/1643#issuecomment-631709558 Thanks @bvaradar , so this is more on the writer side. I will take a closer look. This is an

[GitHub] [incubator-hudi] garyli1019 closed pull request #1643: [HUDI-110] Spark Datasource Auto Partition Extractor

2020-05-20 Thread GitBox
garyli1019 closed pull request #1643: URL: https://github.com/apache/incubator-hudi/pull/1643 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

[GitHub] [incubator-hudi] codecov-commenter edited a comment on pull request #1644: [HUDI-811] Restructure test packages in hudi-common

2020-05-20 Thread GitBox
codecov-commenter edited a comment on pull request #1644: URL: https://github.com/apache/incubator-hudi/pull/1644#issuecomment-631673778 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1644?src=pr=h1) Report > Merging

[GitHub] [incubator-hudi] codecov-commenter commented on pull request #1644: [HUDI-811] Restructure test packages in hudi-common

2020-05-20 Thread GitBox
codecov-commenter commented on pull request #1644: URL: https://github.com/apache/incubator-hudi/pull/1644#issuecomment-631673778 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1644?src=pr=h1) Report > Merging

[GitHub] [incubator-hudi] codecov-commenter edited a comment on pull request #1644: [HUDI-811] Restructure test packages in hudi-common

2020-05-20 Thread GitBox
codecov-commenter edited a comment on pull request #1644: URL: https://github.com/apache/incubator-hudi/pull/1644#issuecomment-631673778 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1644?src=pr=h1) Report > Merging

[GitHub] [incubator-hudi] codecov-commenter commented on pull request #1650: [HUDI-541]: replaced dataFile/df with baseFile/bf throughout code base

2020-05-20 Thread GitBox
codecov-commenter commented on pull request #1650: URL: https://github.com/apache/incubator-hudi/pull/1650#issuecomment-631782718 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1650?src=pr=h1) Report > Merging

[GitHub] [incubator-hudi] pratyakshsharma commented on pull request #1562: [HUDI-837]: implemented custom deserializer for AvroKafkaSource

2020-05-20 Thread GitBox
pratyakshsharma commented on pull request #1562: URL: https://github.com/apache/incubator-hudi/pull/1562#issuecomment-631763979 @n3nash got a chance to look at this? :) This is an automated message from the Apache Git

[GitHub] [incubator-hudi] codecov-commenter edited a comment on pull request #1650: [HUDI-541]: replaced dataFile/df with baseFile/bf throughout code base

2020-05-20 Thread GitBox
codecov-commenter edited a comment on pull request #1650: URL: https://github.com/apache/incubator-hudi/pull/1650#issuecomment-631782718 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1650?src=pr=h1) Report > Merging

[GitHub] [incubator-hudi] HariprasadAllaka1612 commented on issue #1641: [SUPPORT] Failed to merge old record into new file for key xxx from old file 123.parquet to new file 456.parquet

2020-05-20 Thread GitBox
HariprasadAllaka1612 commented on issue #1641: URL: https://github.com/apache/incubator-hudi/issues/1641#issuecomment-631674561 We can close this issue. This is a problem of having the parquet and hive table synced to parquet file having 2 different schemas. Its fixed by forcing the

[GitHub] [incubator-hudi] codecov-commenter edited a comment on pull request #1644: [HUDI-811] Restructure test packages in hudi-common

2020-05-20 Thread GitBox
codecov-commenter edited a comment on pull request #1644: URL: https://github.com/apache/incubator-hudi/pull/1644#issuecomment-631673778 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1644?src=pr=h1) Report > Merging

[incubator-hudi] branch hudi_test_suite_refactor updated (7781692 -> 51048f6)

2020-05-20 Thread nagarwal
This is an automated email from the ASF dual-hosted git repository. nagarwal pushed a change to branch hudi_test_suite_refactor in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git. discard 7781692 [HUDI-394] Provide a basic implementation of test suite add 3c9da2e

[jira] [Updated] (HUDI-541) Replace variables/comments named "data files" to "base file"

2020-05-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-541: Labels: pull-request-available (was: ) > Replace variables/comments named "data files" to "base

[GitHub] [incubator-hudi] pratyakshsharma opened a new pull request #1650: [HUDI-541]: replaced dataFile/df with baseFile/bf throughout code base

2020-05-20 Thread GitBox
pratyakshsharma opened a new pull request #1650: URL: https://github.com/apache/incubator-hudi/pull/1650 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.* ## What is the

[GitHub] [incubator-hudi] codecov-commenter edited a comment on pull request #1644: [HUDI-811] Restructure test packages in hudi-common

2020-05-20 Thread GitBox
codecov-commenter edited a comment on pull request #1644: URL: https://github.com/apache/incubator-hudi/pull/1644#issuecomment-631673778 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1644?src=pr=h1) Report > Merging

[GitHub] [incubator-hudi] xushiyan commented on pull request #1644: [HUDI-811] Restructure test packages in hudi-common

2020-05-20 Thread GitBox
xushiyan commented on pull request #1644: URL: https://github.com/apache/incubator-hudi/pull/1644#issuecomment-631820696 @yanghua The PR is ready for review. Thanks. This is an automated message from the Apache Git Service.

[GitHub] [incubator-hudi] xushiyan edited a comment on pull request #1644: [HUDI-811] Restructure test packages in hudi-common

2020-05-20 Thread GitBox
xushiyan edited a comment on pull request #1644: URL: https://github.com/apache/incubator-hudi/pull/1644#issuecomment-631577878 ![Screen Shot 2020-05-20 at 7 24 30 PM](https://user-images.githubusercontent.com/2701446/82516434-906a1300-9acf-11ea-8d40-03d21d4ccaf2.png) Test

[incubator-hudi] branch asf-site updated: Travis CI build asf-site

2020-05-20 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new d0c3b9f Travis CI build asf-site

[jira] [Updated] (HUDI-905) Support PrunedFilteredScan for Spark Datasource

2020-05-20 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-905: Priority: Minor (was: Major) > Support PrunedFilteredScan for Spark Datasource >

[jira] [Updated] (HUDI-905) Support PrunedFilteredScan for Spark Datasource

2020-05-20 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-905: Status: Open (was: New) > Support PrunedFilteredScan for Spark Datasource >

[jira] [Updated] (HUDI-905) Support PrunedFilteredScan for Spark Datasource

2020-05-20 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-905: Component/s: Spark Integration > Support PrunedFilteredScan for Spark Datasource >

[GitHub] [incubator-hudi] xushiyan edited a comment on pull request #1644: [HUDI-811] Restructure test packages in hudi-common

2020-05-20 Thread GitBox
xushiyan edited a comment on pull request #1644: URL: https://github.com/apache/incubator-hudi/pull/1644#issuecomment-631577878 ![Screen Shot 2020-05-20 at 8 57 05 AM](https://user-images.githubusercontent.com/2701446/82468795-15294280-9a78-11ea-909a-bf09da83d7a4.png) Test classes

[GitHub] [incubator-hudi] codecov-commenter edited a comment on pull request #1645: [HUDI-707]Add unit test for StatsCommand

2020-05-20 Thread GitBox
codecov-commenter edited a comment on pull request #1645: URL: https://github.com/apache/incubator-hudi/pull/1645#issuecomment-631841340 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1645?src=pr=h1) Report > Merging

[GitHub] [incubator-hudi] leesf commented on a change in pull request #1651: [MINOR] add impala release and spark partition discovery

2020-05-20 Thread GitBox
leesf commented on a change in pull request #1651: URL: https://github.com/apache/incubator-hudi/pull/1651#discussion_r428440522 ## File path: docs/_docs/1_1_quick_start_guide.md ## @@ -297,6 +298,7 @@ tripsSnapshotDF = spark. \ read. \ format("hudi"). \ load(basePath

[GitHub] [incubator-hudi] leesf commented on a change in pull request #1651: [MINOR] add impala release and spark partition discovery

2020-05-20 Thread GitBox
leesf commented on a change in pull request #1651: URL: https://github.com/apache/incubator-hudi/pull/1651#discussion_r428440522 ## File path: docs/_docs/1_1_quick_start_guide.md ## @@ -297,6 +298,7 @@ tripsSnapshotDF = spark. \ read. \ format("hudi"). \ load(basePath

Build failed in Jenkins: hudi-snapshot-deployment-0.5 #284

2020-05-20 Thread Apache Jenkins Server
See Changes: -- [...truncated 2.40 KB...] /home/jenkins/tools/maven/apache-maven-3.5.4/conf: logging settings.xml toolchains.xml

[GitHub] [incubator-hudi] garyli1019 opened a new pull request #1651: [MINOR] add impala release and spark partition discovery

2020-05-20 Thread GitBox
garyli1019 opened a new pull request #1651: URL: https://github.com/apache/incubator-hudi/pull/1651 Minor doc edit This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub

[GitHub] [incubator-hudi] garyli1019 commented on a change in pull request #1651: [MINOR] add impala release and spark partition discovery

2020-05-20 Thread GitBox
garyli1019 commented on a change in pull request #1651: URL: https://github.com/apache/incubator-hudi/pull/1651#discussion_r428441106 ## File path: docs/_docs/1_1_quick_start_guide.md ## @@ -297,6 +298,7 @@ tripsSnapshotDF = spark. \ read. \ format("hudi"). \

[GitHub] [incubator-hudi] codecov-commenter edited a comment on pull request #1644: [HUDI-811] Restructure test packages in hudi-common

2020-05-20 Thread GitBox
codecov-commenter edited a comment on pull request #1644: URL: https://github.com/apache/incubator-hudi/pull/1644#issuecomment-631673778 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1644?src=pr=h1) Report > Merging

[GitHub] [incubator-hudi] yanghua commented on pull request #1645: [HUDI-707]Add unit test for StatsCommand

2020-05-20 Thread GitBox
yanghua commented on pull request #1645: URL: https://github.com/apache/incubator-hudi/pull/1645#issuecomment-631836600 > you be interested in shepherding this PR when it is Yes, of course. This is an automated

[GitHub] [incubator-hudi] codecov-commenter commented on pull request #1645: [HUDI-707]Add unit test for StatsCommand

2020-05-20 Thread GitBox
codecov-commenter commented on pull request #1645: URL: https://github.com/apache/incubator-hudi/pull/1645#issuecomment-631841340 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1645?src=pr=h1) Report > Merging

[GitHub] [incubator-hudi] codecov-commenter edited a comment on pull request #1645: [HUDI-707]Add unit test for StatsCommand

2020-05-20 Thread GitBox
codecov-commenter edited a comment on pull request #1645: URL: https://github.com/apache/incubator-hudi/pull/1645#issuecomment-631841340 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1645?src=pr=h1) Report > Merging

[GitHub] [incubator-hudi] leesf merged pull request #1651: [MINOR] add impala release and spark partition discovery

2020-05-20 Thread GitBox
leesf merged pull request #1651: URL: https://github.com/apache/incubator-hudi/pull/1651 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[incubator-hudi] branch asf-site updated: [MINOR] add impala release and spark partition discovery (#1651)

2020-05-20 Thread leesf
This is an automated email from the ASF dual-hosted git repository. leesf pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new 349be47 [MINOR] add impala release

[jira] [Updated] (HUDI-905) Support PrunedFilteredScan for Spark Datasource

2020-05-20 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-905: Description: Hudi Spark Datasource incremental view currently is using 

[jira] [Commented] (HUDI-648) Implement error log/table for Datasource/DeltaStreamer/WriteClient/Compaction writes

2020-05-20 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17112712#comment-17112712 ] Raymond Xu commented on HUDI-648: - I guess RFC may not be necessary. Had a look into HoodieWriteClient and

[GitHub] [incubator-hudi] yanghua commented on pull request #1572: [HUDI-836] Implement datadog metrics reporter

2020-05-20 Thread GitBox
yanghua commented on pull request #1572: URL: https://github.com/apache/incubator-hudi/pull/1572#issuecomment-631840124 @vinothchandar If you are busy with other things. Can we merge it firstly? Then, iterate later. This is

[GitHub] [incubator-hudi] codecov-commenter edited a comment on pull request #1644: [HUDI-811] Restructure test packages in hudi-common

2020-05-20 Thread GitBox
codecov-commenter edited a comment on pull request #1644: URL: https://github.com/apache/incubator-hudi/pull/1644#issuecomment-631673778 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1644?src=pr=h1) Report > Merging

[GitHub] [incubator-hudi] yanghua commented on pull request #1644: [HUDI-811] Restructure test packages in hudi-common

2020-05-20 Thread GitBox
yanghua commented on pull request #1644: URL: https://github.com/apache/incubator-hudi/pull/1644#issuecomment-631836269 OK, will review later This is an automated message from the Apache Git Service. To respond to the

[jira] [Updated] (HUDI-905) Support PrunedFilteredScan for Spark Datasource

2020-05-20 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-905: Summary: Support PrunedFilteredScan for Spark Datasource (was: Support native filter pushdown for

  1   2   >