[GitHub] [hudi] vinothchandar commented on issue #1705: Tracking Hudi Data along transaction time and buisness time

2020-06-06 Thread GitBox
vinothchandar commented on issue #1705: URL: https://github.com/apache/hudi/issues/1705#issuecomment-640158649 That’s a larger book :).. can you please each explain your use case in more detail This is an automated message

[jira] [Updated] (HUDI-974) Fields out of order in MOR mode when using Hive

2020-06-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-974: Labels: pull-request-available (was: ) > Fields out of order in MOR mode when using Hive >

[GitHub] [hudi] lw309637554 opened a new pull request #1711: [HUDI-974] fix fields out of order in MOR mode when using Hive

2020-06-06 Thread GitBox
lw309637554 opened a new pull request #1711: URL: https://github.com/apache/hudi/pull/1711 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.* ## What is the purpose of

Build failed in Jenkins: hudi-snapshot-deployment-0.5 #301

2020-06-06 Thread Apache Jenkins Server
See Changes: -- [...truncated 2.49 KB...] settings.xml toolchains.xml /home/jenkins/tools/maven/apache-maven-3.5.4/conf/logging: simplelogger.properties

[GitHub] [hudi] xushiyan commented on a change in pull request #1698: [HUDI-986] Support staging site for per pull request

2020-06-06 Thread GitBox
xushiyan commented on a change in pull request #1698: URL: https://github.com/apache/hudi/pull/1698#discussion_r436319767 ## File path: docs/_pages/contributing.cn.md ## @@ -25,7 +25,7 @@ To contribute code, you need To contribute, you would need to do the following - -

[GitHub] [hudi] leesf edited a comment on pull request #1647: [HUDI-867]: fixed IllegalArgumentException from graphite metrics in deltaStreamer continuous mode

2020-06-06 Thread GitBox
leesf edited a comment on pull request #1647: URL: https://github.com/apache/hudi/pull/1647#issuecomment-640034374 > @pratyakshsharma IIUC, this will introduce infinite number of metrics being sent to monitoring system in theory when it's set to continuous mode? Normally for a Hudi table,

[GitHub] [hudi] xushiyan commented on pull request #1647: [HUDI-867]: fixed IllegalArgumentException from graphite metrics in deltaStreamer continuous mode

2020-06-06 Thread GitBox
xushiyan commented on pull request #1647: URL: https://github.com/apache/hudi/pull/1647#issuecomment-640146574 @leesf There is some markdown format issue with your typings...not sure what is suggested. Just to clarify from user perspective. Say a user runs a delta streamer, he

[GitHub] [hudi] xushiyan commented on a change in pull request #1710: [MINOR] Fix delta streamer write config

2020-06-06 Thread GitBox
xushiyan commented on a change in pull request #1710: URL: https://github.com/apache/hudi/pull/1710#discussion_r436316239 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/deltastreamer/DeltaSync.java ## @@ -497,25 +499,30 @@ private void setupWriteClient()

[GitHub] [hudi] xushiyan opened a new pull request #1710: [MINOR] Fix delta streamer write config

2020-06-06 Thread GitBox
xushiyan opened a new pull request #1710: URL: https://github.com/apache/hudi/pull/1710 - avoid overwrite index type - add more validation error message - make write config constants public This change added tests and can be verified as follows: - [ ] Test by running

[GitHub] [hudi] shenh062326 commented on a change in pull request #1690: [HUDI-908] Add decimals to HoodieTestDataGenerator

2020-06-06 Thread GitBox
shenh062326 commented on a change in pull request #1690: URL: https://github.com/apache/hudi/pull/1690#discussion_r436313137 ## File path: hudi-common/src/main/java/org/apache/hudi/avro/MercifulJsonConverter.java ## @@ -245,10 +245,14 @@ private static

[GitHub] [hudi] shenh062326 commented on a change in pull request #1690: [HUDI-908] Add decimals to HoodieTestDataGenerator

2020-06-06 Thread GitBox
shenh062326 commented on a change in pull request #1690: URL: https://github.com/apache/hudi/pull/1690#discussion_r436313087 ## File path: hudi-common/src/main/java/org/apache/hudi/avro/MercifulJsonConverter.java ## @@ -245,10 +245,14 @@ private static

[GitHub] [hudi] shenh062326 commented on a change in pull request #1690: [HUDI-908] Add decimals to HoodieTestDataGenerator

2020-06-06 Thread GitBox
shenh062326 commented on a change in pull request #1690: URL: https://github.com/apache/hudi/pull/1690#discussion_r436311736 ## File path: hudi-common/src/main/java/org/apache/hudi/avro/MercifulJsonConverter.java ## @@ -245,10 +245,14 @@ private static

[GitHub] [hudi] RocMarshal commented on issue #143: Tracking ticket for folks to be added to slack group

2020-06-06 Thread GitBox
RocMarshal commented on issue #143: URL: https://github.com/apache/hudi/issues/143#issuecomment-640085114 Could you add me to the slack channel? flin...@126.com Thank you. This is an automated message from the Apache Git

[jira] [Commented] (HUDI-944) Support more complete concurrency control when writing data

2020-06-06 Thread liwei (Jira)
[ https://issues.apache.org/jira/browse/HUDI-944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17127395#comment-17127395 ] liwei commented on HUDI-944: Thanks so much [~vinoth] I am so agree with you. First , I also think (a)  is

[jira] [Updated] (HUDI-998) Introduce a robot to build testing website automatically

2020-06-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-998: Labels: pull-request-available (was: ) > Introduce a robot to build testing website automatically >

[jira] [Updated] (HUDI-1000) incremental query for COW non-partitioned table no data

2020-06-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-1000: - Labels: pull-request-available (was: ) > incremental query for COW non-partitioned table no data

[jira] [Created] (HUDI-1003) Handle partitions when sync non-partitioned table to hive.

2020-06-06 Thread leesf (Jira)
leesf created HUDI-1003: --- Summary: Handle partitions when sync non-partitioned table to hive. Key: HUDI-1003 URL: https://issues.apache.org/jira/browse/HUDI-1003 Project: Apache Hudi Issue Type: Bug

[jira] [Updated] (HUDI-1003) Handle partitions correctly when sync non-partitioned table to hive.

2020-06-06 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] leesf updated HUDI-1003: Summary: Handle partitions correctly when sync non-partitioned table to hive. (was: Handle partitions when sync

[jira] [Assigned] (HUDI-1001) Add implementation to translate source partition paths when doing metadata bootstrap

2020-06-06 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan reassigned HUDI-1001: Assignee: Balaji Varadarajan > Add implementation to translate source partition

[jira] [Updated] (HUDI-999) Parallelize listing of Source dataset partitions

2020-06-06 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-999: Status: Open (was: New) > Parallelize listing of Source dataset partitions >

[jira] [Created] (HUDI-1001) Add implementation to translate source partition paths when doing metadata bootstrap

2020-06-06 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-1001: Summary: Add implementation to translate source partition paths when doing metadata bootstrap Key: HUDI-1001 URL: https://issues.apache.org/jira/browse/HUDI-1001

[jira] [Updated] (HUDI-1001) Add implementation to translate source partition paths when doing metadata bootstrap

2020-06-06 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-1001: - Status: Open (was: New) > Add implementation to translate source partition paths when

[jira] [Updated] (HUDI-289) Implement a test suite to support long running test for Hudi writing and querying end-end

2020-06-06 Thread jing (Jira)
[ https://issues.apache.org/jira/browse/HUDI-289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jing updated HUDI-289: -- Issue Type: Test (was: Bug) > Implement a test suite to support long running test for Hudi writing and > querying

[jira] [Created] (HUDI-1002) Ignore case when setting incremental mode in hive query

2020-06-06 Thread leesf (Jira)
leesf created HUDI-1002: --- Summary: Ignore case when setting incremental mode in hive query Key: HUDI-1002 URL: https://issues.apache.org/jira/browse/HUDI-1002 Project: Apache Hudi Issue Type: Bug

[jira] [Created] (HUDI-1000) incremental query for COW non-partitioned table no data

2020-06-06 Thread jing (Jira)
jing created HUDI-1000: -- Summary: incremental query for COW non-partitioned table no data Key: HUDI-1000 URL: https://issues.apache.org/jira/browse/HUDI-1000 Project: Apache Hudi Issue Type: Bug

[jira] [Updated] (HUDI-289) Implement a test suite to support long running test for Hudi writing and querying end-end

2020-06-06 Thread jing (Jira)
[ https://issues.apache.org/jira/browse/HUDI-289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jing updated HUDI-289: -- Issue Type: Bug (was: Test) > Implement a test suite to support long running test for Hudi writing and > querying

[jira] [Resolved] (HUDI-990) Timeline API : filterCompletedAndCompactionInstants needs to handle requested state correctly

2020-06-06 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] leesf resolved HUDI-990. Resolution: Fixed Fixed via master: fb283934a33a0bc7b11f80e4149f7922fa4f0af5 > Timeline API :

[jira] [Closed] (HUDI-990) Timeline API : filterCompletedAndCompactionInstants needs to handle requested state correctly

2020-06-06 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] leesf closed HUDI-990. -- > Timeline API : filterCompletedAndCompactionInstants needs to handle requested > state correctly >

[jira] [Updated] (HUDI-1002) Ignore case when setting incremental mode in hive query

2020-06-06 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] leesf updated HUDI-1002: Issue Type: Improvement (was: Bug) > Ignore case when setting incremental mode in hive query >

[jira] [Resolved] (HUDI-988) Fix unit test flakiness in Hudi

2020-06-06 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] leesf resolved HUDI-988. Fix Version/s: 0.6.0 Resolution: Fixed Fixed via master: a68180b179ae57f16ff0a8d74b72b43b501d36c6 > Fix unit

[jira] [Closed] (HUDI-988) Fix unit test flakiness in Hudi

2020-06-06 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] leesf closed HUDI-988. -- > Fix unit test flakiness in Hudi > --- > > Key: HUDI-988 > URL:

[jira] [Resolved] (HUDI-975) Add unit tests in TestHoodieTableFileSystemView to test view for non-partitioned table

2020-06-06 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] leesf resolved HUDI-975. Resolution: Fixed Fxied via master: 7c59095314f5525590eea308084079a31bde3e17 > Add unit tests in

[jira] [Updated] (HUDI-934) Hive query does not work with realtime table which contain decimal type

2020-06-06 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] leesf updated HUDI-934: --- Fix Version/s: 0.6.0 > Hive query does not work with realtime table which contain decimal type >

[jira] [Closed] (HUDI-975) Add unit tests in TestHoodieTableFileSystemView to test view for non-partitioned table

2020-06-06 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] leesf closed HUDI-975. -- > Add unit tests in TestHoodieTableFileSystemView to test view for > non-partitioned table >

[jira] [Commented] (HUDI-1000) incremental query for COW non-partitioned table no data

2020-06-06 Thread jing (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17127268#comment-17127268 ] jing commented on HUDI-1000: pr :[https://github.com/apache/hudi/pull/1708]   Please refer to the attachment

[jira] [Updated] (HUDI-1000) incremental query for COW non-partitioned table no data

2020-06-06 Thread jing (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jing updated HUDI-1000: --- Attachment: 修复后的查询结果.png > incremental query for COW non-partitioned table no data >

[jira] [Updated] (HUDI-992) For hive-style partitioned source data, partition columns synced with Hive will always have String type

2020-06-06 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-992: Status: Open (was: New) > For hive-style partitioned source data, partition columns synced

[jira] [Commented] (HUDI-999) Parallelize listing of Source dataset partitions

2020-06-06 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17127229#comment-17127229 ] Balaji Varadarajan commented on HUDI-999: - cc [~uditme] > Parallelize listing of Source dataset

[jira] [Created] (HUDI-999) Parallelize listing of Source dataset partitions

2020-06-06 Thread Balaji Varadarajan (Jira)
Balaji Varadarajan created HUDI-999: --- Summary: Parallelize listing of Source dataset partitions Key: HUDI-999 URL: https://issues.apache.org/jira/browse/HUDI-999 Project: Apache Hudi

[jira] [Assigned] (HUDI-999) Parallelize listing of Source dataset partitions

2020-06-06 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan reassigned HUDI-999: --- Assignee: Balaji Varadarajan > Parallelize listing of Source dataset partitions >

[GitHub] [hudi] codecov-commenter edited a comment on pull request #1708: [HUDI-1000] Fix incremental query for COW non-partitioned table with no data

2020-06-06 Thread GitBox
codecov-commenter edited a comment on pull request #1708: URL: https://github.com/apache/hudi/pull/1708#issuecomment-640051563 # [Codecov](https://codecov.io/gh/apache/hudi/pull/1708?src=pr=h1) Report > Merging [#1708](https://codecov.io/gh/apache/hudi/pull/1708?src=pr=desc) into

[GitHub] [hudi] codecov-commenter commented on pull request #1708: [hudi-1000] fix incremental query for COW non-partitioned table no data

2020-06-06 Thread GitBox
codecov-commenter commented on pull request #1708: URL: https://github.com/apache/hudi/pull/1708#issuecomment-640051563 # [Codecov](https://codecov.io/gh/apache/hudi/pull/1708?src=pr=h1) Report > Merging [#1708](https://codecov.io/gh/apache/hudi/pull/1708?src=pr=desc) into

[GitHub] [hudi] bvaradar commented on issue #1679: How to disable Hive JDBC and enable metastore

2020-06-06 Thread GitBox
bvaradar commented on issue #1679: URL: https://github.com/apache/hudi/issues/1679#issuecomment-640034575 @selvarajperiyasamy : Hope you were able to resolve the issue. Let us know if any help is needed. This is an

[GitHub] [hudi] leesf commented on pull request #1647: [HUDI-867]: fixed IllegalArgumentException from graphite metrics in deltaStreamer continuous mode

2020-06-06 Thread GitBox
leesf commented on pull request #1647: URL: https://github.com/apache/hudi/pull/1647#issuecomment-640034374 > @pratyakshsharma IIUC, this will introduce infinite number of metrics being sent to monitoring system in theory when it's set to continuous mode? Normally for a Hudi table, we'd

[GitHub] [hudi] bvaradar commented on issue #1709: [SUPPORT]

2020-06-06 Thread GitBox
bvaradar commented on issue #1709: URL: https://github.com/apache/hudi/issues/1709#issuecomment-640034197 @DragonPrince1992 : I dont see any details on the issue. Please add details about any issue you are facing with Hudi.

[GitHub] [hudi] bvaradar closed issue #1696: COW Error on existing Hive Table

2020-06-06 Thread GitBox
bvaradar closed issue #1696: URL: https://github.com/apache/hudi/issues/1696 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [hudi] bvaradar commented on a change in pull request #1690: [HUDI-908] Add decimals to HoodieTestDataGenerator

2020-06-06 Thread GitBox
bvaradar commented on a change in pull request #1690: URL: https://github.com/apache/hudi/pull/1690#discussion_r436257453 ## File path: hudi-common/src/main/java/org/apache/hudi/avro/MercifulJsonConverter.java ## @@ -245,10 +245,14 @@ private static JsonToAvroFieldProcessor

[GitHub] [hudi] DragonPrince1992 opened a new issue #1709: [SUPPORT]

2020-06-06 Thread GitBox
DragonPrince1992 opened a new issue #1709: URL: https://github.com/apache/hudi/issues/1709 **_Tips before filing an issue_** - Have you gone through our [FAQs](https://cwiki.apache.org/confluence/display/HUDI/FAQ)? - Join the mailing list to engage in conversations and get

[GitHub] [hudi] leesf commented on a change in pull request #1708: [hudi-1000] fix incremental query for COW non-partitioned table no data

2020-06-06 Thread GitBox
leesf commented on a change in pull request #1708: URL: https://github.com/apache/hudi/pull/1708#discussion_r436255512 ## File path: hudi-hadoop-mr/src/main/java/org/apache/hudi/hadoop/HoodieParquetInputFormat.java ## @@ -183,7 +183,7 @@ protected HoodieDefaultTimeline

[GitHub] [hudi] bvaradar commented on pull request #1678: [HUDI-242] Metadata Bootstrap changes

2020-06-06 Thread GitBox
bvaradar commented on pull request #1678: URL: https://github.com/apache/hudi/pull/1678#issuecomment-640020465 I will look at any new test failures if it happens. It will likely be of log-limit or other CI specific issues.

[GitHub] [hudi] bvaradar commented on pull request #1678: [HUDI-242] Metadata Bootstrap changes

2020-06-06 Thread GitBox
bvaradar commented on pull request #1678: URL: https://github.com/apache/hudi/pull/1678#issuecomment-640020195 @vinothchandar : Added comments in the PR. Please take a look when you get a chance. cc @umehrot2

[GitHub] [hudi] bvaradar commented on a change in pull request #1678: [WIP] [HUDI-242] Metadata Bootstrap changes

2020-06-06 Thread GitBox
bvaradar commented on a change in pull request #1678: URL: https://github.com/apache/hudi/pull/1678#discussion_r435985550 ## File path: hudi-client/src/main/java/org/apache/hudi/client/bootstrap/selector/BootstrapRegexModeSelector.java ## @@ -0,0 +1,51 @@ +/* + * Licensed to