[jira] [Updated] (HUDI-732) Generate site to content folder

2020-03-23 Thread lamber-ken (Jira)
[ https://issues.apache.org/jira/browse/HUDI-732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lamber-ken updated HUDI-732: Status: Open (was: New) > Generate site to content folder > --- > >

[jira] [Created] (HUDI-732) Generate site to content folder

2020-03-23 Thread lamber-ken (Jira)
lamber-ken created HUDI-732: --- Summary: Generate site to content folder Key: HUDI-732 URL: https://issues.apache.org/jira/browse/HUDI-732 Project: Apache Hudi (incubating) Issue Type: Task

[jira] [Assigned] (HUDI-732) Generate site to content folder

2020-03-23 Thread lamber-ken (Jira)
[ https://issues.apache.org/jira/browse/HUDI-732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lamber-ken reassigned HUDI-732: --- Assignee: lamber-ken > Generate site to content folder > --- > >

[jira] [Comment Edited] (HUDI-686) Implement BloomIndexV2 that does not depend on memory caching

2020-03-23 Thread lamber-ken (Jira)
[ https://issues.apache.org/jira/browse/HUDI-686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17065030#comment-17065030 ] lamber-ken edited comment on HUDI-686 at 3/24/20, 5:41 AM: --- right, this is a nice

[jira] [Closed] (HUDI-504) Restructuring and auto-generation of docs

2020-03-23 Thread lamber-ken (Jira)
[ https://issues.apache.org/jira/browse/HUDI-504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lamber-ken closed HUDI-504. --- Resolution: Fixed > Restructuring and auto-generation of docs > - > >

[jira] [Closed] (HUDI-646) Re-enable TestUpdateSchemaEvolution after triaging weird CI issue

2020-03-23 Thread lamber-ken (Jira)
[ https://issues.apache.org/jira/browse/HUDI-646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lamber-ken closed HUDI-646. --- Resolution: Fixed > Re-enable TestUpdateSchemaEvolution after triaging weird CI issue >

[GitHub] [incubator-hudi] codecov-io edited a comment on issue #1440: [HUDI-731] Add ChainedTransformer

2020-03-23 Thread GitBox
codecov-io edited a comment on issue #1440: [HUDI-731] Add ChainedTransformer URL: https://github.com/apache/incubator-hudi/pull/1440#issuecomment-602962209 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1440?src=pr=h1) Report > Merging

[incubator-hudi] branch asf-site updated: [HUDI-504] Restructuring and auto-generation of docs (#1412)

2020-03-23 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new 9c80aa5 [HUDI-504] Restructuring

[GitHub] [incubator-hudi] xushiyan commented on issue #1440: [HUDI-731] Add ChainedTransformer

2020-03-23 Thread GitBox
xushiyan commented on issue #1440: [HUDI-731] Add ChainedTransformer URL: https://github.com/apache/incubator-hudi/pull/1440#issuecomment-603012975 @vinothchandar The current approach is also compatible with existing CLI usage. The user extends this abstract class and plugs that custom

[GitHub] [incubator-hudi] vinothchandar merged pull request #1412: [HUDI-504] Restructuring and auto-generation of docs

2020-03-23 Thread GitBox
vinothchandar merged pull request #1412: [HUDI-504] Restructuring and auto-generation of docs URL: https://github.com/apache/incubator-hudi/pull/1412 This is an automated message from the Apache Git Service. To respond to

[GitHub] [incubator-hudi] vinothchandar commented on issue #1438: How to get the file name corresponding to HoodieKey through the GlobalBloomIndex

2020-03-23 Thread GitBox
vinothchandar commented on issue #1438: How to get the file name corresponding to HoodieKey through the GlobalBloomIndex URL: https://github.com/apache/incubator-hudi/issues/1438#issuecomment-603008577 That does not count right.. maybe there is a gap here w.r.t. Global index?

[GitHub] [incubator-hudi] xushiyan commented on a change in pull request #1440: [HUDI-731] Add ChainedTransformer

2020-03-23 Thread GitBox
xushiyan commented on a change in pull request #1440: [HUDI-731] Add ChainedTransformer URL: https://github.com/apache/incubator-hudi/pull/1440#discussion_r396894929 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/transform/ChainedTransformer.java ##

[GitHub] [incubator-hudi] xushiyan commented on a change in pull request #1440: [HUDI-731] Add ChainedTransformer

2020-03-23 Thread GitBox
xushiyan commented on a change in pull request #1440: [HUDI-731] Add ChainedTransformer URL: https://github.com/apache/incubator-hudi/pull/1440#discussion_r396894929 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/transform/ChainedTransformer.java ##

Build failed in Jenkins: hudi-snapshot-deployment-0.5 #226

2020-03-23 Thread Apache Jenkins Server
See Changes: -- [...truncated 2.36 KB...] /home/jenkins/tools/maven/apache-maven-3.5.4/conf: logging settings.xml toolchains.xml

[GitHub] [incubator-hudi] yanghua commented on a change in pull request #1440: [HUDI-731] Add ChainedTransformer

2020-03-23 Thread GitBox
yanghua commented on a change in pull request #1440: [HUDI-731] Add ChainedTransformer URL: https://github.com/apache/incubator-hudi/pull/1440#discussion_r396879048 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/transform/ChainedTransformer.java ##

[GitHub] [incubator-hudi] umehrot2 commented on a change in pull request #1427: [HUDI-727]: Copy default values of fields if not present when rewriting incoming record with new schema

2020-03-23 Thread GitBox
umehrot2 commented on a change in pull request #1427: [HUDI-727]: Copy default values of fields if not present when rewriting incoming record with new schema URL: https://github.com/apache/incubator-hudi/pull/1427#discussion_r396877351 ## File path:

[GitHub] [incubator-hudi] umehrot2 commented on a change in pull request #1427: [HUDI-727]: Copy default values of fields if not present when rewriting incoming record with new schema

2020-03-23 Thread GitBox
umehrot2 commented on a change in pull request #1427: [HUDI-727]: Copy default values of fields if not present when rewriting incoming record with new schema URL: https://github.com/apache/incubator-hudi/pull/1427#discussion_r396877767 ## File path:

[GitHub] [incubator-hudi] xushiyan commented on a change in pull request #1440: [HUDI-731] Add ChainedTransformer

2020-03-23 Thread GitBox
xushiyan commented on a change in pull request #1440: [HUDI-731] Add ChainedTransformer URL: https://github.com/apache/incubator-hudi/pull/1440#discussion_r396873427 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/transform/ChainedTransformer.java ##

[GitHub] [incubator-hudi] xushiyan commented on a change in pull request #1440: [HUDI-731] Add ChainedTransformer

2020-03-23 Thread GitBox
xushiyan commented on a change in pull request #1440: [HUDI-731] Add ChainedTransformer URL: https://github.com/apache/incubator-hudi/pull/1440#discussion_r396873391 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/transform/ChainedTransformer.java ##

[GitHub] [incubator-hudi] yanghua commented on a change in pull request #1440: [HUDI-731] Add ChainedTransformer

2020-03-23 Thread GitBox
yanghua commented on a change in pull request #1440: [HUDI-731] Add ChainedTransformer URL: https://github.com/apache/incubator-hudi/pull/1440#discussion_r396869571 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/transform/ChainedTransformer.java ##

[GitHub] [incubator-hudi] yanghua commented on a change in pull request #1440: [HUDI-731] Add ChainedTransformer

2020-03-23 Thread GitBox
yanghua commented on a change in pull request #1440: [HUDI-731] Add ChainedTransformer URL: https://github.com/apache/incubator-hudi/pull/1440#discussion_r396869939 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/transform/ChainedTransformer.java ##

[GitHub] [incubator-hudi] yanghua commented on a change in pull request #1440: [HUDI-731] Add ChainedTransformer

2020-03-23 Thread GitBox
yanghua commented on a change in pull request #1440: [HUDI-731] Add ChainedTransformer URL: https://github.com/apache/incubator-hudi/pull/1440#discussion_r396870236 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/transform/ChainedTransformer.java ##

[GitHub] [incubator-hudi] codecov-io edited a comment on issue #1440: [HUDI-731] Add ChainedTransformer

2020-03-23 Thread GitBox
codecov-io edited a comment on issue #1440: [HUDI-731] Add ChainedTransformer URL: https://github.com/apache/incubator-hudi/pull/1440#issuecomment-602962209 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1440?src=pr=h1) Report > Merging

[GitHub] [incubator-hudi] codecov-io commented on issue #1440: [HUDI-731] Add ChainedTransformer

2020-03-23 Thread GitBox
codecov-io commented on issue #1440: [HUDI-731] Add ChainedTransformer URL: https://github.com/apache/incubator-hudi/pull/1440#issuecomment-602962209 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1440?src=pr=h1) Report > Merging

[jira] [Updated] (HUDI-731) Implement a chained transformer for deltastreamer that can chain other transformer implementations

2020-03-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-731: Labels: pull-request-available (was: ) > Implement a chained transformer for deltastreamer that can

[GitHub] [incubator-hudi] xushiyan opened a new pull request #1440: [HUDI-731] Add ChainedTransformer

2020-03-23 Thread GitBox
xushiyan opened a new pull request #1440: [HUDI-731] Add ChainedTransformer URL: https://github.com/apache/incubator-hudi/pull/1440 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull

[jira] [Assigned] (HUDI-731) Implement a chained transformer for deltastreamer that can chain other transformer implementations

2020-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-731: --- Assignee: Raymond Xu > Implement a chained transformer for deltastreamer that can chain other >

[jira] [Updated] (HUDI-731) Implement a chained transformer for deltastreamer that can chain other transformer implementations

2020-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-731: Status: In Progress (was: Open) > Implement a chained transformer for deltastreamer that can chain other >

[jira] [Updated] (HUDI-731) Implement a chained transformer for deltastreamer that can chain other transformer implementations

2020-03-23 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-731: Status: Open (was: New) > Implement a chained transformer for deltastreamer that can chain other >

[GitHub] [incubator-hudi] umehrot2 commented on issue #1406: [HUDI-713] Fix conversion of Spark array of struct type to Avro schema

2020-03-23 Thread GitBox
umehrot2 commented on issue #1406: [HUDI-713] Fix conversion of Spark array of struct type to Avro schema URL: https://github.com/apache/incubator-hudi/pull/1406#issuecomment-602933007 > > > So anyone who has written data using databricks-avro will face issues reading. > > By this

[jira] [Created] (HUDI-731) Implement a chained transformer for deltastreamer that can chain other transformer implementations

2020-03-23 Thread Vinoth Chandar (Jira)
Vinoth Chandar created HUDI-731: --- Summary: Implement a chained transformer for deltastreamer that can chain other transformer implementations Key: HUDI-731 URL: https://issues.apache.org/jira/browse/HUDI-731

[GitHub] [incubator-hudi] vinothchandar commented on issue #1406: [HUDI-713] Fix conversion of Spark array of struct type to Avro schema

2020-03-23 Thread GitBox
vinothchandar commented on issue #1406: [HUDI-713] Fix conversion of Spark array of struct type to Avro schema URL: https://github.com/apache/incubator-hudi/pull/1406#issuecomment-602881385 >>So anyone who has written data using databricks-avro will face issues reading. By this you

[GitHub] [incubator-hudi] vinothchandar edited a comment on issue #1406: [HUDI-713] Fix conversion of Spark array of struct type to Avro schema

2020-03-23 Thread GitBox
vinothchandar edited a comment on issue #1406: [HUDI-713] Fix conversion of Spark array of struct type to Avro schema URL: https://github.com/apache/incubator-hudi/pull/1406#issuecomment-602881385 >>So anyone who has written data using databricks-avro will face issues reading. By

[GitHub] [incubator-hudi] umehrot2 edited a comment on issue #1427: [HUDI-727]: Copy default values of fields if not present when rewriting incoming record with new schema

2020-03-23 Thread GitBox
umehrot2 edited a comment on issue #1427: [HUDI-727]: Copy default values of fields if not present when rewriting incoming record with new schema URL: https://github.com/apache/incubator-hudi/pull/1427#issuecomment-602847315 > @umehrot2 could you please help review? Will take a

[GitHub] [incubator-hudi] umehrot2 commented on issue #1427: [HUDI-727]: Copy default values of fields if not present when rewriting incoming record with new schema

2020-03-23 Thread GitBox
umehrot2 commented on issue #1427: [HUDI-727]: Copy default values of fields if not present when rewriting incoming record with new schema URL: https://github.com/apache/incubator-hudi/pull/1427#issuecomment-602847315 > @umehrot2 could you please help review? > @umehrot2

[GitHub] [incubator-hudi] umehrot2 edited a comment on issue #1406: [HUDI-713] Fix conversion of Spark array of struct type to Avro schema

2020-03-23 Thread GitBox
umehrot2 edited a comment on issue #1406: [HUDI-713] Fix conversion of Spark array of struct type to Avro schema URL: https://github.com/apache/incubator-hudi/pull/1406#issuecomment-602846762 > LGTM overall.. > > @umehrot2 @zhedoubushishi generally speaking, this schema namespace

[GitHub] [incubator-hudi] umehrot2 commented on issue #1406: [HUDI-713] Fix conversion of Spark array of struct type to Avro schema

2020-03-23 Thread GitBox
umehrot2 commented on issue #1406: [HUDI-713] Fix conversion of Spark array of struct type to Avro schema URL: https://github.com/apache/incubator-hudi/pull/1406#issuecomment-602846762 > LGTM overall.. > > @umehrot2 @zhedoubushishi generally speaking, this schema namespace

[GitHub] [incubator-hudi] lamber-ken commented on issue #1439: Hudi class loading problem

2020-03-23 Thread GitBox
lamber-ken commented on issue #1439: Hudi class loading problem URL: https://github.com/apache/incubator-hudi/issues/1439#issuecomment-602812140 I'm not familiar with Apache Tez, but from the above stack trace, Tez works on a YARN cluster, so I think it may work if we place

[jira] [Commented] (HUDI-686) Implement BloomIndexV2 that does not depend on memory caching

2020-03-23 Thread lamber-ken (Jira)
[ https://issues.apache.org/jira/browse/HUDI-686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17065030#comment-17065030 ] lamber-ken commented on HUDI-686: - right, this is a nice design, some thoughts: * if the input data is

[GitHub] [incubator-hudi] lamber-ken commented on a change in pull request #1412: [HUDI-504] Restructuring and auto-generation of docs

2020-03-23 Thread GitBox
lamber-ken commented on a change in pull request #1412: [HUDI-504] Restructuring and auto-generation of docs URL: https://github.com/apache/incubator-hudi/pull/1412#discussion_r396655762 ## File path: .travis.yml ## @@ -31,7 +31,7 @@ after_success: - echo

[GitHub] [incubator-hudi] lamber-ken commented on a change in pull request #1412: [HUDI-504] Restructuring and auto-generation of docs

2020-03-23 Thread GitBox
lamber-ken commented on a change in pull request #1412: [HUDI-504] Restructuring and auto-generation of docs URL: https://github.com/apache/incubator-hudi/pull/1412#discussion_r396657847 ## File path: .travis.yml ## @@ -31,7 +31,7 @@ after_success: - echo

[jira] [Commented] (HUDI-724) Parallelize GetSmallFiles For Partitions

2020-03-23 Thread Feichi Feng (Jira)
[ https://issues.apache.org/jira/browse/HUDI-724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17064994#comment-17064994 ] Feichi Feng commented on HUDI-724: -- Hi [~vbalaji], is there anything else I need to address for the PR? 

[GitHub] [incubator-hudi] satishkotha commented on issue #1396: [HUDI-687] Stop incremental reader on RO table before a pending compaction

2020-03-23 Thread GitBox
satishkotha commented on issue #1396: [HUDI-687] Stop incremental reader on RO table before a pending compaction URL: https://github.com/apache/incubator-hudi/pull/1396#issuecomment-602769754 @vinothchandar could you take another look at this one?

[GitHub] [incubator-hudi] lamber-ken commented on a change in pull request #1412: [HUDI-504] Restructuring and auto-generation of docs

2020-03-23 Thread GitBox
lamber-ken commented on a change in pull request #1412: [HUDI-504] Restructuring and auto-generation of docs URL: https://github.com/apache/incubator-hudi/pull/1412#discussion_r396655762 ## File path: .travis.yml ## @@ -31,7 +31,7 @@ after_success: - echo

[GitHub] [incubator-hudi] nsivabalan commented on a change in pull request #1402: [WIP][HUDI-407] Adding Simple Index

2020-03-23 Thread GitBox
nsivabalan commented on a change in pull request #1402: [WIP][HUDI-407] Adding Simple Index URL: https://github.com/apache/incubator-hudi/pull/1402#discussion_r396570732 ## File path: hudi-client/src/main/java/org/apache/hudi/index/bloom/HoodieSimpleIndex.java ## @@ -0,0

[GitHub] [incubator-hudi] melkimohamed opened a new issue #1439: [SUPPORT] Hudi class loading problem

2020-03-23 Thread GitBox
melkimohamed opened a new issue #1439: [SUPPORT] Hudi class loading problem URL: https://github.com/apache/incubator-hudi/issues/1439 **Describe the problem you faced** I tested Hudi and everything works fine except the count requests. The only problem when I do a count (select count

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1412: [HUDI-504] Restructuring and auto-generation of docs

2020-03-23 Thread GitBox
vinothchandar commented on a change in pull request #1412: [HUDI-504] Restructuring and auto-generation of docs URL: https://github.com/apache/incubator-hudi/pull/1412#discussion_r396498871 ## File path: .travis.yml ## @@ -31,7 +31,7 @@ after_success: - echo

[GitHub] [incubator-hudi] s-sanjay commented on a change in pull request #1350: [HUDI-629]: Replace Guava's Hashing with an equivalent in NumericUtils.java

2020-03-23 Thread GitBox
s-sanjay commented on a change in pull request #1350: [HUDI-629]: Replace Guava's Hashing with an equivalent in NumericUtils.java URL: https://github.com/apache/incubator-hudi/pull/1350#discussion_r396443518 ## File path:

[GitHub] [incubator-hudi] gdineshbabu88 commented on issue #1150: [HUDI-288]: Add support for ingesting multiple kafka streams in a single DeltaStreamer deployment

2020-03-23 Thread GitBox
gdineshbabu88 commented on issue #1150: [HUDI-288]: Add support for ingesting multiple kafka streams in a single DeltaStreamer deployment URL: https://github.com/apache/incubator-hudi/pull/1150#issuecomment-602577700 @pratyakshsharma Can you update the wiki for HoodieMultiDeltaStreamer

[GitHub] [incubator-hudi] loagosad opened a new issue #1438: How to get the file name corresponding to HoodieKey through the GlobalBloomIndex

2020-03-23 Thread GitBox
loagosad opened a new issue #1438: How to get the file name corresponding to HoodieKey through the GlobalBloomIndex URL: https://github.com/apache/incubator-hudi/issues/1438 I use the `fetchRecordLocation` method in BloomIndex, which returns null, but in fact needs to return the

[GitHub] [incubator-hudi] codecov-io edited a comment on issue #1418: [HUDI-678] Make config package spark free

2020-03-23 Thread GitBox
codecov-io edited a comment on issue #1418: [HUDI-678] Make config package spark free URL: https://github.com/apache/incubator-hudi/pull/1418#issuecomment-601151728 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1418?src=pr=h1) Report > Merging

[GitHub] [incubator-hudi] codecov-io edited a comment on issue #1418: [HUDI-678] Make config package spark free

2020-03-23 Thread GitBox
codecov-io edited a comment on issue #1418: [HUDI-678] Make config package spark free URL: https://github.com/apache/incubator-hudi/pull/1418#issuecomment-601151728 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1418?src=pr=h1) Report > Merging

[GitHub] [incubator-hudi] leesf edited a comment on issue #1418: [HUDI-678] Make config package spark free

2020-03-23 Thread GitBox
leesf edited a comment on issue #1418: [HUDI-678] Make config package spark free URL: https://github.com/apache/incubator-hudi/pull/1418#issuecomment-602409914 @vinothchandar Updated the PR to address your comments. Please take a look when you are free. @vinothchandar @yanghua

[GitHub] [incubator-hudi] codecov-io edited a comment on issue #1418: [HUDI-678] Make config package spark free

2020-03-23 Thread GitBox
codecov-io edited a comment on issue #1418: [HUDI-678] Make config package spark free URL: https://github.com/apache/incubator-hudi/pull/1418#issuecomment-601151728 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1418?src=pr=h1) Report > Merging

[GitHub] [incubator-hudi] codecov-io edited a comment on issue #1418: [HUDI-678] Make config package spark free

2020-03-23 Thread GitBox
codecov-io edited a comment on issue #1418: [HUDI-678] Make config package spark free URL: https://github.com/apache/incubator-hudi/pull/1418#issuecomment-601151728 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1418?src=pr=h1) Report > Merging

[GitHub] [incubator-hudi] leesf commented on issue #1418: [HUDI-678] Make config package spark free

2020-03-23 Thread GitBox
leesf commented on issue #1418: [HUDI-678] Make config package spark free URL: https://github.com/apache/incubator-hudi/pull/1418#issuecomment-602409914 @vinothchandar Updated the PR to address your comments. This is an