[jira] [Commented] (HUDI-722) IndexOutOfBoundsException in MessageColumnIORecordConsumer.addBinary when writing parquet

2020-04-08 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17078432#comment-17078432 ] Vinoth Chandar commented on HUDI-722: - recently, we landed some fixes around nested data structures..

[GitHub] [incubator-hudi] vontman opened a new issue #1498: Migrating parquet table to hudi issue [SUPPORT]

2020-04-08 Thread GitBox
vontman opened a new issue #1498: Migrating parquet table to hudi issue [SUPPORT] URL: https://github.com/apache/incubator-hudi/issues/1498 **Describe the problem you faced** I have questions regarding the Hudi table initial loading (migrating from parquet to Hudi table,

[GitHub] [incubator-hudi] vinothchandar commented on issue #1174: [HUDI-96]: Implemented command line options instead of positional arguments for CLI commands

2020-04-08 Thread GitBox
vinothchandar commented on issue #1174: [HUDI-96]: Implemented command line options instead of positional arguments for CLI commands URL: https://github.com/apache/incubator-hudi/pull/1174#issuecomment-611059024 @pratyakshsharma sure! can you rebase and let us know if it's ready for a

[GitHub] [incubator-hudi] vinothchandar commented on issue #1394: [HUDI-656][Performance] Return a dummy Spark relation after writing the DataFrame

2020-04-08 Thread GitBox
vinothchandar commented on issue #1394: [HUDI-656][Performance] Return a dummy Spark relation after writing the DataFrame URL: https://github.com/apache/incubator-hudi/pull/1394#issuecomment-611068333 @Antauri we will be having a release end of April.. Since it's a small fix, you try

[GitHub] [incubator-hudi] lamber-ken removed a comment on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records

2020-04-08 Thread GitBox
lamber-ken removed a comment on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records URL: https://github.com/apache/incubator-hudi/issues/1491#issuecomment-611092165 Something cached in memory

[GitHub] [incubator-hudi] lamber-ken commented on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records

2020-04-08 Thread GitBox
lamber-ken commented on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records URL: https://github.com/apache/incubator-hudi/issues/1491#issuecomment-610982711 Ok, I had downloaded the CSV data, trying This is an

[jira] [Commented] (HUDI-57) [UMBRELLA] Support ORC Storage

2020-04-08 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-57?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17078462#comment-17078462 ] Vinoth Chandar commented on HUDI-57: [~lamberken] do you think we can target this work for 0.6.0? >

[GitHub] [incubator-hudi] lamber-ken commented on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records

2020-04-08 Thread GitBox
lamber-ken commented on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records URL: https://github.com/apache/incubator-hudi/issues/1491#issuecomment-611092165 Something cached in memory

[GitHub] [incubator-hudi] lamber-ken edited a comment on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records

2020-04-08 Thread GitBox
lamber-ken edited a comment on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records URL: https://github.com/apache/incubator-hudi/issues/1491#issuecomment-610899494 @tverdokhlebd, thanks for your feedback, will try to reproduce it following your steps. 1. bulk_insert the CSV

[GitHub] [incubator-hudi] Jiayi-Liao opened a new pull request #1497: [MINOR] Fix typo in TimelineService

2020-04-08 Thread GitBox
Jiayi-Liao opened a new pull request #1497: [MINOR] Fix typo in TimelineService URL: https://github.com/apache/incubator-hudi/pull/1497 ## What is the purpose of the pull request Fix typo in TimelineService ## Brief change log stand alone -> standalone ## Verify

[GitHub] [incubator-hudi] Antauri commented on issue #1394: [HUDI-656][Performance] Return a dummy Spark relation after writing the DataFrame

2020-04-08 Thread GitBox
Antauri commented on issue #1394: [HUDI-656][Performance] Return a dummy Spark relation after writing the DataFrame URL: https://github.com/apache/incubator-hudi/pull/1394#issuecomment-611032718 Present in 0.5.2-incubating which we're using. We're in development of a framework that does

[GitHub] [incubator-hudi] lamber-ken commented on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records

2020-04-08 Thread GitBox
lamber-ken commented on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records URL: https://github.com/apache/incubator-hudi/issues/1491#issuecomment-611069603 hi @vinothchandar @bvaradar I think we can analyze this issue in parallel, reproduce steps: 1. Download CSV

[GitHub] [incubator-hudi] tverdokhlebd commented on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records

2020-04-08 Thread GitBox
tverdokhlebd commented on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records URL: https://github.com/apache/incubator-hudi/issues/1491#issuecomment-610980530 > upsert (use the same CSV dataset) ? Yes, use the same CSV dataset.

[GitHub] [incubator-hudi] Antauri edited a comment on issue #1394: [HUDI-656][Performance] Return a dummy Spark relation after writing the DataFrame

2020-04-08 Thread GitBox
Antauri edited a comment on issue #1394: [HUDI-656][Performance] Return a dummy Spark relation after writing the DataFrame URL: https://github.com/apache/incubator-hudi/pull/1394#issuecomment-611032718 Present in 0.5.2-incubating on EMR 6.x which we're using. We're in development of a

[GitHub] [incubator-hudi] lamber-ken edited a comment on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records

2020-04-08 Thread GitBox
lamber-ken edited a comment on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records URL: https://github.com/apache/incubator-hudi/issues/1491#issuecomment-66577 hi @tverdokhlebd, it works fine in my local env, just some warning, no OOM ``` 20/04/09 02:07:32 WARN

[GitHub] [incubator-hudi] hikiyoung commented on issue #1499: [SUPPORT] DeltaStreamer - NoClassDefFoundError for HiveDriver

2020-04-08 Thread GitBox
hikiyoung commented on issue #1499: [SUPPORT] DeltaStreamer - NoClassDefFoundError for HiveDriver URL: https://github.com/apache/incubator-hudi/issues/1499#issuecomment-67191 @lamber-ken Thanks for the quick response. Let me try.

[GitHub] [incubator-hudi] tverdokhlebd commented on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records

2020-04-08 Thread GitBox
tverdokhlebd commented on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records URL: https://github.com/apache/incubator-hudi/issues/1491#issuecomment-611158024 @lamber-ken , did you do "bulk insert" on a partition and then "upsert" to the same partition, yes? Did you

[jira] [Created] (HUDI-772) Make UserDefinedBulkInsertPartitioner configurable for DataSource

2020-04-08 Thread Dongwook Kwon (Jira)
Dongwook Kwon created HUDI-772: -- Summary: Make UserDefinedBulkInsertPartitioner configurable for DataSource Key: HUDI-772 URL: https://issues.apache.org/jira/browse/HUDI-772 Project: Apache Hudi

[GitHub] [incubator-hudi] garyli1019 commented on a change in pull request #1486: [HUDI-759] Integrate checkpoint privoder with delta streamer

2020-04-08 Thread GitBox
garyli1019 commented on a change in pull request #1486: [HUDI-759] Integrate checkpoint privoder with delta streamer URL: https://github.com/apache/incubator-hudi/pull/1486#discussion_r405719163 ## File path:

[jira] [Updated] (HUDI-771) Add HoodieWriteClient config to throw error on partial failures

2020-04-08 Thread satish (Jira)
[ https://issues.apache.org/jira/browse/HUDI-771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] satish updated HUDI-771: Description: Introduce new config "hoodie.write.tolerate.failure.percentage". During upsert, if number of errors is

[GitHub] [incubator-hudi] tverdokhlebd commented on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records

2020-04-08 Thread GitBox
tverdokhlebd commented on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records URL: https://github.com/apache/incubator-hudi/issues/1491#issuecomment-611161702 @lamber-ken , can you try with those params? This

[GitHub] [incubator-hudi] lamber-ken commented on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records

2020-04-08 Thread GitBox
lamber-ken commented on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records URL: https://github.com/apache/incubator-hudi/issues/1491#issuecomment-611161092 1. use the same CSV dataset, bulk_insert, then upsert. 2. no, other params keep default

[GitHub] [incubator-hudi] ramachandranms commented on a change in pull request #1473: [HUDI-568] Improve unit test coverage

2020-04-08 Thread GitBox
ramachandranms commented on a change in pull request #1473: [HUDI-568] Improve unit test coverage URL: https://github.com/apache/incubator-hudi/pull/1473#discussion_r405804255 ## File path: hudi-common/src/main/java/org/apache/hudi/common/util/collection/RocksDBDAO.java

[GitHub] [incubator-hudi] lamber-ken commented on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records

2020-04-08 Thread GitBox
lamber-ken commented on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records URL: https://github.com/apache/incubator-hudi/issues/1491#issuecomment-611164201 > @lamber-ken , can you try with those params? No problem, will reply to you in a few hours, need to deal with

[GitHub] [incubator-hudi] lamber-ken commented on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records

2020-04-08 Thread GitBox
lamber-ken commented on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records URL: https://github.com/apache/incubator-hudi/issues/1491#issuecomment-66577 hi @tverdokhlebd, it works fine in my local env, just some warning, no OOM ``` 20/04/09 02:07:32 WARN

[jira] [Created] (HUDI-771) Add HoodieWriteClient config to throw error on partial failures

2020-04-08 Thread satish (Jira)
satish created HUDI-771: --- Summary: Add HoodieWriteClient config to throw error on partial failures Key: HUDI-771 URL: https://issues.apache.org/jira/browse/HUDI-771 Project: Apache Hudi (incubating)

[GitHub] [incubator-hudi] lamber-ken edited a comment on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records

2020-04-08 Thread GitBox
lamber-ken edited a comment on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records URL: https://github.com/apache/incubator-hudi/issues/1491#issuecomment-66577 hi @tverdokhlebd, it works fine in my local env(didn't run command on docker). My local machine: 16GB,

[GitHub] [incubator-hudi] lamber-ken commented on issue #1499: [SUPPORT] DeltaStreamer - NoClassDefFoundError for HiveDriver

2020-04-08 Thread GitBox
lamber-ken commented on issue #1499: [SUPPORT] DeltaStreamer - NoClassDefFoundError for HiveDriver URL: https://github.com/apache/incubator-hudi/issues/1499#issuecomment-64936 hi @hikiyoung, there are two ways to solve your problem 1. Keep hudi-0.5.0 version ``` 1.

[jira] [Commented] (HUDI-57) [UMBRELLA] Support ORC Storage

2020-04-08 Thread lamber-ken (Jira)
[ https://issues.apache.org/jira/browse/HUDI-57?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17078643#comment-17078643 ] lamber-ken commented on HUDI-57: [~vinoth], I'm not sure.   ||task||version|| |1. add storage type|0.6.0|

[GitHub] [incubator-hudi] hikiyoung opened a new issue #1499: [SUPPORT] DeltaStreamer - NoClassDefFoundError for HiveDriver

2020-04-08 Thread GitBox
hikiyoung opened a new issue #1499: [SUPPORT] DeltaStreamer - NoClassDefFoundError for HiveDriver URL: https://github.com/apache/incubator-hudi/issues/1499 **_Tips before filing an issue_** - Have you gone through our [FAQs](https://cwiki.apache.org/confluence/display/HUDI/FAQ)?

[jira] [Assigned] (HUDI-771) Add HoodieWriteClient config to throw error on partial failures

2020-04-08 Thread satish (Jira)
[ https://issues.apache.org/jira/browse/HUDI-771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] satish reassigned HUDI-771: --- Assignee: satish > Add HoodieWriteClient config to throw error on partial failures >

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1396: [HUDI-687] Stop incremental reader on RO table before a pending compaction

2020-04-08 Thread GitBox
vinothchandar commented on a change in pull request #1396: [HUDI-687] Stop incremental reader on RO table before a pending compaction URL: https://github.com/apache/incubator-hudi/pull/1396#discussion_r405824062 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on issue #1396: [HUDI-687] Stop incremental reader on RO table before a pending compaction

2020-04-08 Thread GitBox
vinothchandar commented on issue #1396: [HUDI-687] Stop incremental reader on RO table before a pending compaction URL: https://github.com/apache/incubator-hudi/pull/1396#issuecomment-611202945 @bvaradar Please take a final pass and merge.. I don't have concerns to move forward.. cc

[jira] [Comment Edited] (HUDI-57) [UMBRELLA] Support ORC Storage

2020-04-08 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-57?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17078762#comment-17078762 ] Vinoth Chandar edited comment on HUDI-57 at 4/8/20, 10:13 PM: -- Great! that

[jira] [Updated] (HUDI-558) Introduce ability to compress bloom filters while storing in parquet

2020-04-08 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-558: Labels: help-wanted pull-request-available (was: pull-request-available) > Introduce

[jira] [Updated] (HUDI-772) Make UserDefinedBulkInsertPartitioner configurable for DataSource

2020-04-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-772: Labels: pull-request-available (was: ) > Make UserDefinedBulkInsertPartitioner configurable for

[jira] [Updated] (HUDI-772) Make UserDefinedBulkInsertPartitioner configurable for DataSource

2020-04-08 Thread Dongwook Kwon (Jira)
[ https://issues.apache.org/jira/browse/HUDI-772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongwook Kwon updated HUDI-772: --- Description: The [issue-238|[https://github.com/apache/incubator-hudi/issues/238]] and the

[GitHub] [incubator-hudi] kwondw opened a new pull request #1500: [HUDI-772] Make UserDefinedBulkInsertPartitioner configurable for DataSource

2020-04-08 Thread GitBox
kwondw opened a new pull request #1500: [HUDI-772] Make UserDefinedBulkInsertPartitioner configurable for DataSource URL: https://github.com/apache/incubator-hudi/pull/1500 ## What is the purpose of the pull request * The

[jira] [Updated] (HUDI-772) Make UserDefinedBulkInsertPartitioner configurable for DataSource

2020-04-08 Thread Dongwook Kwon (Jira)
[ https://issues.apache.org/jira/browse/HUDI-772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongwook Kwon updated HUDI-772: --- Description: The [issue-238|https://github.com/apache/incubator-hudi/issues/238] and the

[GitHub] [incubator-hudi] codecov-io commented on issue #1479: [HUDI-758] Modify Integration test to include incremental queries on MOR tables

2020-04-08 Thread GitBox
codecov-io commented on issue #1479: [HUDI-758] Modify Integration test to include incremental queries on MOR tables URL: https://github.com/apache/incubator-hudi/pull/1479#issuecomment-611277336 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1479?src=pr=h1) Report >

[jira] [Commented] (HUDI-57) [UMBRELLA] Support ORC Storage

2020-04-08 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-57?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17078762#comment-17078762 ] Vinoth Chandar commented on HUDI-57: Great! that sounds like a plan... having just 1 engine support with

[incubator-hudi] branch master updated: [MINOR] Fix typo in TimelineService (#1497)

2020-04-08 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git The following commit(s) were added to refs/heads/master by this push: new f7b55af [MINOR] Fix typo in

[GitHub] [incubator-hudi] vinothchandar merged pull request #1497: [MINOR] Fix typo in TimelineService

2020-04-08 Thread GitBox
vinothchandar merged pull request #1497: [MINOR] Fix typo in TimelineService URL: https://github.com/apache/incubator-hudi/pull/1497 This is an automated message from the Apache Git Service. To respond to the message, please

[jira] [Updated] (HUDI-773) Hudi On Azure Data Lake Storage V2

2020-04-08 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanjia Gary Li updated HUDI-773: Summary: Hudi On Azure Data Lake Storage V2 (was: Hudi On Azure Data Lake Storage) > Hudi On Azure

[GitHub] [incubator-hudi] bhasudha commented on a change in pull request #1479: [HUDI-758] Modify Integration test to include incremental queries on MOR tables

2020-04-08 Thread GitBox
bhasudha commented on a change in pull request #1479: [HUDI-758] Modify Integration test to include incremental queries on MOR tables URL: https://github.com/apache/incubator-hudi/pull/1479#discussion_r405870533 ## File path:

[jira] [Created] (HUDI-774) Spark to Avro converter incorrectly generates optional fields

2020-04-08 Thread Alexander Filipchik (Jira)
Alexander Filipchik created HUDI-774: Summary: Spark to Avro converter incorrectly generates optional fields Key: HUDI-774 URL: https://issues.apache.org/jira/browse/HUDI-774 Project: Apache Hudi

[jira] [Updated] (HUDI-677) Abstract/Refactor all transaction management logic into a set of classes

2020-04-08 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-677: Labels: help-wanted (was: ) > Abstract/Refactor all transaction management logic into a set

[jira] [Updated] (HUDI-760) Remove Rolling Stat management from Hudi Writer

2020-04-08 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-760: Labels: help-wanted newbie, (was: newbie,) > Remove Rolling Stat management from Hudi

[jira] [Updated] (HUDI-772) Make UserDefinedBulkInsertPartitioner configurable for DataSource

2020-04-08 Thread Dongwook Kwon (Jira)
[ https://issues.apache.org/jira/browse/HUDI-772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongwook Kwon updated HUDI-772: --- [https://github.com/apache/incubator-hudi/pull/1500] > Make UserDefinedBulkInsertPartitioner configurable

[jira] [Created] (HUDI-773) Hudi On Azure Data Lake Storage

2020-04-08 Thread Yanjia Gary Li (Jira)
Yanjia Gary Li created HUDI-773: --- Summary: Hudi On Azure Data Lake Storage Key: HUDI-773 URL: https://issues.apache.org/jira/browse/HUDI-773 Project: Apache Hudi (incubating) Issue Type: New

[jira] [Updated] (HUDI-772) Make UserDefinedBulkInsertPartitioner configurable for DataSource

2020-04-08 Thread Dongwook Kwon (Jira)
[ https://issues.apache.org/jira/browse/HUDI-772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongwook Kwon updated HUDI-772: --- Description: The [issue-238|[https://github.com/apache/incubator-hudi/issues/238]] and the

[incubator-hudi] branch hudi_test_suite_refactor updated (3e2e710 -> 47f4af6)

2020-04-08 Thread nagarwal
This is an automated email from the ASF dual-hosted git repository. nagarwal pushed a change to branch hudi_test_suite_refactor in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git. from 3e2e710 Fix Compilation Issues + Port Bug Fixes add 47f4af6 Build fixes after

[jira] [Updated] (HUDI-684) Introduce abstraction for writing and reading and compacting from FileGroups

2020-04-08 Thread Balaji Varadarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Balaji Varadarajan updated HUDI-684: Labels: help-wanted (was: ) > Introduce abstraction for writing and reading and compacting

[GitHub] [incubator-hudi] vinothchandar commented on issue #1500: [HUDI-772] Make UserDefinedBulkInsertPartitioner configurable for DataSource

2020-04-08 Thread GitBox
vinothchandar commented on issue #1500: [HUDI-772] Make UserDefinedBulkInsertPartitioner configurable for DataSource URL: https://github.com/apache/incubator-hudi/pull/1500#issuecomment-611276825 @yihua are you able to review this?

[GitHub] [incubator-hudi] lamber-ken commented on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records

2020-04-08 Thread GitBox
lamber-ken commented on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records URL: https://github.com/apache/incubator-hudi/issues/1491#issuecomment-611288468 Hi @tverdokhlebd, the whole upsert cost about 30min ``` dcadmin-imac:hudi-debug dcadmin$ export

[incubator-hudi] 02/02: rename variable per review comments

2020-04-08 Thread bhavanisudha
This is an automated email from the ASF dual-hosted git repository. bhavanisudha pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git commit 3c803421e0e138c223c9bec90f9b7c8a6751317d Author: Satish Kotha AuthorDate: Wed Apr 8 16:35:15 2020 -0700

[GitHub] [incubator-hudi] bhasudha merged pull request #1479: [HUDI-758] Modify Integration test to include incremental queries on MOR tables

2020-04-08 Thread GitBox
bhasudha merged pull request #1479: [HUDI-758] Modify Integration test to include incremental queries on MOR tables URL: https://github.com/apache/incubator-hudi/pull/1479 This is an automated message from the Apache Git

[GitHub] [incubator-hudi] qiuchen100 commented on issue #143: Tracking ticket for folks to be added to slack group

2020-04-08 Thread GitBox
qiuchen100 commented on issue #143: Tracking ticket for folks to be added to slack group URL: https://github.com/apache/incubator-hudi/issues/143#issuecomment-611302392 Please add me too qiuchen...@163.com This is an

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1486: [HUDI-759] Integrate checkpoint privoder with delta streamer

2020-04-08 Thread GitBox
vinothchandar commented on a change in pull request #1486: [HUDI-759] Integrate checkpoint privoder with delta streamer URL: https://github.com/apache/incubator-hudi/pull/1486#discussion_r405958980 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1486: [HUDI-759] Integrate checkpoint privoder with delta streamer

2020-04-08 Thread GitBox
vinothchandar commented on a change in pull request #1486: [HUDI-759] Integrate checkpoint privoder with delta streamer URL: https://github.com/apache/incubator-hudi/pull/1486#discussion_r405967820 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1486: [HUDI-759] Integrate checkpoint privoder with delta streamer

2020-04-08 Thread GitBox
vinothchandar commented on a change in pull request #1486: [HUDI-759] Integrate checkpoint privoder with delta streamer URL: https://github.com/apache/incubator-hudi/pull/1486#discussion_r405968782 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1457: [HUDI-741] Added checks to validate Hoodie's schema evolution.

2020-04-08 Thread GitBox
vinothchandar commented on a change in pull request #1457: [HUDI-741] Added checks to validate Hoodie's schema evolution. URL: https://github.com/apache/incubator-hudi/pull/1457#discussion_r405969176 ## File path:

[GitHub] [incubator-hudi] lamber-ken edited a comment on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records

2020-04-08 Thread GitBox
lamber-ken edited a comment on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records URL: https://github.com/apache/incubator-hudi/issues/1491#issuecomment-611288468 Hi @tverdokhlebd, the whole upsert cost about 30min ``` dcadmin-imac:hudi-debug dcadmin$ export

[GitHub] [incubator-hudi] hddong commented on a change in pull request #1490: [HUDI-700]Add unit test for FileSystemViewCommand

2020-04-08 Thread GitBox
hddong commented on a change in pull request #1490: [HUDI-700]Add unit test for FileSystemViewCommand URL: https://github.com/apache/incubator-hudi/pull/1490#discussion_r405921464 ## File path: hudi-cli/src/test/java/org/apache/hudi/cli/commands/TestFileSystemViewCommand.java

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1486: [HUDI-759] Integrate checkpoint privoder with delta streamer

2020-04-08 Thread GitBox
vinothchandar commented on a change in pull request #1486: [HUDI-759] Integrate checkpoint privoder with delta streamer URL: https://github.com/apache/incubator-hudi/pull/1486#discussion_r405958797 ## File path:

[incubator-hudi] branch hudi_test_suite_refactor updated (47f4af6 -> 5ee8a85)

2020-04-08 Thread nagarwal
This is an automated email from the ASF dual-hosted git repository. nagarwal pushed a change to branch hudi_test_suite_refactor in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git. discard 47f4af6 Build fixes after rebase add 5ee8a85 Build fixes after rebase This

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1486: [HUDI-759] Integrate checkpoint privoder with delta streamer

2020-04-08 Thread GitBox
vinothchandar commented on a change in pull request #1486: [HUDI-759] Integrate checkpoint privoder with delta streamer URL: https://github.com/apache/incubator-hudi/pull/1486#discussion_r405957805 ## File path:

[jira] [Resolved] (HUDI-758) Modify Integration test to include incremental queries on RO, RT views

2020-04-08 Thread satish (Jira)
[ https://issues.apache.org/jira/browse/HUDI-758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] satish resolved HUDI-758. - Resolution: Fixed > Modify Integration test to include incremental queries on RO, RT views >

[jira] [Updated] (HUDI-758) Modify Integration test to include incremental queries on RO, RT views

2020-04-08 Thread satish (Jira)
[ https://issues.apache.org/jira/browse/HUDI-758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] satish updated HUDI-758: Status: Open (was: New) > Modify Integration test to include incremental queries on RO, RT views >

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1486: [HUDI-759] Integrate checkpoint privoder with delta streamer

2020-04-08 Thread GitBox
vinothchandar commented on a change in pull request #1486: [HUDI-759] Integrate checkpoint privoder with delta streamer URL: https://github.com/apache/incubator-hudi/pull/1486#discussion_r405966015 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1486: [HUDI-759] Integrate checkpoint privoder with delta streamer

2020-04-08 Thread GitBox
vinothchandar commented on a change in pull request #1486: [HUDI-759] Integrate checkpoint privoder with delta streamer URL: https://github.com/apache/incubator-hudi/pull/1486#discussion_r405959193 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1486: [HUDI-759] Integrate checkpoint privoder with delta streamer

2020-04-08 Thread GitBox
vinothchandar commented on a change in pull request #1486: [HUDI-759] Integrate checkpoint privoder with delta streamer URL: https://github.com/apache/incubator-hudi/pull/1486#discussion_r405966361 ## File path:

[GitHub] [incubator-hudi] codecov-io commented on issue #1449: [WIP][HUDI-698]Add unit test for CleansCommand

2020-04-08 Thread GitBox
codecov-io commented on issue #1449: [WIP][HUDI-698]Add unit test for CleansCommand URL: https://github.com/apache/incubator-hudi/pull/1449#issuecomment-611344210 # [Codecov](https://codecov.io/gh/apache/incubator-hudi/pull/1449?src=pr=h1) Report > Merging

Build failed in Jenkins: hudi-snapshot-deployment-0.5 #242

2020-04-08 Thread Apache Jenkins Server
See Changes: -- [...truncated 2.35 KB...] /home/jenkins/tools/maven/apache-maven-3.5.4/conf: logging settings.xml toolchains.xml

[GitHub] [incubator-hudi] n3nash commented on issue #1310: [HUDI-601] Improve unit test coverage for HoodieAvroWriteSupport, HoodieRealtimeRecordReader, RealtimeCompactedRecordReader

2020-04-08 Thread GitBox
n3nash commented on issue #1310: [HUDI-601] Improve unit test coverage for HoodieAvroWriteSupport, HoodieRealtimeRecordReader, RealtimeCompactedRecordReader URL: https://github.com/apache/incubator-hudi/pull/1310#issuecomment-611335457 @modi95 please squash your commits to 1

[GitHub] [incubator-hudi] modi95 opened a new pull request #1501: [HUDI-601] Improve unit test coverage for HoodieAvroWriteSupport, HoodieRealtimeRecordReader, RealtimeCompactedRecordReader

2020-04-08 Thread GitBox
modi95 opened a new pull request #1501: [HUDI-601] Improve unit test coverage for HoodieAvroWriteSupport, HoodieRealtimeRecordReader, RealtimeCompactedRecordReader URL: https://github.com/apache/incubator-hudi/pull/1501 NOTE - this is a copy of

[GitHub] [incubator-hudi] lamber-ken commented on issue #143: Tracking ticket for folks to be added to slack group

2020-04-08 Thread GitBox
lamber-ken commented on issue #143: Tracking ticket for folks to be added to slack group URL: https://github.com/apache/incubator-hudi/issues/143#issuecomment-611308595 > Please add me too [qiuchen...@163.com](mailto:qiuchen...@163.com) Done and welcome

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1495: [HUDI-770] Organize upsert/insert API implementation under a single package

2020-04-08 Thread GitBox
vinothchandar commented on a change in pull request #1495: [HUDI-770] Organize upsert/insert API implementation under a single package URL: https://github.com/apache/incubator-hudi/pull/1495#discussion_r405954917 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1495: [HUDI-770] Organize upsert/insert API implementation under a single package

2020-04-08 Thread GitBox
vinothchandar commented on a change in pull request #1495: [HUDI-770] Organize upsert/insert API implementation under a single package URL: https://github.com/apache/incubator-hudi/pull/1495#discussion_r405955610 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1495: [HUDI-770] Organize upsert/insert API implementation under a single package

2020-04-08 Thread GitBox
vinothchandar commented on a change in pull request #1495: [HUDI-770] Organize upsert/insert API implementation under a single package URL: https://github.com/apache/incubator-hudi/pull/1495#discussion_r405957239 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1495: [HUDI-770] Organize upsert/insert API implementation under a single package

2020-04-08 Thread GitBox
vinothchandar commented on a change in pull request #1495: [HUDI-770] Organize upsert/insert API implementation under a single package URL: https://github.com/apache/incubator-hudi/pull/1495#discussion_r405956603 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1495: [HUDI-770] Organize upsert/insert API implementation under a single package

2020-04-08 Thread GitBox
vinothchandar commented on a change in pull request #1495: [HUDI-770] Organize upsert/insert API implementation under a single package URL: https://github.com/apache/incubator-hudi/pull/1495#discussion_r405954276 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1495: [HUDI-770] Organize upsert/insert API implementation under a single package

2020-04-08 Thread GitBox
vinothchandar commented on a change in pull request #1495: [HUDI-770] Organize upsert/insert API implementation under a single package URL: https://github.com/apache/incubator-hudi/pull/1495#discussion_r405953421 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1495: [HUDI-770] Organize upsert/insert API implementation under a single package

2020-04-08 Thread GitBox
vinothchandar commented on a change in pull request #1495: [HUDI-770] Organize upsert/insert API implementation under a single package URL: https://github.com/apache/incubator-hudi/pull/1495#discussion_r405954061 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1495: [HUDI-770] Organize upsert/insert API implementation under a single package

2020-04-08 Thread GitBox
vinothchandar commented on a change in pull request #1495: [HUDI-770] Organize upsert/insert API implementation under a single package URL: https://github.com/apache/incubator-hudi/pull/1495#discussion_r405956442 ## File path:

[GitHub] [incubator-hudi] lamber-ken opened a new pull request #1496: [MINOR] Update community info

2020-04-08 Thread GitBox
lamber-ken opened a new pull request #1496: [MINOR] Update community info URL: https://github.com/apache/incubator-hudi/pull/1496 ## What is the purpose of the pull request Update community info ## Verify this pull request This pull request is a trivial work. ##

[GitHub] [incubator-hudi] yanghua commented on a change in pull request #1449: [WIP][HUDI-698]Add unit test for CleansCommand

2020-04-08 Thread GitBox
yanghua commented on a change in pull request #1449: [WIP][HUDI-698]Add unit test for CleansCommand URL: https://github.com/apache/incubator-hudi/pull/1449#discussion_r405277008 ## File path: hudi-cli/src/test/java/org/apache/hudi/cli/commands/TestCleansCommand.java ##

[GitHub] [incubator-hudi] hddong commented on a change in pull request #1449: [WIP][HUDI-698]Add unit test for CleansCommand

2020-04-08 Thread GitBox
hddong commented on a change in pull request #1449: [WIP][HUDI-698]Add unit test for CleansCommand URL: https://github.com/apache/incubator-hudi/pull/1449#discussion_r405278391 ## File path: hudi-cli/src/test/java/org/apache/hudi/cli/commands/TestCleansCommand.java ## @@

[GitHub] [incubator-hudi] hddong commented on a change in pull request #1449: [WIP][HUDI-698]Add unit test for CleansCommand

2020-04-08 Thread GitBox
hddong commented on a change in pull request #1449: [WIP][HUDI-698]Add unit test for CleansCommand URL: https://github.com/apache/incubator-hudi/pull/1449#discussion_r402189178 ## File path: hudi-cli/src/test/java/org/apache/hudi/cli/commands/TestCleansCommand.java ## @@

[GitHub] [incubator-hudi] lamber-ken merged pull request #1496: [MINOR] Update community info

2020-04-08 Thread GitBox
lamber-ken merged pull request #1496: [MINOR] Update community info URL: https://github.com/apache/incubator-hudi/pull/1496 This is an automated message from the Apache Git Service. To respond to the message, please log on

[incubator-hudi] branch asf-site updated: [MINOR] Update community info

2020-04-08 Thread lamberken
This is an automated email from the ASF dual-hosted git repository. lamberken pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new 988cd09 [MINOR] Update

[GitHub] [incubator-hudi] yanghua commented on a change in pull request #1490: [HUDI-700]Add unit test for FileSystemViewCommand

2020-04-08 Thread GitBox
yanghua commented on a change in pull request #1490: [HUDI-700]Add unit test for FileSystemViewCommand URL: https://github.com/apache/incubator-hudi/pull/1490#discussion_r405372064 ## File path: hudi-cli/src/test/java/org/apache/hudi/cli/commands/TestFileSystemViewCommand.java

[GitHub] [incubator-hudi] pratyakshsharma commented on a change in pull request #1452: [HUDI-740]Fix can not specify the sparkMaster and code clean for SparkUtil

2020-04-08 Thread GitBox
pratyakshsharma commented on a change in pull request #1452: [HUDI-740]Fix can not specify the sparkMaster and code clean for SparkUtil URL: https://github.com/apache/incubator-hudi/pull/1452#discussion_r405384142 ## File path:

[GitHub] [incubator-hudi] yanghua commented on a change in pull request #1490: [HUDI-700]Add unit test for FileSystemViewCommand

2020-04-08 Thread GitBox
yanghua commented on a change in pull request #1490: [HUDI-700]Add unit test for FileSystemViewCommand URL: https://github.com/apache/incubator-hudi/pull/1490#discussion_r405371789 ## File path: hudi-cli/src/test/java/org/apache/hudi/cli/commands/TestFileSystemViewCommand.java

[GitHub] [incubator-hudi] yanghua commented on a change in pull request #1490: [HUDI-700]Add unit test for FileSystemViewCommand

2020-04-08 Thread GitBox
yanghua commented on a change in pull request #1490: [HUDI-700]Add unit test for FileSystemViewCommand URL: https://github.com/apache/incubator-hudi/pull/1490#discussion_r405375896 ## File path: hudi-cli/src/test/java/org/apache/hudi/cli/commands/TestFileSystemViewCommand.java

[GitHub] [incubator-hudi] yanghua commented on a change in pull request #1490: [HUDI-700]Add unit test for FileSystemViewCommand

2020-04-08 Thread GitBox
yanghua commented on a change in pull request #1490: [HUDI-700]Add unit test for FileSystemViewCommand URL: https://github.com/apache/incubator-hudi/pull/1490#discussion_r405375308 ## File path: hudi-cli/src/test/java/org/apache/hudi/cli/commands/TestFileSystemViewCommand.java

[incubator-hudi] branch asf-site updated: Travis CI build asf-site

2020-04-08 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new d99609b Travis CI build asf-site

[GitHub] [incubator-hudi] pratyakshsharma commented on a change in pull request #1486: [HUDI-759] Integrate checkpoint privoder with delta streamer

2020-04-08 Thread GitBox
pratyakshsharma commented on a change in pull request #1486: [HUDI-759] Integrate checkpoint privoder with delta streamer URL: https://github.com/apache/incubator-hudi/pull/1486#discussion_r405412784 ## File path:

[GitHub] [incubator-hudi] lamber-ken commented on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records

2020-04-08 Thread GitBox
lamber-ken commented on issue #1491: [SUPPORT] OutOfMemoryError during upsert 53M records URL: https://github.com/apache/incubator-hudi/issues/1491#issuecomment-610899494 @tverdokhlebd, thanks for your feedback, will try to reproduce it following your steps.
