[jira] [Commented] (HUDI-127) [Good to do] Tidy up cWiki

2020-01-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009483#comment-17009483 ] Vinoth Chandar commented on HUDI-127: - with the 

[jira] [Closed] (HUDI-127) [Good to do] Tidy up cWiki

2020-01-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar closed HUDI-127. --- Resolution: Fixed > [Good to do] Tidy up cWiki > -- > > Key:

[jira] [Updated] (HUDI-127) [Good to do] Tidy up cWiki

2020-01-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-127: Status: In Progress (was: Open) > [Good to do] Tidy up cWiki > -- > >

[GitHub] [incubator-hudi] Guru107 commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table

2020-01-07 Thread GitBox
Guru107 commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table URL: https://github.com/apache/incubator-hudi/pull/1128#issuecomment-571486297 @bvaradar @lamber-ken I am still getting this error after I took the latest pull and

[GitHub] [incubator-hudi] yanghua commented on issue #1191: [HUDI-503] Add hudi test suite documentation into the README file of the test suite module

2020-01-07 Thread GitBox
yanghua commented on issue #1191: [HUDI-503] Add hudi test suite documentation into the README file of the test suite module URL: https://github.com/apache/incubator-hudi/pull/1191#issuecomment-571541880 cc @n3nash This is

[GitHub] [incubator-hudi] hddong commented on issue #1114: [HUDI-438] Merge duplicated code fragment

2020-01-07 Thread GitBox
hddong commented on issue #1114: [HUDI-438] Merge duplicated code fragment URL: https://github.com/apache/incubator-hudi/pull/1114#issuecomment-571518957 @leesf @nsivabalan thanks for your review. This is an automated message

[GitHub] [incubator-hudi] Panxing4game edited a comment on issue #1189: [HUDI-376]: AWS Glue dependency issue for EMR 5.28.0

2020-01-07 Thread GitBox
Panxing4game edited a comment on issue #1189: [HUDI-376]: AWS Glue dependency issue for EMR 5.28.0 URL: https://github.com/apache/incubator-hudi/pull/1189#issuecomment-571518371 > Thanks for opening this PR @Panxing4game ! You might only need to modify s3_filesystem.md and

[GitHub] [incubator-hudi] Panxing4game commented on issue #1189: [HUDI-376]: AWS Glue dependency issue for EMR 5.28.0

2020-01-07 Thread GitBox
Panxing4game commented on issue #1189: [HUDI-376]: AWS Glue dependency issue for EMR 5.28.0 URL: https://github.com/apache/incubator-hudi/pull/1189#issuecomment-571518371 > Thanks for opening this PR @Panxing4game ! You might only need to modify s3_filesystem.md and s3_filesystem.cn.md,

[GitHub] [incubator-hudi] Panxing4game commented on issue #1189: [HUDI-376]: AWS Glue dependency issue for EMR 5.28.0

2020-01-07 Thread GitBox
Panxing4game commented on issue #1189: [HUDI-376]: AWS Glue dependency issue for EMR 5.28.0 URL: https://github.com/apache/incubator-hudi/pull/1189#issuecomment-571520147 I think I could translate this s3 cn.md page to Chinese as well. Maybe in another ticket :)

[GitHub] [incubator-hudi] Guru107 commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table

2020-01-07 Thread GitBox
Guru107 commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table URL: https://github.com/apache/incubator-hudi/pull/1128#issuecomment-571496082 @lamber-ken I had run the spark job after merging of

[GitHub] [incubator-hudi] hddong commented on issue #1157: [HUDI-332]Add operation type (insert/upsert/bulkinsert/delete) to HoodieCommitMetadata

2020-01-07 Thread GitBox
hddong commented on issue #1157: [HUDI-332]Add operation type (insert/upsert/bulkinsert/delete) to HoodieCommitMetadata URL: https://github.com/apache/incubator-hudi/pull/1157#issuecomment-571521331 @bvaradar Operation type is stored in the avro objects when archiving, but there are a

[GitHub] [incubator-hudi] yanghua commented on issue #1115: [HUDI-392] Introduce DIstributedTestDataSource to generate test data

2020-01-07 Thread GitBox
yanghua commented on issue #1115: [HUDI-392] Introduce DIstributedTestDataSource to generate test data URL: https://github.com/apache/incubator-hudi/pull/1115#issuecomment-571537763 @n3nash OK, will try to review the whole test suite again to see if I can find some issues.

[GitHub] [incubator-hudi] lamber-ken commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table

2020-01-07 Thread GitBox
lamber-ken commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table URL: https://github.com/apache/incubator-hudi/pull/1128#issuecomment-571490358 > @bvaradar @lamber-ken I am still getting this error after I took the latest pull

[GitHub] [incubator-hudi] lamber-ken commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table

2020-01-07 Thread GitBox
lamber-ken commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table URL: https://github.com/apache/incubator-hudi/pull/1128#issuecomment-571499096 > @lamber-ken I had run the spark job after merging of

[GitHub] [incubator-hudi] yanghua commented on a change in pull request #1191: [HUDI-503] Add hudi test suite documentation into the README file of the test suite module

2020-01-07 Thread GitBox
yanghua commented on a change in pull request #1191: [HUDI-503] Add hudi test suite documentation into the README file of the test suite module URL: https://github.com/apache/incubator-hudi/pull/1191#discussion_r363690393 ## File path: hudi-test-suite/README.md ## @@ -0,0

[jira] [Created] (HUDI-508) Standardize on using "Table" instead of "Dataset"

2020-01-07 Thread Vinoth Chandar (Jira)
Vinoth Chandar created HUDI-508: --- Summary: Standardize on using "Table" instead of "Dataset" Key: HUDI-508 URL: https://issues.apache.org/jira/browse/HUDI-508 Project: Apache Hudi (incubating)

[jira] [Updated] (HUDI-508) Standardize on using "Table" instead of "Dataset"

2020-01-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-508: Labels: pull-request-available (was: ) > Standardize on using "Table" instead of "Dataset" >

[jira] [Updated] (HUDI-509) Rename "views" into "query types" according to cWiki

2020-01-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-509: Status: Open (was: New) > Rename "views" into "query types" according to cWiki >

[jira] [Updated] (HUDI-510) Update site documentation in sync with cWiki

2020-01-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-510: Status: Open (was: New) > Update site documentation in sync with cWiki >

[jira] [Updated] (HUDI-508) Standardize on using "Table" instead of "Dataset"

2020-01-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-508: Status: Open (was: New) > Standardize on using "Table" instead of "Dataset" >

[jira] [Created] (HUDI-510) Update site documentation in sync with cWiki

2020-01-07 Thread Vinoth Chandar (Jira)
Vinoth Chandar created HUDI-510: --- Summary: Update site documentation in sync with cWiki Key: HUDI-510 URL: https://issues.apache.org/jira/browse/HUDI-510 Project: Apache Hudi (incubating)

[GitHub] [incubator-hudi] vinothchandar opened a new pull request #1197: [WIP] [HUDI-508] Standardizing on "Table" instead of "Dataset" across code

2020-01-07 Thread GitBox
vinothchandar opened a new pull request #1197: [WIP] [HUDI-508] Standardizing on "Table" instead of "Dataset" across code URL: https://github.com/apache/incubator-hudi/pull/1197 - Docs were talking about storage types before, cWiki moved to "Table" - Most of code already has

[jira] [Created] (HUDI-507) Support \ t split hdfs source

2020-01-07 Thread liujinhui (Jira)
liujinhui created HUDI-507: -- Summary: Support \ t split hdfs source Key: HUDI-507 URL: https://issues.apache.org/jira/browse/HUDI-507 Project: Apache Hudi (incubating) Issue Type: Improvement

[jira] [Created] (HUDI-509) Rename "views" into "query types" according to cWiki

2020-01-07 Thread Vinoth Chandar (Jira)
Vinoth Chandar created HUDI-509: --- Summary: Rename "views" into "query types" according to cWiki Key: HUDI-509 URL: https://issues.apache.org/jira/browse/HUDI-509 Project: Apache Hudi (incubating)

[GitHub] [incubator-hudi] vinothchandar commented on issue #1197: [HUDI-508] Standardizing on "Table" instead of "Dataset" across code

2020-01-07 Thread GitBox
vinothchandar commented on issue #1197: [HUDI-508] Standardizing on "Table" instead of "Dataset" across code URL: https://github.com/apache/incubator-hudi/pull/1197#issuecomment-571693255 @n3nash can you please review This

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1159: [WIP][HUDI-479] Eliminate or Minimize use of Guava if possible

2020-01-07 Thread GitBox
vinothchandar commented on a change in pull request #1159: [WIP][HUDI-479] Eliminate or Minimize use of Guava if possible URL: https://github.com/apache/incubator-hudi/pull/1159#discussion_r363820025 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1159: [WIP][HUDI-479] Eliminate or Minimize use of Guava if possible

2020-01-07 Thread GitBox
vinothchandar commented on a change in pull request #1159: [WIP][HUDI-479] Eliminate or Minimize use of Guava if possible URL: https://github.com/apache/incubator-hudi/pull/1159#discussion_r363820681 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1159: [WIP][HUDI-479] Eliminate or Minimize use of Guava if possible

2020-01-07 Thread GitBox
vinothchandar commented on a change in pull request #1159: [WIP][HUDI-479] Eliminate or Minimize use of Guava if possible URL: https://github.com/apache/incubator-hudi/pull/1159#discussion_r363821619 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1195: [HUDI-319] Add a new maven profile to generate unified Javadoc for all Java and Scala classes

2020-01-07 Thread GitBox
vinothchandar commented on a change in pull request #1195: [HUDI-319] Add a new maven profile to generate unified Javadoc for all Java and Scala classes URL: https://github.com/apache/incubator-hudi/pull/1195#discussion_r363823740 ## File path: pom.xml ## @@ -938,6

[GitHub] [incubator-hudi] vinothchandar commented on issue #1195: [HUDI-319] Add a new maven profile to generate unified Javadoc for all Java and Scala classes

2020-01-07 Thread GitBox
vinothchandar commented on issue #1195: [HUDI-319] Add a new maven profile to generate unified Javadoc for all Java and Scala classes URL: https://github.com/apache/incubator-hudi/pull/1195#issuecomment-571655264 Can merge once we resolve the naming... and also please add a line in

[GitHub] [incubator-hudi] n3nash commented on a change in pull request #1197: [HUDI-508] Standardizing on "Table" instead of "Dataset" across code

2020-01-07 Thread GitBox
n3nash commented on a change in pull request #1197: [HUDI-508] Standardizing on "Table" instead of "Dataset" across code URL: https://github.com/apache/incubator-hudi/pull/1197#discussion_r363901294 ## File path:

[GitHub] [incubator-hudi] n3nash commented on a change in pull request #1194: [HUDI-326] Add support to delete records with only record_key

2020-01-07 Thread GitBox
n3nash commented on a change in pull request #1194: [HUDI-326] Add support to delete records with only record_key URL: https://github.com/apache/incubator-hudi/pull/1194#discussion_r363910472 ## File path: hudi-spark/src/main/scala/org/apache/hudi/HoodieSparkSqlWriter.scala

[GitHub] [incubator-hudi] bschell commented on a change in pull request #1194: [HUDI-326] Add support to delete records with only record_key

2020-01-07 Thread GitBox
bschell commented on a change in pull request #1194: [HUDI-326] Add support to delete records with only record_key URL: https://github.com/apache/incubator-hudi/pull/1194#discussion_r363869991 ## File path: hudi-spark/src/main/scala/org/apache/hudi/HoodieSparkSqlWriter.scala

[GitHub] [incubator-hudi] bschell commented on a change in pull request #1194: [HUDI-326] Add support to delete records with only record_key

2020-01-07 Thread GitBox
bschell commented on a change in pull request #1194: [HUDI-326] Add support to delete records with only record_key URL: https://github.com/apache/incubator-hudi/pull/1194#discussion_r363869991 ## File path: hudi-spark/src/main/scala/org/apache/hudi/HoodieSparkSqlWriter.scala

[GitHub] [incubator-hudi] umehrot2 commented on issue #1005: [HUDI-91][HUDI-12]Migrate to spark 2.4.4, migrate to spark-avro library instead of databricks-avro, add support for Decimal/Date types

2020-01-07 Thread GitBox
umehrot2 commented on issue #1005: [HUDI-91][HUDI-12]Migrate to spark 2.4.4, migrate to spark-avro library instead of databricks-avro, add support for Decimal/Date types URL: https://github.com/apache/incubator-hudi/pull/1005#issuecomment-571730182 > @umehrot2 are you still driving this?

[GitHub] [incubator-hudi] zhedoubushishi commented on a change in pull request #1175: [HUDI-495] Update deprecated HBase API

2020-01-07 Thread GitBox
zhedoubushishi commented on a change in pull request #1175: [HUDI-495] Update deprecated HBase API URL: https://github.com/apache/incubator-hudi/pull/1175#discussion_r363919247 ## File path: hudi-client/src/main/java/org/apache/hudi/index/hbase/HBaseIndex.java ## @@

[jira] [Commented] (HUDI-494) [DEBUGGING] Huge amount of tasks when writing files into HDFS

2020-01-07 Thread Yanjia Gary Li (Jira)
[ https://issues.apache.org/jira/browse/HUDI-494?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010031#comment-17010031 ] Yanjia Gary Li commented on HUDI-494: - [~vinoth] Thanks for the feedback. The code snippets were

[GitHub] [incubator-hudi] zhedoubushishi commented on a change in pull request #1175: [HUDI-495] Update deprecated HBase API

2020-01-07 Thread GitBox
zhedoubushishi commented on a change in pull request #1175: [HUDI-495] Update deprecated HBase API URL: https://github.com/apache/incubator-hudi/pull/1175#discussion_r363919247 ## File path: hudi-client/src/main/java/org/apache/hudi/index/hbase/HBaseIndex.java ## @@

[GitHub] [incubator-hudi] bvaradar commented on issue #1185: [HUDI-500] Use enum method to replace switch case

2020-01-07 Thread GitBox
bvaradar commented on issue #1185: [HUDI-500] Use enum method to replace switch case URL: https://github.com/apache/incubator-hudi/pull/1185#issuecomment-571760972 Thanks @dengziming. I agree with @vinothchandar to keep abstraction layers smaller and well defined to preserve

[GitHub] [incubator-hudi] bvaradar commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table

2020-01-07 Thread GitBox
bvaradar commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table URL: https://github.com/apache/incubator-hudi/pull/1128#issuecomment-571765113 @Guru107 : Can you double confirm if the change (

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1197: [HUDI-508] Standardizing on "Table" instead of "Dataset" across code

2020-01-07 Thread GitBox
vinothchandar commented on a change in pull request #1197: [HUDI-508] Standardizing on "Table" instead of "Dataset" across code URL: https://github.com/apache/incubator-hudi/pull/1197#discussion_r363944418 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on issue #1005: [HUDI-91][HUDI-12]Migrate to spark 2.4.4, migrate to spark-avro library instead of databricks-avro, add support for Decimal/Date types

2020-01-07 Thread GitBox
vinothchandar commented on issue #1005: [HUDI-91][HUDI-12]Migrate to spark 2.4.4, migrate to spark-avro library instead of databricks-avro, add support for Decimal/Date types URL: https://github.com/apache/incubator-hudi/pull/1005#issuecomment-571764759 My suggestion is to freeze code by

[GitHub] [incubator-hudi] vinothchandar commented on issue #1197: [HUDI-508] Standardizing on "Table" instead of "Dataset" across code

2020-01-07 Thread GitBox
vinothchandar commented on issue #1197: [HUDI-508] Standardizing on "Table" instead of "Dataset" across code URL: https://github.com/apache/incubator-hudi/pull/1197#issuecomment-571766001 yes. have a separate sub tasks under https://jira.apache.org/jira/browse/HUDI-334 Views =>

[incubator-hudi] branch master updated (8306f74 -> 9706f65)

2020-01-07 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git. from 8306f74 [HUDI-417] Refactor HoodieWriteClient so that commit logic can be shareable by both bootstrap and

[GitHub] [incubator-hudi] vinothchandar merged pull request #1197: [HUDI-508] Standardizing on "Table" instead of "Dataset" across code

2020-01-07 Thread GitBox
vinothchandar merged pull request #1197: [HUDI-508] Standardizing on "Table" instead of "Dataset" across code URL: https://github.com/apache/incubator-hudi/pull/1197 This is an automated message from the Apache Git Service.

[jira] [Commented] (HUDI-397) Normalize log print statement

2020-01-07 Thread Nishith Agarwal (Jira)
[ https://issues.apache.org/jira/browse/HUDI-397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010096#comment-17010096 ] Nishith Agarwal commented on HUDI-397: -- [~yanghua] I agree, this was introduced by me during tests to

[jira] [Commented] (HUDI-433) Improve the way log block magic header is identified when a corrupt block is encountered #416

2020-01-07 Thread Nishith Agarwal (Jira)
[ https://issues.apache.org/jira/browse/HUDI-433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010098#comment-17010098 ] Nishith Agarwal commented on HUDI-433: -- Yes, we should close this one. > Improve the way log block

[jira] [Comment Edited] (HUDI-433) Improve the way log block magic header is identified when a corrupt block is encountered #416

2020-01-07 Thread Nishith Agarwal (Jira)
[ https://issues.apache.org/jira/browse/HUDI-433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010098#comment-17010098 ] Nishith Agarwal edited comment on HUDI-433 at 1/7/20 9:20 PM: -- Yes, we should

[jira] [Comment Edited] (HUDI-433) Improve the way log block magic header is identified when a corrupt block is encountered #416

2020-01-07 Thread Nishith Agarwal (Jira)
[ https://issues.apache.org/jira/browse/HUDI-433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010098#comment-17010098 ] Nishith Agarwal edited comment on HUDI-433 at 1/7/20 9:20 PM: -- [~vinoth]  Yes,

[jira] [Commented] (HUDI-41) Get rid of special casing Global Index for MOR rollback #394

2020-01-07 Thread Nishith Agarwal (Jira)
[ https://issues.apache.org/jira/browse/HUDI-41?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010103#comment-17010103 ] Nishith Agarwal commented on HUDI-41: - We do have some special casing 

[GitHub] [incubator-hudi] zhedoubushishi commented on issue #1175: [HUDI-495] Update deprecated HBase API

2020-01-07 Thread GitBox
zhedoubushishi commented on issue #1175: [HUDI-495] Update deprecated HBase API URL: https://github.com/apache/incubator-hudi/pull/1175#issuecomment-571788946 I think in our cases doMutations method will always do flush right after doing mutator.mutate(...). So each time you call

[GitHub] [incubator-hudi] bvaradar commented on a change in pull request #1122: [HUDI-29]: Support hudi COW table to use *ANALYZE TABLE table_name COMMPUTE STATISTICS* to get table current rows

2020-01-07 Thread GitBox
bvaradar commented on a change in pull request #1122: [HUDI-29]: Support hudi COW table to use *ANALYZE TABLE table_name COMMPUTE STATISTICS* to get table current rows URL: https://github.com/apache/incubator-hudi/pull/1122#discussion_r363992460 ## File path:

[GitHub] [incubator-hudi] leesf merged pull request #1189: [HUDI-376]: AWS Glue dependency issue for EMR 5.28.0

2020-01-07 Thread GitBox
leesf merged pull request #1189: [HUDI-376]: AWS Glue dependency issue for EMR 5.28.0 URL: https://github.com/apache/incubator-hudi/pull/1189 This is an automated message from the Apache Git Service. To respond to the

[incubator-hudi] branch asf-site updated: [HUDI-376]: AWS Glue dependency issue for EMR 5.28.0 (#1189)

2020-01-07 Thread leesf
This is an automated email from the ASF dual-hosted git repository. leesf pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new 6433e14 [HUDI-376]: AWS Glue

[GitHub] [incubator-hudi] lamber-ken opened a new pull request #1198: [MINOR] Remove old jekyll config file

2020-01-07 Thread GitBox
lamber-ken opened a new pull request #1198: [MINOR] Remove old jekyll config file URL: https://github.com/apache/incubator-hudi/pull/1198 ## What is the purpose of the pull request Remove old jekyll config file, `_config.yml` is useless in master branch. ## Brief change log

[jira] [Resolved] (HUDI-444) Refactor the codes based on scala codestyle NullChecker rule

2020-01-07 Thread lamber-ken (Jira)
[ https://issues.apache.org/jira/browse/HUDI-444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lamber-ken resolved HUDI-444. - Resolution: Fixed Fixed at master 313fab5fd1ef715f98a123d0e09f6010daacab68 > Refactor the codes based on

[jira] [Commented] (HUDI-417) Refactor HoodieWriteClient so that commit logic can be shareable by both bootstrap and normal write operations

2020-01-07 Thread Nicholas Jiang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010298#comment-17010298 ] Nicholas Jiang commented on HUDI-417: - [~vbalaji]OK. I will try Spark datasource. > Refactor

[jira] [Updated] (HUDI-472) Make sortBy() inside bulkInsertInternal() configurable for bulk_insert

2020-01-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-472: Fix Version/s: (was: 0.5.1) 0.5.2 > Make sortBy() inside bulkInsertInternal()

[jira] [Updated] (HUDI-483) Fix unit test for Archiving to reflect empty instant files for requested commit/deltacommits

2020-01-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-483: Status: Open (was: New) > Fix unit test for Archiving to reflect empty instant files for requested

[jira] [Closed] (HUDI-439) Fix HoodieSparkSqlWriter wrt code refactoring

2020-01-07 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan closed HUDI-439. Resolution: Duplicate HUDI-438 > Fix HoodieSparkSqlWriter wrt code refactoring >

[GitHub] [incubator-hudi] Guru107 edited a comment on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table

2020-01-07 Thread GitBox
Guru107 edited a comment on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table URL: https://github.com/apache/incubator-hudi/pull/1128#issuecomment-571910084 @bvaradar .aux folder is empty. Will, there be any issue if I deleted those empty

[GitHub] [incubator-hudi] bvaradar commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table

2020-01-07 Thread GitBox
bvaradar commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table URL: https://github.com/apache/incubator-hudi/pull/1128#issuecomment-571919614 @Guru107 : My bad, It should be empty with the new format. In this case, just deleting

[GitHub] [incubator-hudi] lamber-ken commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table

2020-01-07 Thread GitBox
lamber-ken commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table URL: https://github.com/apache/incubator-hudi/pull/1128#issuecomment-571919780 > @bvaradar Yes, it is part of the code, I checked it. I think the problem came

[GitHub] [incubator-hudi] lamber-ken edited a comment on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table

2020-01-07 Thread GitBox
lamber-ken edited a comment on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table URL: https://github.com/apache/incubator-hudi/pull/1128#issuecomment-571919780 > @bvaradar Yes, it is part of the code, I checked it. I think the problem

[GitHub] [incubator-hudi] hddong edited a comment on issue #1157: [HUDI-332]Add operation type (insert/upsert/bulkinsert/delete) to HoodieCommitMetadata

2020-01-07 Thread GitBox
hddong edited a comment on issue #1157: [HUDI-332]Add operation type (insert/upsert/bulkinsert/delete) to HoodieCommitMetadata URL: https://github.com/apache/incubator-hudi/pull/1157#issuecomment-571921938 > @bvaradar Operation type is stored in the avro objects when archiving, but there

[GitHub] [incubator-hudi] hddong commented on issue #1157: [HUDI-332]Add operation type (insert/upsert/bulkinsert/delete) to HoodieCommitMetadata

2020-01-07 Thread GitBox
hddong commented on issue #1157: [HUDI-332]Add operation type (insert/upsert/bulkinsert/delete) to HoodieCommitMetadata URL: https://github.com/apache/incubator-hudi/pull/1157#issuecomment-571921938 > @bvaradar Operation type is stored in the avro objects when archiving, but there are a

[GitHub] [incubator-hudi] Guru107 commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table

2020-01-07 Thread GitBox
Guru107 commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table URL: https://github.com/apache/incubator-hudi/pull/1128#issuecomment-571910084 @bvaradar .aux folder is empty

[GitHub] [incubator-hudi] bvaradar commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table

2020-01-07 Thread GitBox
bvaradar commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table URL: https://github.com/apache/incubator-hudi/pull/1128#issuecomment-571920527 @Guru107 : The reason why there is no tooling to seamlessly fix this is because this

[GitHub] [incubator-hudi] hddong edited a comment on issue #1157: [HUDI-332]Add operation type (insert/upsert/bulkinsert/delete) to HoodieCommitMetadata

2020-01-07 Thread GitBox
hddong edited a comment on issue #1157: [HUDI-332]Add operation type (insert/upsert/bulkinsert/delete) to HoodieCommitMetadata URL: https://github.com/apache/incubator-hudi/pull/1157#issuecomment-571921938 > @bvaradar Operation type is stored in the avro objects when archiving, but there

[GitHub] [incubator-hudi] bhasudha commented on a change in pull request #1122: [HUDI-29]: Support hudi COW table to use *ANALYZE TABLE table_name COMMPUTE STATISTICS* to get table current rows

2020-01-07 Thread GitBox
bhasudha commented on a change in pull request #1122: [HUDI-29]: Support hudi COW table to use *ANALYZE TABLE table_name COMMPUTE STATISTICS* to get table current rows URL: https://github.com/apache/incubator-hudi/pull/1122#discussion_r364017471 ## File path:

[jira] [Commented] (HUDI-500) Use enum method to replace switch case in HoodieTableMetaClient

2020-01-07 Thread dengziming (Jira)
[ https://issues.apache.org/jira/browse/HUDI-500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010238#comment-17010238 ] dengziming commented on HUDI-500: - this is invalid change, who helps to close it, thank you. > Use enum

[jira] [Commented] (HUDI-507) Support \ t split hdfs source

2020-01-07 Thread liujinhui (Jira)
[ https://issues.apache.org/jira/browse/HUDI-507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010247#comment-17010247 ] liujinhui commented on HUDI-507: [~vinoth]  _please give me the contributor permission, Email sent before,

[jira] [Issue Comment Deleted] (HUDI-507) Support \ t split hdfs source

2020-01-07 Thread liujinhui (Jira)
[ https://issues.apache.org/jira/browse/HUDI-507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] liujinhui updated HUDI-507: --- Comment: was deleted (was: [~vinoth]  _please give me the contributor permission, Email sent before, but not

[jira] [Commented] (HUDI-450) Refactor the codes based on scala codestyle MagicNumberChecker rule

2020-01-07 Thread lamber-ken (Jira)
[ https://issues.apache.org/jira/browse/HUDI-450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010268#comment-17010268 ] lamber-ken commented on HUDI-450: - hi because of the checkstyle rules are under discusstion, don't fix

[jira] [Commented] (HUDI-447) Refactor the codes based on scala codestyle IfBraceChecker rule

2020-01-07 Thread lamber-ken (Jira)
[ https://issues.apache.org/jira/browse/HUDI-447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010267#comment-17010267 ] lamber-ken commented on HUDI-447: - hi because of the checkstyle rules are under discusstion, don't fix

[jira] [Commented] (HUDI-507) Support \ t split hdfs source

2020-01-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010281#comment-17010281 ] Vinoth Chandar commented on HUDI-507: -  I apologize. I probably missed some notification.. You have

[GitHub] [incubator-hudi] vinothchandar commented on issue #1122: [HUDI-29]: Support hudi COW table to use *ANALYZE TABLE table_name COMMPUTE STATISTICS* to get table current rows

2020-01-07 Thread GitBox
vinothchandar commented on issue #1122: [HUDI-29]: Support hudi COW table to use *ANALYZE TABLE table_name COMMPUTE STATISTICS* to get table current rows URL: https://github.com/apache/incubator-hudi/pull/1122#issuecomment-571864785 cc @n3nash as well.. Lets ensure we don't have a

Build failed in Jenkins: hudi-snapshot-deployment-0.5 #153

2020-01-07 Thread Apache Jenkins Server
See Changes: -- [...truncated 2.17 KB...] /home/jenkins/tools/maven/apache-maven-3.5.4/bin: m2.conf mvn mvn.cmd mvnDebug mvnDebug.cmd mvnyjp

[GitHub] [incubator-hudi] Guru107 commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table

2020-01-07 Thread GitBox
Guru107 commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table URL: https://github.com/apache/incubator-hudi/pull/1128#issuecomment-571891243 @bvaradar Yes, it is part of the code, I checked it. I think the problem came because, I

[jira] [Updated] (HUDI-493) Add docs for delete support in Hudi client apis

2020-01-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-493: Status: Open (was: New) > Add docs for delete support in Hudi client apis >

[jira] [Assigned] (HUDI-506) Broken/Wrong links in new website

2020-01-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar reassigned HUDI-506: --- Assignee: lamber-ken > Broken/Wrong links in new website > -

[jira] [Updated] (HUDI-506) Broken/Wrong links in new website

2020-01-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-506: Status: Open (was: New) > Broken/Wrong links in new website > - > >

[jira] [Updated] (HUDI-86) Add indexing support to the log file format

2020-01-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-86?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-86: --- Fix Version/s: (was: 0.5.1) 0.6.0 > Add indexing support to the log file format

[jira] [Commented] (HUDI-238) Make separate release for hudi spark/scala based packages for scala 2.12

2020-01-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010372#comment-17010372 ] Vinoth Chandar commented on HUDI-238: - Hi are you still working on this?  > Make separate release for

[jira] [Created] (HUDI-512) Decouple logical partitioning from physical one.

2020-01-07 Thread Alexander Filipchik (Jira)
Alexander Filipchik created HUDI-512: Summary: Decouple logical partitioning from physical one. Key: HUDI-512 URL: https://issues.apache.org/jira/browse/HUDI-512 Project: Apache Hudi (incubating)

[GitHub] [incubator-hudi] bvaradar commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table

2020-01-07 Thread GitBox
bvaradar commented on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table URL: https://github.com/apache/incubator-hudi/pull/1128#issuecomment-571898125 @Guru107 : The corresponding non-empty clean.requested is under .aux folder. As a one

[jira] [Updated] (HUDI-512) Decouple logical partitioning from physical one.

2020-01-07 Thread Alexander Filipchik (Jira)
[ https://issues.apache.org/jira/browse/HUDI-512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander Filipchik updated HUDI-512: - Description: This one is more inspirational, but, I believe, will be very useful.

[jira] [Resolved] (HUDI-440) Rework the hudi web site

2020-01-07 Thread lamber-ken (Jira)
[ https://issues.apache.org/jira/browse/HUDI-440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lamber-ken resolved HUDI-440. - Resolution: Fixed Fixed at asf-site 312711dd220f5ffaeecfe711f9011b651ded72a2 > Rework the hudi web site >

[jira] [Updated] (HUDI-507) Support \ t split hdfs source

2020-01-07 Thread liujinhui (Jira)
[ https://issues.apache.org/jira/browse/HUDI-507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] liujinhui updated HUDI-507: --- Description: hi,hudi   Current Hudi data source does not support HDFS file data splitting with \ t

[jira] [Assigned] (HUDI-450) Refactor the codes based on scala codestyle MagicNumberChecker rule

2020-01-07 Thread Zijie Lu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zijie Lu reassigned HUDI-450: - Assignee: Zijie Lu > Refactor the codes based on scala codestyle MagicNumberChecker rule >

[jira] [Assigned] (HUDI-447) Refactor the codes based on scala codestyle IfBraceChecker rule

2020-01-07 Thread Zijie Lu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zijie Lu reassigned HUDI-447: - Assignee: Zijie Lu > Refactor the codes based on scala codestyle IfBraceChecker rule >

[jira] [Commented] (HUDI-507) Support \ t split hdfs source

2020-01-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010282#comment-17010282 ] Vinoth Chandar commented on HUDI-507: - There is a PR open for CSV source.. May be can see if sharing

[jira] [Updated] (HUDI-242) Support Efficient bootstrap of large parquet datasets to Hudi

2020-01-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-242: Fix Version/s: (was: 0.5.1) 0.6.0 > Support Efficient bootstrap of large

[jira] [Updated] (HUDI-289) Implement a test suite to support long running test for Hudi writing and querying end-end

2020-01-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-289: Fix Version/s: (was: 0.5.1) 0.5.2 > Implement a test suite to support long

[jira] [Commented] (HUDI-322) DeltaSteamer should pick checkpoints off only deltacommits for MOR tables

2020-01-07 Thread Shahida Khan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010379#comment-17010379 ] Shahida Khan commented on HUDI-322: --- Yes! [~vinoth]  I will need some more time, I know this should have

[jira] [Updated] (HUDI-403) Publish a deployment guide talking about deployment options, upgrading etc

2020-01-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-403: Status: In Progress (was: Open) > Publish a deployment guide talking about deployment options,

[jira] [Updated] (HUDI-288) Add support for ingesting multiple kafka streams in a single DeltaStreamer deployment

2020-01-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-288: Fix Version/s: (was: 0.5.1) 0.6.0 > Add support for ingesting multiple kafka

[GitHub] [incubator-hudi] bhasudha commented on a change in pull request #1122: [HUDI-29]: Support hudi COW table to use *ANALYZE TABLE table_name COMMPUTE STATISTICS* to get table current rows

2020-01-07 Thread GitBox
bhasudha commented on a change in pull request #1122: [HUDI-29]: Support hudi COW table to use *ANALYZE TABLE table_name COMMPUTE STATISTICS* to get table current rows URL: https://github.com/apache/incubator-hudi/pull/1122#discussion_r364017471 ## File path:

[jira] [Updated] (HUDI-507) Support \ t split hdfs source

2020-01-07 Thread liujinhui (Jira)
[ https://issues.apache.org/jira/browse/HUDI-507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] liujinhui updated HUDI-507: --- Description: hi,hudi   Current Hudi data source does not support HDFS file data splitting with \ t

[GitHub] [incubator-hudi] Guru107 edited a comment on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table

2020-01-07 Thread GitBox
Guru107 edited a comment on issue #1128: [HUDI-453] Fix throw failed to archive commits error when writing data to MOR/COW table URL: https://github.com/apache/incubator-hudi/pull/1128#issuecomment-571891243 @bvaradar Yes, it is part of the code, I checked it. I think the problem came

  1   2   >