[GitHub] [hudi] sathyaprakashg commented on a change in pull request #2012: [HUDI-1129] Deltastreamer Add support for schema evolution

2020-09-29 Thread GitBox
sathyaprakashg commented on a change in pull request #2012: URL: https://github.com/apache/hudi/pull/2012#discussion_r497257812 ## File path: hudi-spark/src/main/scala/org/apache/hudi/AvroConversionHelper.scala ## @@ -364,4 +366,40 @@ object AvroConversionHelper { }

[GitHub] [hudi] satishkotha edited a comment on issue #2123: Timestamp not parsed correctly on Athena

2020-09-29 Thread GitBox
satishkotha edited a comment on issue #2123: URL: https://github.com/apache/hudi/issues/2123#issuecomment-701154113 This is a bit complicated. Hudi uses spark converters to convert dataframe type into parquet type. Spark SchemaConverters converts timestamp to

[GitHub] [hudi] satishkotha commented on issue #2123: Timestamp not parsed correctly on Athena

2020-09-29 Thread GitBox
satishkotha commented on issue #2123: URL: https://github.com/apache/hudi/issues/2123#issuecomment-701154113 This is a bit complicated. Hudi uses spark converters to convert dataframe type into parquet type. Spark SchemaConverters converts timestamp to

[GitHub] [hudi] lw309637554 commented on a change in pull request #2133: [HUDI-1298] Add better error messages when IOException occurs during …

2020-09-29 Thread GitBox
lw309637554 commented on a change in pull request #2133: URL: https://github.com/apache/hudi/pull/2133#discussion_r497237342 ## File path: hudi-common/src/main/java/org/apache/hudi/exception/HoodieIOException.java ## @@ -18,27 +18,25 @@ package org.apache.hudi.exception;

[GitHub] [hudi] n3nash commented on pull request #2092: [HUDI-1285] Fix merge on read DAG to make docker demo pass

2020-09-29 Thread GitBox
n3nash commented on pull request #2092: URL: https://github.com/apache/hudi/pull/2092#issuecomment-701149391 @nsivabalan I actually fixed the rollback node as well, need a couple of days to add tests for it after which we can merge it and you will have a fully working DAG with all nodes

[GitHub] [hudi] n3nash commented on a change in pull request #2133: [HUDI-1298] Add better error messages when IOException occurs during …

2020-09-29 Thread GitBox
n3nash commented on a change in pull request #2133: URL: https://github.com/apache/hudi/pull/2133#discussion_r497235545 ## File path: hudi-common/src/main/java/org/apache/hudi/common/table/log/AbstractHoodieLogRecordScanner.java ## @@ -244,7 +244,7 @@ public void scan() {

[GitHub] [hudi] n3nash commented on a change in pull request #2133: [HUDI-1298] Add better error messages when IOException occurs during …

2020-09-29 Thread GitBox
n3nash commented on a change in pull request #2133: URL: https://github.com/apache/hudi/pull/2133#discussion_r497235216 ## File path: hudi-common/src/main/java/org/apache/hudi/exception/HoodieIOException.java ## @@ -18,27 +18,25 @@ package org.apache.hudi.exception;

[GitHub] [hudi] lw309637554 commented on pull request #2133: [HUDI-1298] Add better error messages when IOException occurs during …

2020-09-29 Thread GitBox
lw309637554 commented on pull request #2133: URL: https://github.com/apache/hudi/pull/2133#issuecomment-701138200 @n3nash hi, please help to review. This is an automated message from the Apache Git Service. To respond to

[jira] [Updated] (HUDI-1298) Add better error messages when IOException occurs during log file reading

2020-09-29 Thread liwei (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] liwei updated HUDI-1298: Status: In Progress (was: Open) > Add better error messages when IOException occurs during log file reading >

[jira] [Updated] (HUDI-1298) Add better error messages when IOException occurs during log file reading

2020-09-29 Thread liwei (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] liwei updated HUDI-1298: Status: Open (was: New) > Add better error messages when IOException occurs during log file reading >

[jira] [Created] (HUDI-1306) Write documentation/blog about SchemaProvider and subclasses

2020-09-29 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-1306: Summary: Write documentation/blog about SchemaProvider and subclasses Key: HUDI-1306 URL: https://issues.apache.org/jira/browse/HUDI-1306 Project: Apache Hudi

[jira] [Updated] (HUDI-1298) Add better error messages when IOException occurs during log file reading

2020-09-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-1298: - Labels: pull-request-available (was: ) > Add better error messages when IOException occurs

[GitHub] [hudi] lw309637554 opened a new pull request #2133: [HUDI-1298] Add better error messages when IOException occurs during …

2020-09-29 Thread GitBox
lw309637554 opened a new pull request #2133: URL: https://github.com/apache/hudi/pull/2133 …log file reading ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.* ##

[GitHub] [hudi] lw309637554 commented on pull request #2125: [HUDI-1301] use spark INCREMENTAL mode query hudi dataset support sch…

2020-09-29 Thread GitBox
lw309637554 commented on pull request #2125: URL: https://github.com/apache/hudi/pull/2125#issuecomment-701123956 @leesf the comments have been fixed.

[GitHub] [hudi] bvaradar merged pull request #2048: [HUDI-1072] Introduce REPLACE top level action

2020-09-29 Thread GitBox
bvaradar merged pull request #2048: URL: https://github.com/apache/hudi/pull/2048

[GitHub] [hudi] bvaradar commented on pull request #2064: WIP - [HUDI-842] Implementation of HUDI RFC-15.

2020-09-29 Thread GitBox
bvaradar commented on pull request #2064: URL: https://github.com/apache/hudi/pull/2064#issuecomment-701052206 @prashantwason: Also, have you been able to run metadata list commands successfully? Did you encounter this issue: Caused by: java.lang.ClassNotFoundException:

[GitHub] [hudi] bvaradar commented on a change in pull request #2064: WIP - [HUDI-842] Implementation of HUDI RFC-15.

2020-09-29 Thread GitBox
bvaradar commented on a change in pull request #2064: URL: https://github.com/apache/hudi/pull/2064#discussion_r497149559 ## File path: hudi-cli/src/main/java/org/apache/hudi/cli/commands/MetadataCommand.java ## @@ -0,0 +1,202 @@ +/* + * Licensed to the Apache Software

[GitHub] [hudi] bvaradar commented on a change in pull request #2064: WIP - [HUDI-842] Implementation of HUDI RFC-15.

2020-09-29 Thread GitBox
bvaradar commented on a change in pull request #2064: URL: https://github.com/apache/hudi/pull/2064#discussion_r497148269 ## File path: hudi-cli/src/main/java/org/apache/hudi/cli/commands/MetadataCommand.java ## @@ -0,0 +1,202 @@ +/* + * Licensed to the Apache Software

[GitHub] [hudi] prashantwason commented on pull request #2064: WIP - [HUDI-842] Implementation of HUDI RFC-15.

2020-09-29 Thread GitBox
prashantwason commented on pull request #2064: URL: https://github.com/apache/hudi/pull/2064#issuecomment-701026945 > I instrumented the code to track writeConfig.useFileListingMetadata. It looks like it is being disabled for a dataset that has the metadata dataset > > 20/09/26

[GitHub] [hudi] prashantwason commented on a change in pull request #2064: WIP - [HUDI-842] Implementation of HUDI RFC-15.

2020-09-29 Thread GitBox
prashantwason commented on a change in pull request #2064: URL: https://github.com/apache/hudi/pull/2064#discussion_r497097206 ## File path: hudi-client/src/main/java/org/apache/hudi/metadata/HoodieMetadataImpl.java ## @@ -0,0 +1,1104 @@ +/* + * Licensed to the Apache

[GitHub] [hudi] prashantwason commented on a change in pull request #2064: WIP - [HUDI-842] Implementation of HUDI RFC-15.

2020-09-29 Thread GitBox
prashantwason commented on a change in pull request #2064: URL: https://github.com/apache/hudi/pull/2064#discussion_r497095419 ## File path: hudi-cli/src/main/java/org/apache/hudi/cli/commands/MetadataCommand.java ## @@ -0,0 +1,202 @@ +/* + * Licensed to the Apache Software

[GitHub] [hudi] prashantwason commented on a change in pull request #2064: WIP - [HUDI-842] Implementation of HUDI RFC-15.

2020-09-29 Thread GitBox
prashantwason commented on a change in pull request #2064: URL: https://github.com/apache/hudi/pull/2064#discussion_r497095299 ## File path: hudi-cli/src/main/java/org/apache/hudi/cli/commands/MetadataCommand.java ## @@ -0,0 +1,202 @@ +/* + * Licensed to the Apache Software

[GitHub] [hudi] prashantwason commented on a change in pull request #2064: WIP - [HUDI-842] Implementation of HUDI RFC-15.

2020-09-29 Thread GitBox
prashantwason commented on a change in pull request #2064: URL: https://github.com/apache/hudi/pull/2064#discussion_r497079467 ## File path: hudi-client/src/main/java/org/apache/hudi/metadata/HoodieMetadataImpl.java ## @@ -0,0 +1,1064 @@ +/* + * Licensed to the Apache

[GitHub] [hudi] prashantwason commented on a change in pull request #2064: WIP - [HUDI-842] Implementation of HUDI RFC-15.

2020-09-29 Thread GitBox
prashantwason commented on a change in pull request #2064: URL: https://github.com/apache/hudi/pull/2064#discussion_r497074256 ## File path: hudi-client/src/main/java/org/apache/hudi/metadata/HoodieMetadataImpl.java ## @@ -0,0 +1,1064 @@ +/* + * Licensed to the Apache

[GitHub] [hudi] prashantwason commented on a change in pull request #2064: WIP - [HUDI-842] Implementation of HUDI RFC-15.

2020-09-29 Thread GitBox
prashantwason commented on a change in pull request #2064: URL: https://github.com/apache/hudi/pull/2064#discussion_r497074074 ## File path: hudi-client/src/main/java/org/apache/hudi/metadata/HoodieMetadataImpl.java ## @@ -0,0 +1,1064 @@ +/* + * Licensed to the Apache

[GitHub] [hudi] prashantwason commented on a change in pull request #2064: WIP - [HUDI-842] Implementation of HUDI RFC-15.

2020-09-29 Thread GitBox
prashantwason commented on a change in pull request #2064: URL: https://github.com/apache/hudi/pull/2064#discussion_r497072251 ## File path: hudi-client/src/main/java/org/apache/hudi/metadata/HoodieMetadataImpl.java ## @@ -0,0 +1,1064 @@ +/* + * Licensed to the Apache

[GitHub] [hudi] prashantwason commented on a change in pull request #2064: WIP - [HUDI-842] Implementation of HUDI RFC-15.

2020-09-29 Thread GitBox
prashantwason commented on a change in pull request #2064: URL: https://github.com/apache/hudi/pull/2064#discussion_r497070209 ## File path: hudi-client/src/main/java/org/apache/hudi/metadata/HoodieMetadataIndex.java ## @@ -0,0 +1,189 @@ +/* + * Licensed to the Apache

[GitHub] [hudi] prashantwason commented on a change in pull request #2064: WIP - [HUDI-842] Implementation of HUDI RFC-15.

2020-09-29 Thread GitBox
prashantwason commented on a change in pull request #2064: URL: https://github.com/apache/hudi/pull/2064#discussion_r497068099 ## File path: hudi-client/src/main/resources/metadataSchema.txt ## @@ -0,0 +1,43 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one

[GitHub] [hudi] prashantwason commented on a change in pull request #2064: WIP - [HUDI-842] Implementation of HUDI RFC-15.

2020-09-29 Thread GitBox
prashantwason commented on a change in pull request #2064: URL: https://github.com/apache/hudi/pull/2064#discussion_r497066694 ## File path: hudi-client/src/main/resources/metadataSchema.txt ## @@ -0,0 +1,43 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one

[jira] [Updated] (HUDI-1305) Prevent log pollution from console metrics logger

2020-09-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-1305: - Labels: pull-request-available (was: ) > Prevent log pollution from console metrics logger >

[GitHub] [hudi] prashantwason commented on pull request #2064: WIP - [HUDI-842] Implementation of HUDI RFC-15.

2020-09-29 Thread GitBox
prashantwason commented on pull request #2064: URL: https://github.com/apache/hudi/pull/2064#issuecomment-700996406 > Also see a lot of this polluting the test logs. can we remove or avoid them on travis Filed https://github.com/apache/hudi/pull/2132 for this issue.

[GitHub] [hudi] prashantwason opened a new pull request #2132: [HUDI-1305] Added an API to shutdown and remove the metrics reporter.

2020-09-29 Thread GitBox
prashantwason opened a new pull request #2132: URL: https://github.com/apache/hudi/pull/2132 This helps in removing the reporter once the test has completed. It prevents log pollution from unnecessary metric logs. ## What is the purpose of the pull request Prevent log pollution from

[jira] [Created] (HUDI-1305) Prevent log pollution from console metrics logger

2020-09-29 Thread Prashant Wason (Jira)
Prashant Wason created HUDI-1305: Summary: Prevent log pollution from console metrics logger Key: HUDI-1305 URL: https://issues.apache.org/jira/browse/HUDI-1305 Project: Apache Hudi Issue

[GitHub] [hudi] n3nash commented on issue #2123: Timestamp not parsed correctly on Athena

2020-09-29 Thread GitBox
n3nash commented on issue #2123: URL: https://github.com/apache/hudi/issues/2123#issuecomment-700966424 @satishkotha can you please help here? This seems like the same issue we encountered.

[jira] [Updated] (HUDI-1302) Add support for timestamp field in HiveSync

2020-09-29 Thread satish (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] satish updated HUDI-1302: - Status: In Progress (was: Open) > Add support for timestamp field in HiveSync >

[jira] [Updated] (HUDI-1302) Add support for timestamp field in HiveSync

2020-09-29 Thread satish (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] satish updated HUDI-1302: - Status: Open (was: New) > Add support for timestamp field in HiveSync >

[jira] [Updated] (HUDI-1302) Add support for timestamp field in HiveSync

2020-09-29 Thread satish (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] satish updated HUDI-1302: - Status: Patch Available (was: In Progress) > Add support for timestamp field in HiveSync >

[GitHub] [hudi] tandonraghavs opened a new issue #2131: HUDI with Mongo Oplogs (Debezium)

2020-09-29 Thread GitBox
tandonraghavs opened a new issue #2131: URL: https://github.com/apache/hudi/issues/2131 We are planning to use HUDI for our Data Warehouse, and we want to dump Mongo Data to S3. For Mongo we are relying on Oplogs (Debezium). But as I experimented with Hudi, Hudi is not working if I

[jira] [Created] (HUDI-1304) test compaction workflow with replacecommit action

2020-09-29 Thread satish (Jira)
satish created HUDI-1304: Summary: test compaction workflow with replacecommit action Key: HUDI-1304 URL: https://issues.apache.org/jira/browse/HUDI-1304 Project: Apache Hudi Issue Type: Sub-task

[jira] [Commented] (HUDI-1205) Serialization fail when log file is larger than 2GB

2020-09-29 Thread Anton (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17204186#comment-17204186 ] Anton commented on HUDI-1205: - Are there any updates on this issue? > Serialization fail when log file is

[GitHub] [hudi] satishkotha commented on pull request #2129: [HUDI-1302] Add support for timestamp field in HiveSync

2020-09-29 Thread GitBox
satishkotha commented on pull request #2129: URL: https://github.com/apache/hudi/pull/2129#issuecomment-700819993 > Can you test on Hive to check if Hive can read data with hudi avro timestamp type? > I think there are more test cases and other things to let Hive support timestamp

[GitHub] [hudi] lw309637554 commented on a change in pull request #2125: [HUDI-1301] use spark INCREMENTAL mode query hudi dataset support sch…

2020-09-29 Thread GitBox
lw309637554 commented on a change in pull request #2125: URL: https://github.com/apache/hudi/pull/2125#discussion_r496831803 ## File path: hudi-common/src/main/java/org/apache/hudi/common/table/TableSchemaResolver.java ## @@ -175,20 +175,45 @@ public MessageType

[GitHub] [hudi] lw309637554 commented on a change in pull request #2125: [HUDI-1301] use spark INCREMENTAL mode query hudi dataset support sch…

2020-09-29 Thread GitBox
lw309637554 commented on a change in pull request #2125: URL: https://github.com/apache/hudi/pull/2125#discussion_r496829881 ## File path: hudi-common/src/main/java/org/apache/hudi/common/table/TableSchemaResolver.java ## @@ -175,20 +175,45 @@ public MessageType

[GitHub] [hudi] Trevor-zhang opened a new issue #2130: [SUPPORT] Use hive jdbc to access the hudi table and report an error

2020-09-29 Thread GitBox
Trevor-zhang opened a new issue #2130: URL: https://github.com/apache/hudi/issues/2130 **_Tips before filing an issue_** - Have you gone through our [FAQs](https://cwiki.apache.org/confluence/display/HUDI/FAQ)? - Join the mailing list to engage in conversations and get faster

[GitHub] [hudi] leesf commented on pull request #1978: [HUDI-1184] Fix the support of hbase index partition path change

2020-09-29 Thread GitBox
leesf commented on pull request #1978: URL: https://github.com/apache/hudi/pull/1978#issuecomment-700783352 > @n3nash I tried to rebase and squash the commits but there are still two commits. Can you help me? Since @hj2016 has some problem in rebasing. If you @n3nash have no other

[GitHub] [hudi] leesf commented on a change in pull request #2115: [MINOR] Mark started and shutdownRequested with volatile.

2020-09-29 Thread GitBox
leesf commented on a change in pull request #2115: URL: https://github.com/apache/hudi/pull/2115#discussion_r496810222 ## File path: hudi-client/src/main/java/org/apache/hudi/async/AbstractAsyncService.java ## @@ -39,9 +39,9 @@ private static final Logger LOG =

[GitHub] [hudi] leesf commented on a change in pull request #2125: [HUDI-1301] use spark INCREMENTAL mode query hudi dataset support sch…

2020-09-29 Thread GitBox
leesf commented on a change in pull request #2125: URL: https://github.com/apache/hudi/pull/2125#discussion_r496804501 ## File path: hudi-common/src/main/java/org/apache/hudi/common/table/TableSchemaResolver.java ## @@ -175,20 +175,45 @@ public MessageType

[GitHub] [hudi] leesf commented on a change in pull request #2125: [HUDI-1301] use spark INCREMENTAL mode query hudi dataset support sch…

2020-09-29 Thread GitBox
leesf commented on a change in pull request #2125: URL: https://github.com/apache/hudi/pull/2125#discussion_r496802810 ## File path: hudi-common/src/main/java/org/apache/hudi/common/table/TableSchemaResolver.java ## @@ -175,20 +175,45 @@ public MessageType

[hudi] branch master updated: [HUDI-840] Avoid blank file created by HoodieLogFormatWriter (#1567)

2020-09-29 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 32c9cad [HUDI-840] Avoid blank file created by

[GitHub] [hudi] vinothchandar merged pull request #1567: [HUDI-840]Clean blank file created by HoodieLogFormatWriter

2020-09-29 Thread GitBox
vinothchandar merged pull request #1567: URL: https://github.com/apache/hudi/pull/1567

[GitHub] [hudi] vinothchandar commented on pull request #1567: [HUDI-840]Clean blank file created by HoodieLogFormatWriter

2020-09-29 Thread GitBox
vinothchandar commented on pull request #1567: URL: https://github.com/apache/hudi/pull/1567#issuecomment-700766623 LGTM

[GitHub] [hudi] getniz removed a comment on issue #2101: [SUPPORT]Unable to interpret Child JSON fields value as a separate columns rather it is loaded as one single field value. Any way to interpret

2020-09-29 Thread GitBox
getniz removed a comment on issue #2101: URL: https://github.com/apache/hudi/issues/2101#issuecomment-700727885 Closing this issue as I could use KSQL to flatten the incoming Nested JSON and then consume in HUDI instantaneously without staging.

[GitHub] [hudi] getniz closed issue #2101: [SUPPORT]Unable to interpret Child JSON fields value as a separate columns rather it is loaded as one single field value. Any way to interpret that.

2020-09-29 Thread GitBox
getniz closed issue #2101: URL: https://github.com/apache/hudi/issues/2101

[GitHub] [hudi] getniz commented on issue #2101: [SUPPORT]Unable to interpret Child JSON fields value as a separate columns rather it is loaded as one single field value. Any way to interpret that.

2020-09-29 Thread GitBox
getniz commented on issue #2101: URL: https://github.com/apache/hudi/issues/2101#issuecomment-700728117 Closing this issue as I could use KSQL to flatten the incoming Nested JSON and then consume in HUDI instantaneously without staging.

[GitHub] [hudi] getniz commented on issue #2101: [SUPPORT]Unable to interpret Child JSON fields value as a separate columns rather it is loaded as one single field value. Any way to interpret that.

2020-09-29 Thread GitBox
getniz commented on issue #2101: URL: https://github.com/apache/hudi/issues/2101#issuecomment-700727885 Closing this issue as I could use KSQL to flatten the incoming Nested JSON and then consume in HUDI instantaneously without staging.

[GitHub] [hudi] linshan-ma commented on pull request #2111: [HUDI-1234] Insert new records regardless of small file when using insert operation

2020-09-29 Thread GitBox
linshan-ma commented on pull request #2111: URL: https://github.com/apache/hudi/pull/2111#issuecomment-700597336 > > According to this (https://github.com/apache/hudi/issues/2051) test, I can't get the results I want. When we set

[GitHub] [hudi] ashishmgofficial commented on issue #2104: [SUPPORT] MOR Hive sync - _rt table read issue

2020-09-29 Thread GitBox
ashishmgofficial commented on issue #2104: URL: https://github.com/apache/hudi/issues/2104#issuecomment-700532786 @n3nash I terminated the EMR 6.0.0 and started a new EMR Cluster ver. 5.30 and everything works fine here

[GitHub] [hudi] ashishmgofficial closed issue #2104: [SUPPORT] MOR Hive sync - _rt table read issue

2020-09-29 Thread GitBox
ashishmgofficial closed issue #2104: URL: https://github.com/apache/hudi/issues/2104

[GitHub] [hudi] SteNicholas edited a comment on pull request #2111: [HUDI-1234] Insert new records regardless of small file when using insert operation

2020-09-29 Thread GitBox
SteNicholas edited a comment on pull request #2111: URL: https://github.com/apache/hudi/pull/2111#issuecomment-700524082 > According to this (https://github.com/apache/hudi/issues/2051) test, I can't get the results I want. When we set

[GitHub] [hudi] SteNicholas commented on pull request #2111: [HUDI-1234] Insert new records regardless of small file when using insert operation

2020-09-29 Thread GitBox
SteNicholas commented on pull request #2111: URL: https://github.com/apache/hudi/pull/2111#issuecomment-700524082 > According to this (https://github.com/apache/hudi/issues/2051) test, I can't get the results I want. When we set

[GitHub] [hudi] shenh062326 commented on a change in pull request #2085: [HUDI-1209] Properties File must be optional when running deltastreamer

2020-09-29 Thread GitBox
shenh062326 commented on a change in pull request #2085: URL: https://github.com/apache/hudi/pull/2085#discussion_r496482252 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/deltastreamer/HoodieDeltaStreamer.java ## @@ -112,9 +112,14 @@ public

[GitHub] [hudi] shenh062326 commented on a change in pull request #2085: [HUDI-1209] Properties File must be optional when running deltastreamer

2020-09-29 Thread GitBox
shenh062326 commented on a change in pull request #2085: URL: https://github.com/apache/hudi/pull/2085#discussion_r494256445 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/deltastreamer/HoodieDeltaStreamer.java ## @@ -112,9 +112,14 @@ public

[jira] [Assigned] (HUDI-1298) Add better error messages when IOException occurs during log file reading

2020-09-29 Thread Nishith Agarwal (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nishith Agarwal reassigned HUDI-1298: - Assignee: liwei (was: Nishith Agarwal) > Add better error messages when IOException

[jira] [Commented] (HUDI-1298) Add better error messages when IOException occurs during log file reading

2020-09-29 Thread Nishith Agarwal (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17203720#comment-17203720 ] Nishith Agarwal commented on HUDI-1298: --- [~309637554] I'd be glad to get some help on this, assigned

[GitHub] [hudi] hj2016 edited a comment on pull request #1978: [HUDI-1184] Fix the support of hbase index partition path change

2020-09-29 Thread GitBox
hj2016 edited a comment on pull request #1978: URL: https://github.com/apache/hudi/pull/1978#issuecomment-700475643 @n3nash I tried to rebase and squash the commits but there are still two commits. Can you help me?

[GitHub] [hudi] hj2016 commented on pull request #1978: [HUDI-1184] Fix the support of hbase index partition path change

2020-09-29 Thread GitBox
hj2016 commented on pull request #1978: URL: https://github.com/apache/hudi/pull/1978#issuecomment-700475643 @n3nash I tried to rebase and squash the commits but there are still two commits. Can you help me, think?

[jira] [Commented] (HUDI-1298) Add better error messages when IOException occurs during log file reading

2020-09-29 Thread liwei (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17203668#comment-17203668 ] liwei commented on HUDI-1298: - [~nishith29] hi, if this is not in progress, can I take this issue? And check

[GitHub] [hudi] yanghua commented on a change in pull request #2122: [HUDI-1274] Make hive synchronization supports hourly partition

2020-09-29 Thread GitBox
yanghua commented on a change in pull request #2122: URL: https://github.com/apache/hudi/pull/2122#discussion_r496439235 ## File path: hudi-sync/hudi-hive-sync/src/main/java/org/apache/hudi/hive/SlashEncodedDayPartitionValueExtractor.java ## @@ -36,30 +36,35 @@ private

[GitHub] [hudi] linshan-ma commented on pull request #2111: [HUDI-1234] Insert new records regardless of small file when using insert operation

2020-09-29 Thread GitBox
linshan-ma commented on pull request #2111: URL: https://github.com/apache/hudi/pull/2111#issuecomment-700471110 @leesf yes, it was updated. When we set hoodie.parquet.small.file.limit=0, it was inserted.

[hudi] branch master updated (c0c0095 -> 20b9b39)

2020-09-29 Thread vinoyang
This is an automated email from the ASF dual-hosted git repository. vinoyang pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from c0c0095 [MINOR] Reformat prepare_integration_suite script (#2126) add 20b9b39 [HUDI-1233] Deltastreamer Kafka