[GitHub] [incubator-hudi] adamjoneill commented on issue #1325: presto - querying nested object in parquet file created by hudi

2020-02-13 Thread GitBox
adamjoneill commented on issue #1325: presto - querying nested object in parquet file created by hudi URL: https://github.com/apache/incubator-hudi/issues/1325#issuecomment-585777129 Had some success, if i added a property to the object of simple type ie. not a nested object it appears to

[jira] [Created] (HUDI-608) Implement a flink datastream execution context

2020-02-13 Thread vinoyang (Jira)
vinoyang created HUDI-608: - Summary: Implement a flink datastream execution context Key: HUDI-608 URL: https://issues.apache.org/jira/browse/HUDI-608 Project: Apache Hudi (incubating) Issue Type:

[incubator-hudi] branch hudi_test_suite_refactor updated (5f22849 -> 7a0794a)

2020-02-13 Thread vinoyang
This is an automated email from the ASF dual-hosted git repository. vinoyang pushed a change to branch hudi_test_suite_refactor in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git. omit 5f22849 [HUDI-592] Remove duplicated dependencies in the pom file of test suite module

[GitHub] [incubator-hudi] adamjoneill edited a comment on issue #1325: presto - querying nested object in parquet file created by hudi

2020-02-13 Thread GitBox
adamjoneill edited a comment on issue #1325: presto - querying nested object in parquet file created by hudi URL: https://github.com/apache/incubator-hudi/issues/1325#issuecomment-585748045 Futher update I've narrowed it down to the array on the nested object. The following

[GitHub] [incubator-hudi] adamjoneill commented on issue #1325: presto - querying nested object in parquet file created by hudi

2020-02-13 Thread GitBox
adamjoneill commented on issue #1325: presto - querying nested object in parquet file created by hudi URL: https://github.com/apache/incubator-hudi/issues/1325#issuecomment-585748045 Futher update I've narrowed it down to the array on the nested object. The following works when

[incubator-hudi] branch master updated: [HUDI-574] Fix CLI counts small file inserts as updates (#1321)

2020-02-13 Thread leesf
This is an automated email from the ASF dual-hosted git repository. leesf pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git The following commit(s) were added to refs/heads/master by this push: new 01c868a [HUDI-574] Fix CLI counts small

[GitHub] [incubator-hudi] leesf merged pull request #1321: [HUDI-574] Fix CLI counts small file inserts as updates

2020-02-13 Thread GitBox
leesf merged pull request #1321: [HUDI-574] Fix CLI counts small file inserts as updates URL: https://github.com/apache/incubator-hudi/pull/1321 This is an automated message from the Apache Git Service. To respond to the

[GitHub] [incubator-hudi] adamjoneill edited a comment on issue #1325: presto - querying nested object in parquet file created by hudi

2020-02-13 Thread GitBox
adamjoneill edited a comment on issue #1325: presto - querying nested object in parquet file created by hudi URL: https://github.com/apache/incubator-hudi/issues/1325#issuecomment-585777129 Had some success, if i added a property to the object of simple type ie. not a nested object hudi

[GitHub] [incubator-hudi] adamjoneill edited a comment on issue #1325: presto - querying nested object in parquet file created by hudi

2020-02-13 Thread GitBox
adamjoneill edited a comment on issue #1325: presto - querying nested object in parquet file created by hudi URL: https://github.com/apache/incubator-hudi/issues/1325#issuecomment-585748045 Futher update I've narrowed it down to the array on the nested object. The following

[incubator-hudi] branch hudi_test_suite_refactor updated (7a0794a -> b04b037)

2020-02-13 Thread vinoyang
This is an automated email from the ASF dual-hosted git repository. vinoyang pushed a change to branch hudi_test_suite_refactor in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git. discard 7a0794a Fix compile error after rebasing the branch add b04b037 [MINOR] Fix

[jira] [Created] (HUDI-609) Implement a Flink specific HoodieIndex

2020-02-13 Thread vinoyang (Jira)
vinoyang created HUDI-609: - Summary: Implement a Flink specific HoodieIndex Key: HUDI-609 URL: https://issues.apache.org/jira/browse/HUDI-609 Project: Apache Hudi (incubating) Issue Type: Sub-task

[GitHub] [incubator-hudi] adamjoneill edited a comment on issue #1325: presto - querying nested object in parquet file created by hudi

2020-02-13 Thread GitBox
adamjoneill edited a comment on issue #1325: presto - querying nested object in parquet file created by hudi URL: https://github.com/apache/incubator-hudi/issues/1325#issuecomment-585777129 Had some success, if i added a property to the object of simple type ie. not a nested object hudi

[GitHub] [incubator-hudi] adamjoneill edited a comment on issue #1325: presto - querying nested object in parquet file created by hudi

2020-02-13 Thread GitBox
adamjoneill edited a comment on issue #1325: presto - querying nested object in parquet file created by hudi URL: https://github.com/apache/incubator-hudi/issues/1325#issuecomment-585777129 Had some success, if i added a property to the object of simple type ie. not a nested object hudi

[GitHub] [incubator-hudi] wangxianghu opened a new pull request #1331: [MINOR] Fix typo

2020-02-13 Thread GitBox
wangxianghu opened a new pull request #1331: [MINOR] Fix typo URL: https://github.com/apache/incubator-hudi/pull/1331 ## What is the purpose of the pull request *Fix typo* ## Brief change log *Fix typo* ## Verify this pull request This pull request is a

[jira] [Created] (HUDI-610) Impala nea real time table support

2020-02-13 Thread Yanjia Gary Li (Jira)
Yanjia Gary Li created HUDI-610: --- Summary: Impala nea real time table support Key: HUDI-610 URL: https://issues.apache.org/jira/browse/HUDI-610 Project: Apache Hudi (incubating) Issue Type:

[GitHub] [incubator-hudi] adamjoneill commented on issue #1325: presto - querying nested object in parquet file created by hudi

2020-02-13 Thread GitBox
adamjoneill commented on issue #1325: presto - querying nested object in parquet file created by hudi URL: https://github.com/apache/incubator-hudi/issues/1325#issuecomment-585932919 @vinothchandar from my investigation above it would suggest it to be how hudi writes parquet data.

[GitHub] [incubator-hudi] nbalajee commented on a change in pull request #1312: [HUDI-571] Add "compactions show archived" command to CLI

2020-02-13 Thread GitBox
nbalajee commented on a change in pull request #1312: [HUDI-571] Add "compactions show archived" command to CLI URL: https://github.com/apache/incubator-hudi/pull/1312#discussion_r379089359 ## File path: hudi-cli/src/main/java/org/apache/hudi/cli/commands/CompactionCommand.java

[GitHub] [incubator-hudi] vinothchandar commented on issue #1100: [HUDI-289] Implement a test suite to support long running test for Hudi writing and querying end-end

2020-02-13 Thread GitBox
vinothchandar commented on issue #1100: [HUDI-289] Implement a test suite to support long running test for Hudi writing and querying end-end URL: https://github.com/apache/incubator-hudi/pull/1100#issuecomment-585910315 >How have we validated the framework? are there real tests now?

[GitHub] [incubator-hudi] vinothchandar merged pull request #1331: [MINOR] Fix typo

2020-02-13 Thread GitBox
vinothchandar merged pull request #1331: [MINOR] Fix typo URL: https://github.com/apache/incubator-hudi/pull/1331 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub

[incubator-hudi] branch master updated: [MINOR] Fix typo (#1331)

2020-02-13 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git The following commit(s) were added to refs/heads/master by this push: new 175de0d [MINOR] Fix typo (#1331)

[GitHub] [incubator-hudi] vinothchandar commented on issue #1314: [HUDI-542] Introduce a new pom module named hudi-writer-common

2020-02-13 Thread GitBox
vinothchandar commented on issue #1314: [HUDI-542] Introduce a new pom module named hudi-writer-common URL: https://github.com/apache/incubator-hudi/pull/1314#issuecomment-585917397 @yanghua as I also mentioned on the JIRA. What you suggest makes sense when we are in the phase of

[GitHub] [incubator-hudi] vinothchandar commented on issue #1325: presto - querying nested object in parquet file created by hudi

2020-02-13 Thread GitBox
vinothchandar commented on issue #1325: presto - querying nested object in parquet file created by hudi URL: https://github.com/apache/incubator-hudi/issues/1325#issuecomment-585920479 @adamjoneill I am wondering if this is simply a presto issue.. Hudi/Presto integration leaves all of the

[jira] [Created] (HUDI-611) Impala sync tool

2020-02-13 Thread Yanjia Gary Li (Jira)
Yanjia Gary Li created HUDI-611: --- Summary: Impala sync tool Key: HUDI-611 URL: https://issues.apache.org/jira/browse/HUDI-611 Project: Apache Hudi (incubating) Issue Type: New Feature

[GitHub] [incubator-hudi] vinothchandar commented on issue #1328: Hudi upsert hangs

2020-02-13 Thread GitBox
vinothchandar commented on issue #1328: Hudi upsert hangs URL: https://github.com/apache/incubator-hudi/issues/1328#issuecomment-585991857 There must be something else going on.. just used my own benchmark jobs to generate a pattern where the records are fully overwritten in a second (and

[GitHub] [incubator-hudi] ramachandranms opened a new pull request #1332: [HUDI -409] Match header and footer block length to improve corrupted block detection

2020-02-13 Thread GitBox
ramachandranms opened a new pull request #1332: [HUDI -409] Match header and footer block length to improve corrupted block detection URL: https://github.com/apache/incubator-hudi/pull/1332 ## What is the purpose of the pull request - This PR is to address the JIRA ticket -

[jira] [Updated] (HUDI-589) Fix references to Views in some of the pages. Replace with Query instead

2020-02-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-589: Labels: pull-request-available (was: ) > Fix references to Views in some of the pages. Replace with

[GitHub] [incubator-hudi] bhasudha opened a new pull request #1333: [HUDI-589][DOCS] Fix querying_data page

2020-02-13 Thread GitBox
bhasudha opened a new pull request #1333: [HUDI-589][DOCS] Fix querying_data page URL: https://github.com/apache/incubator-hudi/pull/1333 - Added support matrix for COW and MOR tables - Change reference from (`views`|`pulls`) to `queries` - And minor restructuring ## *Tips*

[GitHub] [incubator-hudi] bhasudha commented on issue #1316: [HUDI-604] Update docker page

2020-02-13 Thread GitBox
bhasudha commented on issue #1316: [HUDI-604] Update docker page URL: https://github.com/apache/incubator-hudi/pull/1316#issuecomment-586033608 Will merge this with other doc changes later next week after cutting 0.5.1 doc version

[GitHub] [incubator-hudi] vinothchandar commented on issue #1325: presto - querying nested object in parquet file created by hudi

2020-02-13 Thread GitBox
vinothchandar commented on issue #1325: presto - querying nested object in parquet file created by hudi URL: https://github.com/apache/incubator-hudi/issues/1325#issuecomment-585993053 @adamjoneill Hmmm. this is surprising.. - Does a `select * from table` on hudi table work using

[GitHub] [incubator-hudi] ramachandranms commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page

2020-02-13 Thread GitBox
ramachandranms commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page URL: https://github.com/apache/incubator-hudi/pull/1333#discussion_r379190434 ## File path: docs/_docs/2_3_querying_data.md ## @@ -24,31 +24,49 @@ If `table name =

[GitHub] [incubator-hudi] bhasudha commented on issue #1329: [SUPPORT] Presto cannot query non-partitioned table

2020-02-13 Thread GitBox
bhasudha commented on issue #1329: [SUPPORT] Presto cannot query non-partitioned table URL: https://github.com/apache/incubator-hudi/issues/1329#issuecomment-586038416 @popart The stack trace you showed looks like somehow a partition metafile (".hoodie_partition_metadata") is created in

[GitHub] [incubator-hudi] n3nash commented on a change in pull request #1312: [HUDI-571] Add "compactions show archived" command to CLI

2020-02-13 Thread GitBox
n3nash commented on a change in pull request #1312: [HUDI-571] Add "compactions show archived" command to CLI URL: https://github.com/apache/incubator-hudi/pull/1312#discussion_r379164513 ## File path: hudi-cli/src/main/java/org/apache/hudi/cli/commands/CompactionCommand.java

[GitHub] [incubator-hudi] bhasudha commented on issue #1333: [HUDI-589][DOCS] Fix querying_data page

2020-02-13 Thread GitBox
bhasudha commented on issue #1333: [HUDI-589][DOCS] Fix querying_data page URL: https://github.com/apache/incubator-hudi/pull/1333#issuecomment-586032324 @vinothchandar @leesf Attaching screenshot pdf here for look and feel [Querying Hudi Tables - Apache

[GitHub] [incubator-hudi] ramachandranms commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page

2020-02-13 Thread GitBox
ramachandranms commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page URL: https://github.com/apache/incubator-hudi/pull/1333#discussion_r379190434 ## File path: docs/_docs/2_3_querying_data.md ## @@ -24,31 +24,49 @@ If `table name =

[GitHub] [incubator-hudi] n3nash commented on a change in pull request #1332: [HUDI -409] Match header and footer block length to improve corrupted block detection

2020-02-13 Thread GitBox
n3nash commented on a change in pull request #1332: [HUDI -409] Match header and footer block length to improve corrupted block detection URL: https://github.com/apache/incubator-hudi/pull/1332#discussion_r379212070 ## File path:

[GitHub] [incubator-hudi] n3nash commented on a change in pull request #1332: [HUDI -409] Match header and footer block length to improve corrupted block detection

2020-02-13 Thread GitBox
n3nash commented on a change in pull request #1332: [HUDI -409] Match header and footer block length to improve corrupted block detection URL: https://github.com/apache/incubator-hudi/pull/1332#discussion_r379212556 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1150: [HUDI-288]: Add support for ingesting multiple kafka streams in a single DeltaStreamer deployment

2020-02-13 Thread GitBox
vinothchandar commented on a change in pull request #1150: [HUDI-288]: Add support for ingesting multiple kafka streams in a single DeltaStreamer deployment URL: https://github.com/apache/incubator-hudi/pull/1150#discussion_r379215764 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1150: [HUDI-288]: Add support for ingesting multiple kafka streams in a single DeltaStreamer deployment

2020-02-13 Thread GitBox
vinothchandar commented on a change in pull request #1150: [HUDI-288]: Add support for ingesting multiple kafka streams in a single DeltaStreamer deployment URL: https://github.com/apache/incubator-hudi/pull/1150#discussion_r379215926 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1150: [HUDI-288]: Add support for ingesting multiple kafka streams in a single DeltaStreamer deployment

2020-02-13 Thread GitBox
vinothchandar commented on a change in pull request #1150: [HUDI-288]: Add support for ingesting multiple kafka streams in a single DeltaStreamer deployment URL: https://github.com/apache/incubator-hudi/pull/1150#discussion_r379216693 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on issue #1150: [HUDI-288]: Add support for ingesting multiple kafka streams in a single DeltaStreamer deployment

2020-02-13 Thread GitBox
vinothchandar commented on issue #1150: [HUDI-288]: Add support for ingesting multiple kafka streams in a single DeltaStreamer deployment URL: https://github.com/apache/incubator-hudi/pull/1150#issuecomment-586060460 @pratyakshsharma lets revive this ? :)

[incubator-hudi] branch hudi_test_suite_refactor updated (b04b037 -> 3063cd7)

2020-02-13 Thread vinoyang
This is an automated email from the ASF dual-hosted git repository. vinoyang pushed a change to branch hudi_test_suite_refactor in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git. omit b04b037 [MINOR] Fix compile error after rebasing the branch add 3063cd7 [MINOR]

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page

2020-02-13 Thread GitBox
vinothchandar commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page URL: https://github.com/apache/incubator-hudi/pull/1333#discussion_r379225899 ## File path: docs/_docs/2_3_querying_data.md ## @@ -84,55 +102,53 @@ using the hive session

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page

2020-02-13 Thread GitBox
vinothchandar commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page URL: https://github.com/apache/incubator-hudi/pull/1333#discussion_r379226110 ## File path: docs/_docs/2_3_querying_data.md ## @@ -84,55 +102,53 @@ using the hive session

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page

2020-02-13 Thread GitBox
vinothchandar commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page URL: https://github.com/apache/incubator-hudi/pull/1333#discussion_r379226403 ## File path: docs/_docs/2_3_querying_data.md ## @@ -84,55 +102,53 @@ using the hive session

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page

2020-02-13 Thread GitBox
vinothchandar commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page URL: https://github.com/apache/incubator-hudi/pull/1333#discussion_r379225479 ## File path: docs/_docs/2_3_querying_data.md ## @@ -84,55 +102,53 @@ using the hive session

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page

2020-02-13 Thread GitBox
vinothchandar commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page URL: https://github.com/apache/incubator-hudi/pull/1333#discussion_r379225601 ## File path: docs/_docs/2_3_querying_data.md ## @@ -84,55 +102,53 @@ using the hive session

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page

2020-02-13 Thread GitBox
vinothchandar commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page URL: https://github.com/apache/incubator-hudi/pull/1333#discussion_r379226038 ## File path: docs/_docs/2_3_querying_data.md ## @@ -84,55 +102,53 @@ using the hive session

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page

2020-02-13 Thread GitBox
vinothchandar commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page URL: https://github.com/apache/incubator-hudi/pull/1333#discussion_r379225202 ## File path: docs/_docs/2_3_querying_data.md ## @@ -84,55 +102,53 @@ using the hive session

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page

2020-02-13 Thread GitBox
vinothchandar commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page URL: https://github.com/apache/incubator-hudi/pull/1333#discussion_r379226205 ## File path: docs/_docs/2_3_querying_data.md ## @@ -84,55 +102,53 @@ using the hive session

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page

2020-02-13 Thread GitBox
vinothchandar commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page URL: https://github.com/apache/incubator-hudi/pull/1333#discussion_r379226624 ## File path: docs/_docs/2_3_querying_data.md ## @@ -145,8 +161,13 @@ Additionally,

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page

2020-02-13 Thread GitBox
vinothchandar commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page URL: https://github.com/apache/incubator-hudi/pull/1333#discussion_r379226552 ## File path: docs/_docs/2_3_querying_data.md ## @@ -145,8 +161,13 @@ Additionally,

[incubator-hudi] branch hudi_test_suite_refactor updated (3063cd7 -> 67fdda3)

2020-02-13 Thread vinoyang
This is an automated email from the ASF dual-hosted git repository. vinoyang pushed a change to branch hudi_test_suite_refactor in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git. discard 3063cd7 [MINOR] Fix compile error after rebasing the branch discard e064262

Build failed in Jenkins: hudi-snapshot-deployment-0.5 #188

2020-02-13 Thread Apache Jenkins Server
See Changes: -- [...truncated 2.29 KB...] /home/jenkins/tools/maven/apache-maven-3.5.4/boot: plexus-classworlds-2.5.2.jar

[GitHub] [incubator-hudi] lamber-ken opened a new pull request #1334: [HUDI-612] Fix return no data when using incremental query

2020-02-13 Thread GitBox
lamber-ken opened a new pull request #1334: [HUDI-612] Fix return no data when using incremental query URL: https://github.com/apache/incubator-hudi/pull/1334 ## What is the purpose of the pull request When using incremental query, if specify a specific time instant, will return no

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1150: [HUDI-288]: Add support for ingesting multiple kafka streams in a single DeltaStreamer deployment

2020-02-13 Thread GitBox
vinothchandar commented on a change in pull request #1150: [HUDI-288]: Add support for ingesting multiple kafka streams in a single DeltaStreamer deployment URL: https://github.com/apache/incubator-hudi/pull/1150#discussion_r379215764 ## File path:

[jira] [Updated] (HUDI-607) Hive sync fails to register tables partitioned by Date Type column

2020-02-13 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-607: Fix Version/s: 0.6.0 > Hive sync fails to register tables partitioned by Date Type column >

[GitHub] [incubator-hudi] vinothchandar commented on issue #1200: [HUDI-514] A schema provider to get metadata through Jdbc

2020-02-13 Thread GitBox
vinothchandar commented on issue #1200: [HUDI-514] A schema provider to get metadata through Jdbc URL: https://github.com/apache/incubator-hudi/pull/1200#issuecomment-586062515 @OpenOpened Thanks for working through this issue with us patiently! merged!

[incubator-hudi] branch master updated (175de0d -> dfbee67)

2020-02-13 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git. from 175de0d [MINOR] Fix typo (#1331) add dfbee67 [HUDI-514] A schema provider to get metadata through

[GitHub] [incubator-hudi] satishkotha commented on issue #1312: [HUDI-571] Add "compactions show archived" command to CLI

2020-02-13 Thread GitBox
satishkotha commented on issue #1312: [HUDI-571] Add "compactions show archived" command to CLI URL: https://github.com/apache/incubator-hudi/pull/1312#issuecomment-586066884 @n3nash I removed the comment. please take a look.

[GitHub] [incubator-hudi] bwu2 edited a comment on issue #1328: Hudi upsert hangs

2020-02-13 Thread GitBox
bwu2 edited a comment on issue #1328: Hudi upsert hangs URL: https://github.com/apache/incubator-hudi/issues/1328#issuecomment-586071719 Ok, thanks for this. I have run the jobs again. First, insert 4m records, then upsert 3m of them, then upsert 4m, then upsert 4m. The two jobs

[jira] [Commented] (HUDI-407) Implement a join-based index

2020-02-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17036652#comment-17036652 ] sivabalan narayanan commented on HUDI-407: -- actually yes. Will put up something by this weekend. I

[GitHub] [incubator-hudi] vinothchandar commented on issue #1100: [HUDI-289] Implement a test suite to support long running test for Hudi writing and querying end-end

2020-02-13 Thread GitBox
vinothchandar commented on issue #1100: [HUDI-289] Implement a test suite to support long running test for Hudi writing and querying end-end URL: https://github.com/apache/incubator-hudi/pull/1100#issuecomment-586071634 >About real tests, do you mean how to validate all the functions of

[GitHub] [incubator-hudi] bwu2 commented on issue #1328: Hudi upsert hangs

2020-02-13 Thread GitBox
bwu2 commented on issue #1328: Hudi upsert hangs URL: https://github.com/apache/incubator-hudi/issues/1328#issuecomment-586071719 Ok, thanks for this. I have run the jobs again. First, insert 4m records, then upsert 3m of them, then upsert 4m, then upsert 4m. The two jobs upserting

[GitHub] [incubator-hudi] bwu2 edited a comment on issue #1328: Hudi upsert hangs

2020-02-13 Thread GitBox
bwu2 edited a comment on issue #1328: Hudi upsert hangs URL: https://github.com/apache/incubator-hudi/issues/1328#issuecomment-586071719 Ok, thanks for this. I have run the jobs again. First, insert 4m records, then upsert 3m of them, then upsert 4m, then upsert 4m. The two jobs

[jira] [Updated] (HUDI-612) Fix return no data when using incremental query

2020-02-13 Thread lamber-ken (Jira)
[ https://issues.apache.org/jira/browse/HUDI-612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lamber-ken updated HUDI-612: Status: Open (was: New) > Fix return no data when using incremental query >

[jira] [Assigned] (HUDI-612) Fix return no data when using incremental query

2020-02-13 Thread lamber-ken (Jira)
[ https://issues.apache.org/jira/browse/HUDI-612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lamber-ken reassigned HUDI-612: --- Assignee: lamber-ken > Fix return no data when using incremental query >

[GitHub] [incubator-hudi] n3nash commented on a change in pull request #1332: [HUDI -409] Match header and footer block length to improve corrupted block detection

2020-02-13 Thread GitBox
n3nash commented on a change in pull request #1332: [HUDI -409] Match header and footer block length to improve corrupted block detection URL: https://github.com/apache/incubator-hudi/pull/1332#discussion_r379212556 ## File path:

[GitHub] [incubator-hudi] satishkotha commented on a change in pull request #1312: [HUDI-571] Add "compactions show archived" command to CLI

2020-02-13 Thread GitBox
satishkotha commented on a change in pull request #1312: [HUDI-571] Add "compactions show archived" command to CLI URL: https://github.com/apache/incubator-hudi/pull/1312#discussion_r379220952 ## File path: hudi-cli/src/main/java/org/apache/hudi/cli/commands/CompactionCommand.java

[GitHub] [incubator-hudi] liujianhuiouc commented on issue #1216: [HUDI-525] lack of insert info in delta_commit inflight

2020-02-13 Thread GitBox
liujianhuiouc commented on issue #1216: [HUDI-525] lack of insert info in delta_commit inflight URL: https://github.com/apache/incubator-hudi/pull/1216#issuecomment-586070652 @n3nash I dont have any definate case , in that case, the fields related to updates already in the metadata, so i

[jira] [Comment Edited] (HUDI-407) Implement a join-based index

2020-02-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17036652#comment-17036652 ] sivabalan narayanan edited comment on HUDI-407 at 2/14/20 2:49 AM: ---

[GitHub] [incubator-hudi] bwu2 edited a comment on issue #1328: Hudi upsert hangs

2020-02-13 Thread GitBox
bwu2 edited a comment on issue #1328: Hudi upsert hangs URL: https://github.com/apache/incubator-hudi/issues/1328#issuecomment-586071719 Ok, thanks for this. I have run the jobs again. First, insert 4m records, then upsert 3m of them, then upsert 4m, then upsert 4m. The two jobs

[jira] [Created] (HUDI-612) Fix return no data when using incremental query

2020-02-13 Thread lamber-ken (Jira)
lamber-ken created HUDI-612: --- Summary: Fix return no data when using incremental query Key: HUDI-612 URL: https://issues.apache.org/jira/browse/HUDI-612 Project: Apache Hudi (incubating) Issue

[jira] [Updated] (HUDI-612) Fix return no data when using incremental query

2020-02-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-612: Labels: pull-request-available (was: ) > Fix return no data when using incremental query >

[GitHub] [incubator-hudi] bwu2 commented on issue #1328: Hudi upsert hangs

2020-02-13 Thread GitBox
bwu2 commented on issue #1328: Hudi upsert hangs URL: https://github.com/apache/incubator-hudi/issues/1328#issuecomment-586108720 Ok, after reading https://github.com/apache/incubator-hudi/issues/800 (which is simialr), I think this is a memory issue (even though GC seems to be a small

[GitHub] [incubator-hudi] bwu2 edited a comment on issue #1328: Hudi upsert hangs

2020-02-13 Thread GitBox
bwu2 edited a comment on issue #1328: Hudi upsert hangs URL: https://github.com/apache/incubator-hudi/issues/1328#issuecomment-586108720 Ok, after reading https://github.com/apache/incubator-hudi/issues/800 (which is similar), I think this is a memory issue (even though GC seems to be a

[GitHub] [incubator-hudi] vinothchandar commented on issue #1165: [HUDI-76] Add CSV Source support for Hudi Delta Streamer

2020-02-13 Thread GitBox
vinothchandar commented on issue #1165: [HUDI-76] Add CSV Source support for Hudi Delta Streamer URL: https://github.com/apache/incubator-hudi/pull/1165#issuecomment-586061084 cc @nsivabalan could you take over in case @pratyakshsharma is busy..

[GitHub] [incubator-hudi] vinothchandar commented on issue #1176: [WIP] [HUDI-430] Adding InlineFileSystem to support embedding any file format as an InlineFile

2020-02-13 Thread GitBox
vinothchandar commented on issue #1176: [WIP] [HUDI-430] Adding InlineFileSystem to support embedding any file format as an InlineFile URL: https://github.com/apache/incubator-hudi/pull/1176#issuecomment-586061205 @nsivabalan are you still blocked by the test failures for this one?

[GitHub] [incubator-hudi] vinothchandar merged pull request #1200: [HUDI-514] A schema provider to get metadata through Jdbc

2020-02-13 Thread GitBox
vinothchandar merged pull request #1200: [HUDI-514] A schema provider to get metadata through Jdbc URL: https://github.com/apache/incubator-hudi/pull/1200 This is an automated message from the Apache Git Service. To respond

[GitHub] [incubator-hudi] yanghua commented on issue #1100: [HUDI-289] Implement a test suite to support long running test for Hudi writing and querying end-end

2020-02-13 Thread GitBox
yanghua commented on issue #1100: [HUDI-289] Implement a test suite to support long running test for Hudi writing and querying end-end URL: https://github.com/apache/incubator-hudi/pull/1100#issuecomment-586062275 > > How have we validated the framework? are there real tests now? >

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1330: [HUDI-607] Fix to allow creation/syncing of Hive tables partitioned by Date type columns

2020-02-13 Thread GitBox
vinothchandar commented on a change in pull request #1330: [HUDI-607] Fix to allow creation/syncing of Hive tables partitioned by Date type columns URL: https://github.com/apache/incubator-hudi/pull/1330#discussion_r379219794 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #1330: [HUDI-607] Fix to allow creation/syncing of Hive tables partitioned by Date type columns

2020-02-13 Thread GitBox
vinothchandar commented on a change in pull request #1330: [HUDI-607] Fix to allow creation/syncing of Hive tables partitioned by Date type columns URL: https://github.com/apache/incubator-hudi/pull/1330#discussion_r379219969 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on issue #1308: [Hudi-561] partition path config

2020-02-13 Thread GitBox
vinothchandar commented on issue #1308: [Hudi-561] partition path config URL: https://github.com/apache/incubator-hudi/pull/1308#issuecomment-586067261 hi @UZi5136225 you can actually define your own key generator class to do this.. This PR should not necessary..

[incubator-hudi] branch hudi_test_suite_refactor updated (67fdda3 -> 6fbd285)

2020-02-13 Thread vinoyang
This is an automated email from the ASF dual-hosted git repository. vinoyang pushed a change to branch hudi_test_suite_refactor in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git. discard 67fdda3 [MINOR] Fix compile error after rebasing the branch add 6fbd285 [MINOR]

[jira] [Commented] (HUDI-407) Implement a join-based index

2020-02-13 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17036719#comment-17036719 ] Vinoth Chandar commented on HUDI-407: - okay sounds good  > Implement a join-based index >

[GitHub] [incubator-hudi] lamber-ken closed pull request #1334: [HUDI-612] Fix return no data when using incremental query

2020-02-13 Thread GitBox
lamber-ken closed pull request #1334: [HUDI-612] Fix return no data when using incremental query URL: https://github.com/apache/incubator-hudi/pull/1334 This is an automated message from the Apache Git Service. To respond

[GitHub] [incubator-hudi] lamber-ken commented on issue #1334: [HUDI-612] Fix return no data when using incremental query

2020-02-13 Thread GitBox
lamber-ken commented on issue #1334: [HUDI-612] Fix return no data when using incremental query URL: https://github.com/apache/incubator-hudi/pull/1334#issuecomment-586136740 hi @vinothchandar, we met this problem in our staging env, we want to know the "incremental" data when specify a

[GitHub] [incubator-hudi] vinothchandar commented on issue #1334: [HUDI-612] Fix return no data when using incremental query

2020-02-13 Thread GitBox
vinothchandar commented on issue #1334: [HUDI-612] Fix return no data when using incremental query URL: https://github.com/apache/incubator-hudi/pull/1334#issuecomment-586133592 can we also close the JIRA? This is an

[jira] [Closed] (HUDI-612) Fix return no data when using incremental query

2020-02-13 Thread lamber-ken (Jira)
[ https://issues.apache.org/jira/browse/HUDI-612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lamber-ken closed HUDI-612. --- Resolution: Not A Bug Sorry, I understand the "incremental" in a wrong way. Btw, how does user query data

[GitHub] [incubator-hudi] lamber-ken commented on issue #1334: [HUDI-612] Fix return no data when using incremental query

2020-02-13 Thread GitBox
lamber-ken commented on issue #1334: [HUDI-612] Fix return no data when using incremental query URL: https://github.com/apache/incubator-hudi/pull/1334#issuecomment-586135378 Right, I understand the "incremental" in a wrong way. BTW, how does user query data when specify a specific

[jira] [Updated] (HUDI-581) NOTICE need more work as it missing content form included 3rd party ALv2 licensed NOTICE files

2020-02-13 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-581: Fix Version/s: 0.6.0 > NOTICE need more work as it missing content form included 3rd party ALv2 >