[jira] [Closed] (HUDI-1069) Remove duplicate assertNoWriteErrors()

2020-07-07 Thread vinoyang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] vinoyang closed HUDI-1069. -- Resolution: Fixed Fixed via master branch: 7b2a947aed5649f8cbbade748e464e1228da6e5d > Remove duplicate

[jira] [Updated] (HUDI-1069) Remove duplicate assertNoWriteErrors()

2020-07-07 Thread vinoyang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] vinoyang updated HUDI-1069: --- Status: Open (was: New) > Remove duplicate assertNoWriteErrors() > -- >

[GitHub] [hudi] yanghua merged pull request #1797: [HUDI-1069] Remove duplicate assertNoWriteErrors()

2020-07-07 Thread GitBox
yanghua merged pull request #1797: URL: https://github.com/apache/hudi/pull/1797 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[hudi] branch master updated: [HUDI-1069] Remove duplicate assertNoWriteErrors() (#1797)

2020-07-07 Thread vinoyang
This is an automated email from the ASF dual-hosted git repository. vinoyang pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 7b2a947 [HUDI-1069] Remove duplicate

[GitHub] [hudi] yanghua commented on a change in pull request #1797: [HUDI-1069] Remove duplicate assertNoWriteErrors()

2020-07-07 Thread GitBox
yanghua commented on a change in pull request #1797: URL: https://github.com/apache/hudi/pull/1797#discussion_r451300568 ## File path: hudi-client/src/test/java/org/apache/hudi/testutils/Assertions.java ## @@ -0,0 +1,50 @@ +/* + * Licensed to the Apache Software Foundation

[jira] [Commented] (HUDI-480) Support a querying delete data methond in incremental view

2020-07-07 Thread vinoyang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17153241#comment-17153241 ] vinoyang commented on HUDI-480: --- Yes, you provide a differentiated implementation of this feature based on

[GitHub] [hudi] vinothchandar commented on issue #1546: Issue - Table Read fails in Spark Submit , Where as succeeds in spark-shell

2020-07-07 Thread GitBox
vinothchandar commented on issue #1546: URL: https://github.com/apache/hudi/issues/1546#issuecomment-655292713 Great! This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [hudi] garyli1019 commented on issue #1789: [SUPPORT] What jars are needed to run on AWS Glue 1.0 ?

2020-07-07 Thread GitBox
garyli1019 commented on issue #1789: URL: https://github.com/apache/hudi/issues/1789#issuecomment-655278012 cc: @umehrot2 AWS related This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [hudi] n3nash commented on pull request #1242: [HUDI-544] Archived commits command code cleanup

2020-07-07 Thread GitBox
n3nash commented on pull request #1242: URL: https://github.com/apache/hudi/pull/1242#issuecomment-655266701 @hddong Please create a JIRA ticket here -> https://issues.apache.org/jira/projects/HUDI/issues and add the tag of documentation/release notes update.

[GitHub] [hudi] n3nash commented on pull request #1289: [HUDI-92] Provide reasonable names for Spark DAG stages in Hudi.

2020-07-07 Thread GitBox
n3nash commented on pull request #1289: URL: https://github.com/apache/hudi/pull/1289#issuecomment-655266313 @prashantwason can you rebase and push please ? I can then merge this This is an automated message from the Apache

[GitHub] [hudi] n3nash commented on a change in pull request #1562: [HUDI-837]: implemented custom deserializer for AvroKafkaSource

2020-07-07 Thread GitBox
n3nash commented on a change in pull request #1562: URL: https://github.com/apache/hudi/pull/1562#discussion_r451266799 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/serde/HoodieAvroKafkaDeserializer.java ## @@ -0,0 +1,83 @@ +/* + * Licensed to

[GitHub] [hudi] n3nash commented on pull request #1706: [HUDI-998] Introduce a robot to build testing website automatically

2020-07-07 Thread GitBox
n3nash commented on pull request #1706: URL: https://github.com/apache/hudi/pull/1706#issuecomment-655265702 @lamber-ken any updates on this ? This is an automated message from the Apache Git Service. To respond to the

Build failed in Jenkins: hudi-snapshot-deployment-0.5 #332

2020-07-07 Thread Apache Jenkins Server
See Changes: -- [...truncated 2.32 KB...] /home/jenkins/tools/maven/apache-maven-3.5.4/conf: logging settings.xml toolchains.xml

[GitHub] [hudi] RajasekarSribalan commented on issue #1794: [SUPPORT] Hudi delete operation but HiveSync failed

2020-07-07 Thread GitBox
RajasekarSribalan commented on issue #1794: URL: https://github.com/apache/hudi/issues/1794#issuecomment-655261425 Thanks Vinoth. I ll try to fetch the commit file but as of now I have now disabled hive sync for delete operation and now I don't get this error at all? Do you have any

[GitHub] [hudi] xushiyan commented on a change in pull request #1797: [HUDI-1069] Remove duplicate assertNoWriteErrors()

2020-07-07 Thread GitBox
xushiyan commented on a change in pull request #1797: URL: https://github.com/apache/hudi/pull/1797#discussion_r451245140 ## File path: hudi-client/src/test/java/org/apache/hudi/testutils/Assertions.java ## @@ -0,0 +1,50 @@ +/* + * Licensed to the Apache Software Foundation

[GitHub] [hudi] xushiyan commented on a change in pull request #1797: [HUDI-1069] Remove duplicate assertNoWriteErrors()

2020-07-07 Thread GitBox
xushiyan commented on a change in pull request #1797: URL: https://github.com/apache/hudi/pull/1797#discussion_r451244748 ## File path: hudi-client/src/test/java/org/apache/hudi/testutils/Assertions.java ## @@ -0,0 +1,50 @@ +/* + * Licensed to the Apache Software Foundation

[GitHub] [hudi] masterlemmi commented on issue #1791: [SUPPORT] Does DeltaStreamer support listening to multiple kafka topics and upserting to multiple tables?

2020-07-07 Thread GitBox
masterlemmi commented on issue #1791: URL: https://github.com/apache/hudi/issues/1791#issuecomment-655226661 sure. will do. thanks @vinothchandar This is an automated message from the Apache Git Service. To respond to the

[GitHub] [hudi] vinothchandar commented on issue #1806: [SUPPORT] Deltastreamer can`t validate rewritten record that is valid

2020-07-07 Thread GitBox
vinothchandar commented on issue #1806: URL: https://github.com/apache/hudi/issues/1806#issuecomment-655216403 https://github.com/apache/hudi/blob/release-0.5.3/hudi-common/src/main/java/org/apache/hudi/common/util/HoodieAvroUtils.java#L229 @prashantwason any ideas? @sbernauer

[GitHub] [hudi] yanghua commented on pull request #1779: [HUDI-1062]Remove unnecessary maxEvent check and add some log in KafkaOffsetGen

2020-07-07 Thread GitBox
yanghua commented on pull request #1779: URL: https://github.com/apache/hudi/pull/1779#issuecomment-655214863 > Hi @yanghua ,the test case have been added. Cc @wangxianghu Hi @Trevor-zhang Thanks for addressing my comment. Now, LGTM, let's wait and see if @vinothchandar still has

[GitHub] [hudi] yanghua commented on a change in pull request #1797: [HUDI-1069] Remove duplicate assertNoWriteErrors()

2020-07-07 Thread GitBox
yanghua commented on a change in pull request #1797: URL: https://github.com/apache/hudi/pull/1797#discussion_r451219310 ## File path: hudi-client/src/test/java/org/apache/hudi/testutils/Assertions.java ## @@ -0,0 +1,50 @@ +/* + * Licensed to the Apache Software Foundation

[GitHub] [hudi] vinothchandar commented on issue #1794: [SUPPORT] Hudi delete operation but HiveSync failed

2020-07-07 Thread GitBox
vinothchandar commented on issue #1794: URL: https://github.com/apache/hudi/issues/1794#issuecomment-655211543 `20200705101913__commit__COMPLETED` indicates the commit was completed successfully actually.. I think it's saying the commit file does not have a data file, from which it can

[GitHub] [hudi] vinothchandar commented on pull request #1808: [HUDI-1078]Fix IllegalArgumentException in Delete data demo of Quick-Start Guide

2020-07-07 Thread GitBox
vinothchandar commented on pull request #1808: URL: https://github.com/apache/hudi/pull/1808#issuecomment-655210255 @nsivabalan can you please review this This is an automated message from the Apache Git Service. To respond

[GitHub] [hudi] vinothchandar commented on pull request #1807: [HUDI-875] Abstract hudi-sync-common ,and support hudi-sync-hive

2020-07-07 Thread GitBox
vinothchandar commented on pull request #1807: URL: https://github.com/apache/hudi/pull/1807#issuecomment-655209009 @lw309637554 can you please check CI failure? Before diving in, are there any backwards compatible changes/special upgrade instructions needed for users with this

[GitHub] [hudi] vinothchandar commented on pull request #1484: [HUDI-316] : Hbase qps repartition writestatus

2020-07-07 Thread GitBox
vinothchandar commented on pull request #1484: URL: https://github.com/apache/hudi/pull/1484#issuecomment-655208316 @v3nkatesh are you able to write the rate limiter class yourself? :) if not can @garyli1019 @xushiyan help?

[GitHub] [hudi] vinothchandar commented on issue #1791: [SUPPORT] Does DeltaStreamer support listening to multiple kafka topics and upserting to multiple tables?

2020-07-07 Thread GitBox
vinothchandar commented on issue #1791: URL: https://github.com/apache/hudi/issues/1791#issuecomment-65520 @masterlemmi while we wait for @pratyakshsharma , you can just build hudi off master and simply run this class

[GitHub] [hudi] vinothchandar closed issue #1776: [SUPPORT] org.eclipse.jetty.server.session.SessionHandler.setHttpOnly(Z)V

2020-07-07 Thread GitBox
vinothchandar closed issue #1776: URL: https://github.com/apache/hudi/issues/1776 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [hudi] vinothchandar commented on issue #1776: [SUPPORT] org.eclipse.jetty.server.session.SessionHandler.setHttpOnly(Z)V

2020-07-07 Thread GitBox
vinothchandar commented on issue #1776: URL: https://github.com/apache/hudi/issues/1776#issuecomment-655206781 I meant HUDI-259 .. where we collect issues like this related to hadoop 3 This is an automated message from the

[jira] [Updated] (HUDI-259) Hadoop 3 support for Hudi writing

2020-07-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-259: Description: Sample issues   [https://github.com/apache/incubator-hudi/issues/735]

[jira] [Updated] (HUDI-259) Hadoop 3 support for Hudi writing

2020-07-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-259: Fix Version/s: 0.6.1 > Hadoop 3 support for Hudi writing > - > >

[GitHub] [hudi] vinothchandar commented on issue #1694: Slow Write into Hudi Dataset(MOR)

2020-07-07 Thread GitBox
vinothchandar commented on issue #1694: URL: https://github.com/apache/hudi/issues/1694#issuecomment-655205689 Happy to work more hands-on and get this working for you. lmk This is an automated message from the Apache Git

[GitHub] [hudi] vinothchandar commented on issue #1586: [SUPPORT] DMS with 2 key example

2020-07-07 Thread GitBox
vinothchandar commented on issue #1586: URL: https://github.com/apache/hudi/issues/1586#issuecomment-655205874 cc @pratyakshsharma any ideas? (since you are actively looking at this code) This is an automated message from

[GitHub] [hudi] vinothchandar commented on issue #1694: Slow Write into Hudi Dataset(MOR)

2020-07-07 Thread GitBox
vinothchandar commented on issue #1694: URL: https://github.com/apache/hudi/issues/1694#issuecomment-655205591 #1752 is the PR. What I am seeing is that the range based pruning is not very effective.. and is resulting in lots of shuffled data.. is there a way to not use

[GitHub] [hudi] vinothchandar commented on issue #998: Incremental view not implemented yet, for merge-on-read datasets

2020-07-07 Thread GitBox
vinothchandar commented on issue #998: URL: https://github.com/apache/hudi/issues/998#issuecomment-655200587 @WilliamWhispell Work for this is tracked in HUDI-651, HUDI-920, all tracked for 0.6.0, end of the month. @bhasudha has the changes, to make this work via SparkSQL/Hive.

[GitHub] [hudi] vinothchandar commented on issue #1745: Deltastreamer -Global bloom Index resulting Duplicates across partitions for Same record Key

2020-07-07 Thread GitBox
vinothchandar commented on issue #1745: URL: https://github.com/apache/hudi/issues/1745#issuecomment-655200192 now that we have the PR up.. closing this. This is an automated message from the Apache Git Service. To respond

[GitHub] [hudi] vinothchandar closed issue #1745: Deltastreamer -Global bloom Index resulting Duplicates across partitions for Same record Key

2020-07-07 Thread GitBox
vinothchandar closed issue #1745: URL: https://github.com/apache/hudi/issues/1745 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[jira] [Closed] (HUDI-58) Implement Spark incremental pull with merge-on-read #41

2020-07-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-58?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar closed HUDI-58. -- Resolution: Duplicate Dup of HUDI-651, HUDI-920 > Implement Spark incremental pull with merge-on-read

[GitHub] [hudi] vinothchandar commented on issue #1780: [SUPPORT]IllegalStateException: Hudi File Id has more than 1 pending compactions. MoR. Compaction inline.

2020-07-07 Thread GitBox
vinothchandar commented on issue #1780: URL: https://github.com/apache/hudi/issues/1780#issuecomment-655194540 @a-uddhav are you facing the same issue as well? @zuyanton by any chance you upgraded the writers before dropping new jars for queries? I think the issue is that

[jira] [Commented] (HUDI-480) Support a querying delete data methond in incremental view

2020-07-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17153133#comment-17153133 ] Vinoth Chandar commented on HUDI-480: - even just storing the row_key, if there are millions of deletes

[GitHub] [hudi] vinothchandar commented on issue #1787: Exception During Insert

2020-07-07 Thread GitBox
vinothchandar commented on issue #1787: URL: https://github.com/apache/hudi/issues/1787#issuecomment-655191206 it's okay to disable. But trying to understand why you got that error consistently.. was the port blocked on the driver? do you have the entire executor/driver logs..

[GitHub] [hudi] vinothchandar commented on issue #1738: [SUPPORT] java.io.FileNotFoundException: No such file or directory: .hoodie/archived

2020-07-07 Thread GitBox
vinothchandar commented on issue #1738: URL: https://github.com/apache/hudi/issues/1738#issuecomment-655184220 Based on slack discussion.. this is also resolved now? if not, @tooptoop4 do you have a small snippet I can use to repro this locally? we can make the code create the

[jira] [Updated] (HUDI-558) Introduce ability to compress bloom filters while storing in parquet

2020-07-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-558: Labels: pull-request-available (was: help-wanted pull-request-available) > Introduce ability to

[jira] [Updated] (HUDI-284) Need Tests for Hudi handling of schema evolution

2020-07-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-284: Labels: help-wanted starter (was: help-requested starter) > Need Tests for Hudi handling of schema

[jira] [Updated] (HUDI-677) Abstract/Refactor all transaction management logic into a set of classes

2020-07-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-677: Labels: help-requested (was: ) > Abstract/Refactor all transaction management logic into a set of

[jira] [Updated] (HUDI-677) Abstract/Refactor all transaction management logic into a set of classes

2020-07-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-677: Labels: (was: help-requested help-wanted) > Abstract/Refactor all transaction management logic

[jira] [Updated] (HUDI-86) Add indexing support to the log file format

2020-07-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-86?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-86: --- Labels: (was: help-requested) > Add indexing support to the log file format >

[jira] [Commented] (HUDI-984) Support Hive 1.x out of box

2020-07-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17153122#comment-17153122 ] Vinoth Chandar commented on HUDI-984: - IMO.. given Hive 3 is where we need to head towards.. this is

[jira] [Updated] (HUDI-984) Support Hive 1.x out of box

2020-07-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-984: Labels: (was: help-requested starter) > Support Hive 1.x out of box > ---

[GitHub] [hudi] vinothchandar closed issue #1730: [SUPPORT] unhelpful error message when there are parquets outside table base path

2020-07-07 Thread GitBox
vinothchandar closed issue #1730: URL: https://github.com/apache/hudi/issues/1730 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [hudi] vinothchandar commented on issue #1730: [SUPPORT] unhelpful error message when there are parquets outside table base path

2020-07-07 Thread GitBox
vinothchandar commented on issue #1730: URL: https://github.com/apache/hudi/issues/1730#issuecomment-655176203 yes.. makes sense.. closing this issue This is an automated message from the Apache Git Service. To respond to

[GitHub] [hudi] vinothchandar commented on pull request #1756: [HUDI-839] Adding unit test for MarkerFiles,RollbackUtils, RollbackActionExecutor for markers and filelisting

2020-07-07 Thread GitBox
vinothchandar commented on pull request #1756: URL: https://github.com/apache/hudi/pull/1756#issuecomment-655173307 Thanks! on it! This is an automated message from the Apache Git Service. To respond to the message, please

[jira] [Commented] (HUDI-47) Revisit null checks in the Log Blocks, merge lazyreading with this null check #340

2020-07-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-47?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17153114#comment-17153114 ] Vinoth Chandar commented on HUDI-47: general speaking, this task involves making this piece of code, lot

[jira] [Updated] (HUDI-45) Refactor handleWrite() in HoodieMergeHandle to offload conversion and merging of records to reader #374

2020-07-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-45?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-45: --- Labels: (was: help-requested starter) > Refactor handleWrite() in HoodieMergeHandle to offload

[jira] [Commented] (HUDI-45) Refactor handleWrite() in HoodieMergeHandle to offload conversion and merging of records to reader #374

2020-07-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-45?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17153113#comment-17153113 ] Vinoth Chandar commented on HUDI-45: This probably needs to be fleshed out in more detail > Refactor

[jira] [Closed] (HUDI-102) Beeline/Hive Client - select * on real-time views fails with schema related errors for tables with deep-nested schema #439

2020-07-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar closed HUDI-102. --- Resolution: Cannot Reproduce This is very old.. lot of fixes have gone in after this. without ways to

[jira] [Updated] (HUDI-102) Beeline/Hive Client - select * on real-time views fails with schema related errors for tables with deep-nested schema #439

2020-07-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-102: Status: In Progress (was: Open) > Beeline/Hive Client - select * on real-time views fails with

[jira] [Closed] (HUDI-39) Scope out how hudi can be integrated underneath Gobblin.. #407

2020-07-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-39?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar closed HUDI-39. -- Resolution: Won't Fix this needs lot more work on Gobblin side.. We can close for now. > Scope out how

[jira] [Updated] (HUDI-39) Scope out how hudi can be integrated underneath Gobblin.. #407

2020-07-07 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-39?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-39: --- Labels: (was: help-wanted) > Scope out how hudi can be integrated underneath Gobblin.. #407 >

[jira] [Updated] (HUDI-1080) Fix backward compatiblity for com.uber input formats

2020-07-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-1080: - Labels: pull-request-available (was: ) > Fix backward compatiblity for com.uber input formats >

[GitHub] [hudi] satishkotha opened a new pull request #1809: [HUDI-1080] Fix backward compatibility for com.uber inputformats

2020-07-07 Thread GitBox
satishkotha opened a new pull request #1809: URL: https://github.com/apache/hudi/pull/1809 ## What is the purpose of the pull request 1) InputFormat backward compatibility is broken in several places because equality check is done including package name in multiple places

[jira] [Created] (HUDI-1080) Fix backward compatiblity for com.uber input formats

2020-07-07 Thread satish (Jira)
satish created HUDI-1080: Summary: Fix backward compatiblity for com.uber input formats Key: HUDI-1080 URL: https://issues.apache.org/jira/browse/HUDI-1080 Project: Apache Hudi Issue Type: Bug

[GitHub] [hudi] tooptoop4 commented on issue #1802: [SUPPORT] Delete gives Caused by: org.apache.parquet.io.ParquetDecodingException: Can not read value at 0 in block -1 in file

2020-07-07 Thread GitBox
tooptoop4 commented on issue #1802: URL: https://github.com/apache/hudi/issues/1802#issuecomment-655052438 when col is a long type, spark 2.3 would return long from expr(concat(col)) but spark 2.4 would return string from expr(concat(col)). Fix was changing to expr(col) in spark 2.4 - to

[GitHub] [hudi] garyli1019 commented on issue #1786: [SUPPORT] Bulk insert slow on MOR

2020-07-07 Thread GitBox
garyli1019 commented on issue #1786: URL: https://github.com/apache/hudi/issues/1786#issuecomment-655021966 @rvd8345 Ok, 100 wouldn't be too much different from 64. During the Stage 5 `count xxx`, Hudi is actually writing the file into the filesystem. Even we reduce the parallelism number

[GitHub] [hudi] vinothchandar commented on issue #1802: [SUPPORT] Delete gives Caused by: org.apache.parquet.io.ParquetDecodingException: Can not read value at 0 in block -1 in file

2020-07-07 Thread GitBox
vinothchandar commented on issue #1802: URL: https://github.com/apache/hudi/issues/1802#issuecomment-654969872 ack! was this a parquet version issue ? This is an automated message from the Apache Git Service. To respond to

[GitHub] [hudi] Trevor-zhang edited a comment on pull request #1808: [HUDI-1078]Fix IllegalArgumentException in Delete data demo of Quick-Start Guide

2020-07-07 Thread GitBox
Trevor-zhang edited a comment on pull request #1808: URL: https://github.com/apache/hudi/pull/1808#issuecomment-654967198 detailed discuss is here:[HUDI-1078](https://issues.apache.org/jira/browse/HUDI-1078) This is an

[GitHub] [hudi] Trevor-zhang commented on pull request #1808: [HUDI-1078]Fix IllegalArgumentException in Delete data demo of Quick-Start Guide

2020-07-07 Thread GitBox
Trevor-zhang commented on pull request #1808: URL: https://github.com/apache/hudi/pull/1808#issuecomment-654967198 detailed discuss is here:[HUDI-1078](url) This is an automated message from the Apache Git Service. To

[GitHub] [hudi] Trevor-zhang opened a new pull request #1808: [HUDI-1078]Fix IllegalArgumentException in Delete data demo of Quick-Start Guide

2020-07-07 Thread GitBox
Trevor-zhang opened a new pull request #1808: URL: https://github.com/apache/hudi/pull/1808 …Start Guide ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.* ## What

[jira] [Updated] (HUDI-1078) Fix IllegalArgumentException in Delete data demo of Quick-Start Guide

2020-07-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-1078: - Labels: pull-request-available (was: ) > Fix IllegalArgumentException in Delete data demo of

[GitHub] [hudi] lw309637554 commented on pull request #1756: [HUDI-839] Adding unit test for MarkerFiles,RollbackUtils, RollbackActionExecutor for markers and filelisting

2020-07-07 Thread GitBox
lw309637554 commented on pull request #1756: URL: https://github.com/apache/hudi/pull/1756#issuecomment-654941314 @vinothchandar I have fix all the issue as you comment. More comprehensive solution about delete markerfile will depend on you

[GitHub] [hudi] lw309637554 commented on a change in pull request #1756: [HUDI-839] Adding unit test for MarkerFiles,RollbackUtils, RollbackActionExecutor for markers and filelisting

2020-07-07 Thread GitBox
lw309637554 commented on a change in pull request #1756: URL: https://github.com/apache/hudi/pull/1756#discussion_r449948557 ## File path: hudi-client/src/test/java/org/apache/hudi/table/action/rollback/TestCopyOnWriteRollbackActionExecutor.java ## @@ -0,0 +1,247 @@ +/* + *

[GitHub] [hudi] lw309637554 commented on pull request #1807: [HUDI-875] Abstract hudi-sync-common ,and support hudi-sync-hive

2020-07-07 Thread GitBox
lw309637554 commented on pull request #1807: URL: https://github.com/apache/hudi/pull/1807#issuecomment-654919654 @vinothchandar hello ,as discussion in https://github.com/apache/hudi/pull/1593. I submit a PR with the hudi-sync-common and reconstruct the old hudi-hive-sync to

[GitHub] [hudi] lw309637554 opened a new pull request #1807: [HUDI-875] Abstract hudi-sync-common ,and support hudi-sync-hive

2020-07-07 Thread GitBox
lw309637554 opened a new pull request #1807: URL: https://github.com/apache/hudi/pull/1807 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.* ## What is the purpose of

[GitHub] [hudi] sbernauer opened a new issue #1806: [SUPPORT] Deltastreamer can`t validate rewritten record that is valid

2020-07-07 Thread GitBox
sbernauer opened a new issue #1806: URL: https://github.com/apache/hudi/issues/1806 Hello together, we try to ingest Avro-Events from Kafka to an MOR table on S3. The Deltasteamer cannot validate our events, but they seem totally valid to me. This is the error message: ```

[jira] [Updated] (HUDI-1079) Cannot upsert on schema with Array of Record with single field

2020-07-07 Thread Adrian Tanase (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adrian Tanase updated HUDI-1079: Description: I am trying to trigger upserts on a table that has an array field with records of

[jira] [Comment Edited] (HUDI-1079) Cannot upsert on schema with Array of Record with single field

2020-07-07 Thread Adrian Tanase (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17152766#comment-17152766 ] Adrian Tanase edited comment on HUDI-1079 at 7/7/20, 2:18 PM: -- Quick update,

[jira] [Comment Edited] (HUDI-1079) Cannot upsert on schema with Array of Record with single field

2020-07-07 Thread Adrian Tanase (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17152766#comment-17152766 ] Adrian Tanase edited comment on HUDI-1079 at 7/7/20, 2:17 PM: -- Quick update,

[GitHub] [hudi] bvaradar commented on a change in pull request #1678: [HUDI-242] Metadata Bootstrap changes

2020-07-07 Thread GitBox
bvaradar commented on a change in pull request #1678: URL: https://github.com/apache/hudi/pull/1678#discussion_r444210548 ## File path: hudi-cli/pom.xml ## @@ -147,6 +147,43 @@

[jira] [Comment Edited] (HUDI-1079) Cannot upsert on schema with Array of Record with single field

2020-07-07 Thread Adrian Tanase (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17152766#comment-17152766 ] Adrian Tanase edited comment on HUDI-1079 at 7/7/20, 2:07 PM: -- Quick update,

[jira] [Commented] (HUDI-1079) Cannot upsert on schema with Array of Record with single field

2020-07-07 Thread Adrian Tanase (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17152766#comment-17152766 ] Adrian Tanase commented on HUDI-1079: - Quick update, I thought it's related to the nullability of the

[jira] [Updated] (HUDI-1079) Cannot upsert on schema with Array of Record with single field

2020-07-07 Thread Adrian Tanase (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adrian Tanase updated HUDI-1079: Description: I am trying to trigger upserts on a table that has an array field with records of

[GitHub] [hudi] aditanase commented on pull request #1406: [HUDI-713] Fix conversion of Spark array of struct type to Avro schema

2020-07-07 Thread GitBox
aditanase commented on pull request #1406: URL: https://github.com/apache/hudi/pull/1406#issuecomment-654874566 @vinothchandar @umehrot2 - I just added a flavor of this issue, for when the inner record of the array has only 1 field: https://issues.apache.org/jira/browse/HUDI-1079

[jira] [Commented] (HUDI-1079) Cannot upsert on schema with Array of Record with single field

2020-07-07 Thread Adrian Tanase (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17152761#comment-17152761 ] Adrian Tanase commented on HUDI-1079: - [~vinothchandar], [~uditme] - would you mind helping triage

[GitHub] [hudi] tooptoop4 edited a comment on issue #1730: [SUPPORT] unhelpful error message when there are parquets outside table base path

2020-07-07 Thread GitBox
tooptoop4 edited a comment on issue #1730: URL: https://github.com/apache/hudi/issues/1730#issuecomment-654797100 prestosql 336 with hudi 0.5.3 gives better error: ``` io.prestosql.spi.PrestoException: Index 2 out of bounds for length 1 at

[jira] [Created] (HUDI-1079) Cannot upsert on schema with Array of Record with single field

2020-07-07 Thread Adrian Tanase (Jira)
Adrian Tanase created HUDI-1079: --- Summary: Cannot upsert on schema with Array of Record with single field Key: HUDI-1079 URL: https://issues.apache.org/jira/browse/HUDI-1079 Project: Apache Hudi

[GitHub] [hudi] tooptoop4 edited a comment on issue #1730: [SUPPORT] unhelpful error message when there are parquets outside table base path

2020-07-07 Thread GitBox
tooptoop4 edited a comment on issue #1730: URL: https://github.com/apache/hudi/issues/1730#issuecomment-654797100 prestosql 336 with hudi 0.5.3 gives better error: ``` io.prestosql.spi.PrestoException: Index 2 out of bounds for length 1 at

[GitHub] [hudi] tooptoop4 edited a comment on issue #1730: [SUPPORT] unhelpful error message when there are parquets outside table base path

2020-07-07 Thread GitBox
tooptoop4 edited a comment on issue #1730: URL: https://github.com/apache/hudi/issues/1730#issuecomment-654797100 prestosql 336 with hudi 0.5.3 gives better error: ``` io.prestosql.spi.PrestoException: Index 2 out of bounds for length 1 at

[GitHub] [hudi] tooptoop4 edited a comment on issue #1730: [SUPPORT] unhelpful error message when there are parquets outside table base path

2020-07-07 Thread GitBox
tooptoop4 edited a comment on issue #1730: URL: https://github.com/apache/hudi/issues/1730#issuecomment-654797100 prestosql 336 with hudi 0.5.3 gives better error: ``` io.prestosql.spi.PrestoException: Index 2 out of bounds for length 1 at

[GitHub] [hudi] tooptoop4 edited a comment on issue #1730: [SUPPORT] unhelpful error message when there are parquets outside table base path

2020-07-07 Thread GitBox
tooptoop4 edited a comment on issue #1730: URL: https://github.com/apache/hudi/issues/1730#issuecomment-654797100 prestosql 336 with hudi 0.5.3 gives better error: ``` io.prestosql.spi.PrestoException: Index 2 out of bounds for length 1 at

[GitHub] [hudi] rvd8345 commented on issue #1786: [SUPPORT] Bulk insert slow on MOR

2020-07-07 Thread GitBox
rvd8345 commented on issue #1786: URL: https://github.com/apache/hudi/issues/1786#issuecomment-654842089 @garyli1019 I was talking about hoodie.bulkinsert.shuffle.parallelism only. Have tried with default 1500, 500 and 64 without much difference in runtime. I can try 100 also but not sure

[jira] [Updated] (HUDI-1078) Fix IllegalArgumentException in Delete data demo of Quick-Start Guide

2020-07-07 Thread wangxianghu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangxianghu updated HUDI-1078: -- Description: When running the [Delete data |#deletes]demo in Quick-Start Guide, I got this Exception:

[jira] [Updated] (HUDI-1078) Fix IllegalArgumentException in Delete data demo of Quick-Start Guide

2020-07-07 Thread wangxianghu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangxianghu updated HUDI-1078: -- Description: When running the [Delete data|#deletes]] demo in Quick-Start Guide, I got this Exception:

[jira] [Updated] (HUDI-1078) Fix IllegalArgumentException in Delete data demo of Quick-Start Guide

2020-07-07 Thread wangxianghu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangxianghu updated HUDI-1078: -- Description: When running the [Delete

[GitHub] [hudi] nsivabalan commented on a change in pull request #1793: [HUDI-1068] Fixing deletes in global bloom

2020-07-07 Thread GitBox
nsivabalan commented on a change in pull request #1793: URL: https://github.com/apache/hudi/pull/1793#discussion_r450812695 ## File path: hudi-client/src/test/java/org/apache/hudi/client/TestHoodieClientOnCopyOnWriteStorage.java ## @@ -399,56 +405,173 @@ public void

[jira] [Updated] (HUDI-1078) Fix IllegalArgumentException in Delete data demo of Quick-Start Guide

2020-07-07 Thread wangxianghu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangxianghu updated HUDI-1078: -- Description: When running the Delete data demo in  Step to reproduce:   > Fix

[GitHub] [hudi] nsivabalan commented on a change in pull request #1793: [HUDI-1068] Fixing deletes in global bloom

2020-07-07 Thread GitBox
nsivabalan commented on a change in pull request #1793: URL: https://github.com/apache/hudi/pull/1793#discussion_r450811380 ## File path: hudi-client/src/test/java/org/apache/hudi/client/TestHoodieClientOnCopyOnWriteStorage.java ## @@ -399,56 +405,173 @@ public void

[jira] [Assigned] (HUDI-1078) Fix IllegalArgumentException in Delete data demo of Quick-Start Guide

2020-07-07 Thread wangxianghu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangxianghu reassigned HUDI-1078: - Assignee: Trevorzhang (was: wangxianghu) > Fix IllegalArgumentException in Delete data demo of

[jira] [Assigned] (HUDI-1078) Fix IllegalArgumentException in Delete data demo of Quick-Start Guide

2020-07-07 Thread wangxianghu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangxianghu reassigned HUDI-1078: - Assignee: wangxianghu > Fix IllegalArgumentException in Delete data demo of Quick-Start Guide >

[jira] [Created] (HUDI-1078) Fix IllegalArgumentException in Delete data demo of Quick-Start Guide

2020-07-07 Thread wangxianghu (Jira)
wangxianghu created HUDI-1078: - Summary: Fix IllegalArgumentException in Delete data demo of Quick-Start Guide Key: HUDI-1078 URL: https://issues.apache.org/jira/browse/HUDI-1078 Project: Apache Hudi

[GitHub] [hudi] nsivabalan commented on a change in pull request #1793: [HUDI-1068] Fixing deletes in global bloom

2020-07-07 Thread GitBox
nsivabalan commented on a change in pull request #1793: URL: https://github.com/apache/hudi/pull/1793#discussion_r450809602 ## File path: hudi-client/src/test/java/org/apache/hudi/client/TestHoodieClientOnCopyOnWriteStorage.java ## @@ -399,56 +405,173 @@ public void

[GitHub] [hudi] tooptoop4 commented on issue #1730: [SUPPORT] unhelpful error message when there are parquets outside table base path

2020-07-07 Thread GitBox
tooptoop4 commented on issue #1730: URL: https://github.com/apache/hudi/issues/1730#issuecomment-654797100 prestosql 336 with hudi 0.5.3 gives better error: ``` io.prestosql.spi.PrestoException: Index 2 out of bounds for length 1 at

[GitHub] [hudi] leesf merged pull request #1805: [HUDI-1064]Trim hoodie table name

2020-07-07 Thread GitBox
leesf merged pull request #1805: URL: https://github.com/apache/hudi/pull/1805 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

  1   2   >