[GitHub] [incubator-hudi] NetsanetGeb commented on issue #714: Performance Comparison of HoodieDeltaStreamer and DataSourceAPI

2019-06-05 Thread GitBox
NetsanetGeb commented on issue #714: Performance Comparison of HoodieDeltaStreamer and DataSourceAPI URL: https://github.com/apache/incubator-hudi/issues/714#issuecomment-498998794 Yes, they have the same amount of data at the beginning as a source input. But in the middle there are some d

[GitHub] [incubator-hudi] amaranathv opened a new issue #716: Class not found error in mapr platform

2019-06-05 Thread GitBox
amaranathv opened a new issue #716: Class not found error in mapr platform URL: https://github.com/apache/incubator-hudi/issues/716 I am getting the error below while trying to do the quick start in our cluster environment. I am able to do it successfully in the mapr LIVE environment but no

[GitHub] [incubator-hudi] garyli1019 commented on issue #715: [Bug]Hudi 0.4.7 HoodieTable not found

2019-06-05 Thread GitBox
garyli1019 commented on issue #715: [Bug]Hudi 0.4.7 HoodieTable not found URL: https://github.com/apache/incubator-hudi/issues/715#issuecomment-499195989 Hi @bvaradar, yes, it didn't cause any duplicates. In my case, I would like to directly read parquet without the hoodie format, but the di

[GitHub] [incubator-hudi] garyli1019 closed issue #715: [Bug]Hudi 0.4.7 HoodieTable not found

2019-06-05 Thread GitBox
garyli1019 closed issue #715: [Bug]Hudi 0.4.7 HoodieTable not found URL: https://github.com/apache/incubator-hudi/issues/715 This is an automated message from the Apache Git Service. To respond to the message, please log on t

[GitHub] [incubator-hudi] vinothchandar commented on issue #714: Performance Comparison of HoodieDeltaStreamer and DataSourceAPI

2019-06-05 Thread GitBox
vinothchandar commented on issue #714: Performance Comparison of HoodieDeltaStreamer and DataSourceAPI URL: https://github.com/apache/incubator-hudi/issues/714#issuecomment-499259238 My guess is that your parallelism of 1500 is excessive for such a low volume of data and the additional spark overh

[GitHub] [incubator-hudi] garyli1019 edited a comment on issue #715: [Bug]Hudi 0.4.7 HoodieTable not found

2019-06-05 Thread GitBox
garyli1019 edited a comment on issue #715: [Bug]Hudi 0.4.7 HoodieTable not found URL: https://github.com/apache/incubator-hudi/issues/715#issuecomment-499195989 Hi @bvaradar, yes, it didn't cause any duplicates. In my case, I would like to directly read parquet without the hoodie format, but

[GitHub] [incubator-hudi] bvaradar commented on issue #704: Shutdown hook was not called using spark on Kubernetes or spark on yarn

2019-06-05 Thread GitBox
bvaradar commented on issue #704: Shutdown hook was not called using spark on Kubernetes or spark on yarn URL: https://github.com/apache/incubator-hudi/issues/704#issuecomment-499288870 @eisig: Any updates on this issue?

[GitHub] [incubator-hudi] bvaradar commented on issue #673: Hudi-96: Command line options instead of positional arguments

2019-06-05 Thread GitBox
bvaradar commented on issue #673: Hudi-96: Command line options instead of positional arguments URL: https://github.com/apache/incubator-hudi/pull/673#issuecomment-499289822 @abhioncbr: This PR is almost ready. Let us know if you need any help. Thank you, Balaji.V

[GitHub] [incubator-hudi] bvaradar commented on issue #716: Class not found error in mapr platform

2019-06-05 Thread GitBox
bvaradar commented on issue #716: Class not found error in mapr platform URL: https://github.com/apache/incubator-hudi/issues/716#issuecomment-499291270 @amaranathv It looks like you have compiled Hudi with Hive 1.x but the docker setup has Hive 2.x. HiveMetaStoreClient.alter_table functi

[GitHub] [incubator-hudi] bvaradar commented on issue #677: presto hoodie bundle giving error

2019-06-05 Thread GitBox
bvaradar commented on issue #677: presto hoodie bundle giving error URL: https://github.com/apache/incubator-hudi/issues/677#issuecomment-499296195 Closing this due to inactivity. Please reopen if issue persists.

[GitHub] [incubator-hudi] bvaradar closed issue #677: presto hoodie bundle giving error

2019-06-05 Thread GitBox
bvaradar closed issue #677: presto hoodie bundle giving error URL: https://github.com/apache/incubator-hudi/issues/677

[GitHub] [incubator-hudi] bvaradar commented on issue #681: spark-bundle packaging shades HiveDriver

2019-06-05 Thread GitBox
bvaradar commented on issue #681: spark-bundle packaging shades HiveDriver URL: https://github.com/apache/incubator-hudi/issues/681#issuecomment-499296335 @arw357: Were you able to resolve the problem?

[GitHub] [incubator-hudi] bvaradar closed issue #663: Cannot query real time table

2019-06-05 Thread GitBox
bvaradar closed issue #663: Cannot query real time table URL: https://github.com/apache/incubator-hudi/issues/663

[GitHub] [incubator-hudi] bvaradar commented on issue #663: Cannot query real time table

2019-06-05 Thread GitBox
bvaradar commented on issue #663: Cannot query real time table URL: https://github.com/apache/incubator-hudi/issues/663#issuecomment-499296421 Closing this due to inactivity. Please reopen if issue persists.

[GitHub] [incubator-hudi] bvaradar closed issue #498: Is there any record delete examples?

2019-06-05 Thread GitBox
bvaradar closed issue #498: Is there any record delete examples? URL: https://github.com/apache/incubator-hudi/issues/498

[GitHub] [incubator-hudi] bvaradar commented on issue #498: Is there any record delete examples?

2019-06-05 Thread GitBox
bvaradar commented on issue #498: Is there any record delete examples? URL: https://github.com/apache/incubator-hudi/issues/498#issuecomment-499296604 Resolved.

[GitHub] [incubator-hudi] bvaradar closed issue #588: Has anyone used hudi with AWS EMR and EMRFS on s3?

2019-06-05 Thread GitBox
bvaradar closed issue #588: Has anyone used hudi with AWS EMR and EMRFS on s3? URL: https://github.com/apache/incubator-hudi/issues/588

[GitHub] [incubator-hudi] bvaradar commented on issue #588: Has anyone used hudi with AWS EMR and EMRFS on s3?

2019-06-05 Thread GitBox
bvaradar commented on issue #588: Has anyone used hudi with AWS EMR and EMRFS on s3? URL: https://github.com/apache/incubator-hudi/issues/588#issuecomment-499296932 Closing this due to inactivity. Please reopen if issue persists.

[GitHub] [incubator-hudi] abhioncbr commented on issue #673: Hudi-96: Command line options instead of positional arguments

2019-06-05 Thread GitBox
abhioncbr commented on issue #673: Hudi-96: Command line options instead of positional arguments URL: https://github.com/apache/incubator-hudi/pull/673#issuecomment-499308469 @bvaradar I will try to get it done by the weekend. Also, I checked on the stylesheet issue, and it looks like so

[GitHub] [incubator-hudi] bvaradar opened a new pull request #717: LogFile comparator must handle log file names without write token for backwards compatibility

2019-06-05 Thread GitBox
bvaradar opened a new pull request #717: LogFile comparator must handle log file names without write token for backwards compatibility URL: https://github.com/apache/incubator-hudi/pull/717 Found this during testing in staging.

[GitHub] [incubator-hudi] bvaradar commented on issue #717: LogFile comparator must handle log file names without write token for backwards compatibility

2019-06-05 Thread GitBox
bvaradar commented on issue #717: LogFile comparator must handle log file names without write token for backwards compatibility URL: https://github.com/apache/incubator-hudi/pull/717#issuecomment-499313164 @n3nash @vinothchandar: Please review when you get a chance. Found this issue when

[GitHub] [incubator-hudi] bvaradar commented on issue #715: [Bug]Hudi 0.4.7 HoodieTable not found

2019-06-05 Thread GitBox
bvaradar commented on issue #715: [Bug]Hudi 0.4.7 HoodieTable not found URL: https://github.com/apache/incubator-hudi/issues/715#issuecomment-499314321 @garyli1019: The Cleaner policy sounds good. But this would only work when you are NOT querying the data during the time the write is happeni

[GitHub] [incubator-hudi] bvaradar opened a new pull request #718: Auto generated Slack Channel Notifications setup for Travis CI

2019-06-05 Thread GitBox
bvaradar opened a new pull request #718: Auto generated Slack Channel Notifications setup for Travis CI URL: https://github.com/apache/incubator-hudi/pull/718

[GitHub] [incubator-hudi] garyli1019 commented on issue #715: [Bug]Hudi 0.4.7 HoodieTable not found

2019-06-05 Thread GitBox
garyli1019 commented on issue #715: [Bug]Hudi 0.4.7 HoodieTable not found URL: https://github.com/apache/incubator-hudi/issues/715#issuecomment-499345235 @bvaradar What will be the consequence of that? Based on my understanding, hudi will generate a new version first, then the cleaner will

[GitHub] [incubator-hudi] bvaradar commented on issue #715: [Bug]Hudi 0.4.7 HoodieTable not found

2019-06-05 Thread GitBox
bvaradar commented on issue #715: [Bug]Hudi 0.4.7 HoodieTable not found URL: https://github.com/apache/incubator-hudi/issues/715#issuecomment-499350032 @garyli1019: Yes, Hudi will generate a new version first and then the cleaner will delete the old version. If the reader doesn't recognize Hudi f

[GitHub] [incubator-hudi] garyli1019 commented on issue #715: [Bug]Hudi 0.4.7 HoodieTable not found

2019-06-05 Thread GitBox
garyli1019 commented on issue #715: [Bug]Hudi 0.4.7 HoodieTable not found URL: https://github.com/apache/incubator-hudi/issues/715#issuecomment-499354537 @bvaradar Gotcha, I am trying to use Impala to read the data. Currently, my approach is to maintain a single version of parquet. Impa