[GitHub] [incubator-hudi] leilinen opened a new issue #615: how change HoodieDeltaStreamer with Real-time calculation

2019-03-26 Thread GitBox
leilinen opened a new issue #615: how change HoodieDeltaStreamer with Real-time calculation URL: https://github.com/apache/incubator-hudi/issues/615 Hi, In my project, I want to consume data from kafka topic and upsert data with hoodie. In hoodie, the HoodieDeltaStreamer is

[GitHub] [incubator-hudi] cdmikechen commented on issue #547: The question about Hudi transform timestamp and decimal's type

2019-03-26 Thread GitBox
cdmikechen commented on issue #547: The question about Hudi transform timestamp and decimal's type URL: https://github.com/apache/incubator-hudi/issues/547#issuecomment-476621181 @vinothchandar Sorry for delay. I've changed my job in February, and my new job works for an automobile

[GitHub] [incubator-hudi] n3nash commented on a change in pull request #612: Fixing source schema and writer schema distinction in payloads

2019-03-26 Thread GitBox
n3nash commented on a change in pull request #612: Fixing source schema and writer schema distinction in payloads URL: https://github.com/apache/incubator-hudi/pull/612#discussion_r269236754 ## File path: hoodie-client/src/main/java/com/uber/hoodie/io/HoodieIOHandle.java

[GitHub] [incubator-hudi] bvaradar commented on issue #581: ClassNotFoundException:HoodieInputFormat

2019-03-26 Thread GitBox
bvaradar commented on issue #581: ClassNotFoundException:HoodieInputFormat URL: https://github.com/apache/incubator-hudi/issues/581#issuecomment-476779092 @Achuth17 : This was done on purpose. There are lot of different hadoop/hive environments out there. With Hudi having to interoperate

[GitHub] [incubator-hudi] bvaradar commented on issue #581: ClassNotFoundException:HoodieInputFormat

2019-03-26 Thread GitBox
bvaradar commented on issue #581: ClassNotFoundException:HoodieInputFormat URL: https://github.com/apache/incubator-hudi/issues/581#issuecomment-476779549 @daikon12 : Is run_hive_sync tool working fine now ? This is an

[GitHub] [incubator-hudi] n3nash commented on a change in pull request #612: Fixing source schema and writer schema distinction in payloads

2019-03-26 Thread GitBox
n3nash commented on a change in pull request #612: Fixing source schema and writer schema distinction in payloads URL: https://github.com/apache/incubator-hudi/pull/612#discussion_r269237813 ## File path:

[GitHub] [incubator-hudi] n3nash commented on a change in pull request #612: Fixing source schema and writer schema distinction in payloads

2019-03-26 Thread GitBox
n3nash commented on a change in pull request #612: Fixing source schema and writer schema distinction in payloads URL: https://github.com/apache/incubator-hudi/pull/612#discussion_r269237737 ## File path: hoodie-client/src/main/java/com/uber/hoodie/io/HoodieIOHandle.java

[GitHub] [incubator-hudi] n3nash commented on a change in pull request #612: Fixing source schema and writer schema distinction in payloads

2019-03-26 Thread GitBox
n3nash commented on a change in pull request #612: Fixing source schema and writer schema distinction in payloads URL: https://github.com/apache/incubator-hudi/pull/612#discussion_r269236754 ## File path: hoodie-client/src/main/java/com/uber/hoodie/io/HoodieIOHandle.java

[GitHub] [incubator-hudi] n3nash commented on a change in pull request #612: Fixing source schema and writer schema distinction in payloads

2019-03-26 Thread GitBox
n3nash commented on a change in pull request #612: Fixing source schema and writer schema distinction in payloads URL: https://github.com/apache/incubator-hudi/pull/612#discussion_r269236502 ## File path: hoodie-client/src/main/java/com/uber/hoodie/io/HoodieCreateHandle.java

[GitHub] [incubator-hudi] n3nash commented on issue #612: Fixing source schema and writer schema distinction in payloads

2019-03-26 Thread GitBox
n3nash commented on issue #612: Fixing source schema and writer schema distinction in payloads URL: https://github.com/apache/incubator-hudi/pull/612#issuecomment-476771521 @vinothchandar Addressed and replied to comments.

[GitHub] [incubator-hudi] n3nash commented on a change in pull request #612: Fixing source schema and writer schema distinction in payloads

2019-03-26 Thread GitBox
n3nash commented on a change in pull request #612: Fixing source schema and writer schema distinction in payloads URL: https://github.com/apache/incubator-hudi/pull/612#discussion_r269238293 ## File path:

[GitHub] [incubator-hudi] vinothchandar commented on issue #547: The question about Hudi transform timestamp and decimal's type

2019-03-26 Thread GitBox
vinothchandar commented on issue #547: The question about Hudi transform timestamp and decimal's type URL: https://github.com/apache/incubator-hudi/issues/547#issuecomment-476752594 Congrats on the new gig! :) This is an

[GitHub] [incubator-hudi] vinothchandar commented on issue #547: The question about Hudi transform timestamp and decimal's type

2019-03-26 Thread GitBox
vinothchandar commented on issue #547: The question about Hudi transform timestamp and decimal's type URL: https://github.com/apache/incubator-hudi/issues/547#issuecomment-476752493 @cdmikechen no worries.. it would help if you can file a JIRA on hive/hadoop version gaps preventing you

[incubator-hudi] branch master updated: [HUDI-63] Removed unused BucketedIndex code

2019-03-26 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git The following commit(s) were added to refs/heads/master by this push: new 395806f [HUDI-63] Removed unused

[GitHub] [incubator-hudi] vinothchandar merged pull request #608: [HUDI-63] Removed unused BucketedIndex code

2019-03-26 Thread GitBox
vinothchandar merged pull request #608: [HUDI-63] Removed unused BucketedIndex code URL: https://github.com/apache/incubator-hudi/pull/608 This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [incubator-hudi] vinothchandar commented on issue #608: [HUDI-63] Removed unused BucketedIndex code

2019-03-26 Thread GitBox
vinothchandar commented on issue #608: [HUDI-63] Removed unused BucketedIndex code URL: https://github.com/apache/incubator-hudi/pull/608#issuecomment-476753401 awesome .. @ambition119 . congrats on your first PR :) This is

[GitHub] [incubator-hudi] vinothchandar opened a new pull request #616: Fixes HUDI-38: Reduce memory overhead of WriteStatus

2019-03-26 Thread GitBox
vinothchandar opened a new pull request #616: Fixes HUDI-38: Reduce memory overhead of WriteStatus URL: https://github.com/apache/incubator-hudi/pull/616 - For implicit indexes (e.g BloomIndex), don't buffer up written records - By default, only collect 10% of failing records to avoid

[GitHub] [incubator-hudi] vinothchandar opened a new pull request #617: Fixes HUDI-9 : Check precondition minInstantsToKeep > cleanerCommitsR…

2019-03-26 Thread GitBox
vinothchandar opened a new pull request #617: Fixes HUDI-9 : Check precondition minInstantsToKeep > cleanerCommitsR… URL: https://github.com/apache/incubator-hudi/pull/617 …etained - Added a precondition check, otherwise incr pull could miss commits - Lowered default cleaner

[GitHub] [incubator-hudi] ambition119 opened a new pull request #618: fix Schema.parse to new Schema.Parser().parse

2019-03-26 Thread GitBox
ambition119 opened a new pull request #618: fix Schema.parse to new Schema.Parser().parse URL: https://github.com/apache/incubator-hudi/pull/618 minor modify info: 1. Schema.parse() deprecated function to new Schema.Parser().parse() 2. Extract method from duplicated code.

[incubator-hudi] branch master updated: Fixing source schema and writer schema distinction in payloads

2019-03-26 Thread vbalaji
This is an automated email from the ASF dual-hosted git repository. vbalaji pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/incubator-hudi.git The following commit(s) were added to refs/heads/master by this push: new 3d9041e Fixing source schema and

[GitHub] [incubator-hudi] bvaradar merged pull request #612: Fixing source schema and writer schema distinction in payloads

2019-03-26 Thread GitBox
bvaradar merged pull request #612: Fixing source schema and writer schema distinction in payloads URL: https://github.com/apache/incubator-hudi/pull/612 This is an automated message from the Apache Git Service. To respond

[GitHub] [incubator-hudi] ambition119 commented on issue #608: [HUDI-63] Removed unused BucketedIndex code

2019-03-26 Thread GitBox
ambition119 commented on issue #608: [HUDI-63] Removed unused BucketedIndex code URL: https://github.com/apache/incubator-hudi/pull/608#issuecomment-476928498 > awesome .. @ambition119 . congrats on your first PR :) happy, thanks

[GitHub] [incubator-hudi] vinothchandar commented on a change in pull request #612: Fixing source schema and writer schema distinction in payloads

2019-03-26 Thread GitBox
vinothchandar commented on a change in pull request #612: Fixing source schema and writer schema distinction in payloads URL: https://github.com/apache/incubator-hudi/pull/612#discussion_r269381371 ## File path: hoodie-client/src/main/java/com/uber/hoodie/io/HoodieIOHandle.java

[GitHub] [incubator-hudi] vinothchandar commented on issue #612: Fixing source schema and writer schema distinction in payloads

2019-03-26 Thread GitBox
vinothchandar commented on issue #612: Fixing source schema and writer schema distinction in payloads URL: https://github.com/apache/incubator-hudi/pull/612#issuecomment-476933580 @n3nash nit: can we get into habit of always having a JIRA for PRs ? esp large change like this deserves