[GitHub] [hudi] bvaradar commented on issue #1875: EMR + Spark Batch job + HUDI + Hive external Metastore (MySQL RDS Instance) failed with No Suitable Driver

2020-07-24 Thread GitBox
bvaradar commented on issue #1875: URL: https://github.com/apache/hudi/issues/1875#issuecomment-663814729 @umehrot2 @bschell @zhedoubushishi : Can you guys chime in here. This is an automated message from the Apache Git

[GitHub] [hudi] bvaradar commented on issue #1878: [SUPPORT] Spark Structured Streaming To Hudi Sink Datasource taking much longer

2020-07-24 Thread GitBox
bvaradar commented on issue #1878: URL: https://github.com/apache/hudi/issues/1878#issuecomment-663813751 This is a Spark tuning issue in general. The slowness is due to memory pressure and the node failures it causes. At least in one of the batches, I see task failures (and retries) during

[GitHub] [hudi] rubenssoto commented on issue #1878: [SUPPORT] Spark Structured Streaming To Hudi Sink Datasource taking much longer

2020-07-24 Thread GitBox
rubenssoto commented on issue #1878: URL: https://github.com/apache/hudi/issues/1878#issuecomment-663806475 I tried resizing the cluster with 3 more nodes, so in total (4 nodes) after resizing I had 4 cores and 16 GB of RAM in each node, and it made no difference; the job keeps very

Build failed in Jenkins: hudi-snapshot-deployment-0.5 #349

2020-07-24 Thread Apache Jenkins Server
See Changes: -- [...truncated 2.33 KB...] /home/jenkins/tools/maven/apache-maven-3.5.4/conf: logging settings.xml toolchains.xml

[GitHub] [hudi] rubenssoto opened a new issue #1878: [SUPPORT] Spark Structured Streaming To Hudi Sink Datasource taking much longer

2020-07-24 Thread GitBox
rubenssoto opened a new issue #1878: URL: https://github.com/apache/hudi/issues/1878 Hi, how are you? I'm using EMR 5.30.1, Spark 2.4.5, Hudi 0.5.2 and my data is stored in S3. Since today I'm trying to migrate some of our datasets in production to Apache Hudi, I'm having

[GitHub] [hudi] vinothchandar merged pull request #1874: [MINOR] Use HoodieActiveTimeline.COMMIT_FORMATTER

2020-07-24 Thread GitBox
vinothchandar merged pull request #1874: URL: https://github.com/apache/hudi/pull/1874

[hudi] branch master updated: [MINOR] Use HoodieActiveTimeline.COMMIT_FORMATTER (#1874)

2020-07-24 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 0cb24e4 [MINOR] Use

[GitHub] [hudi] vinothchandar merged pull request #1877: [MINOR] Add Databricks File System to StorageSchemes

2020-07-24 Thread GitBox
vinothchandar merged pull request #1877: URL: https://github.com/apache/hudi/pull/1877

[hudi] branch master updated: [MINOR] Add Databricks File System to StorageSchemes (#1877)

2020-07-24 Thread vinoth
vinoth pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 467d097 [MINOR] Add Databricks File System to

[GitHub] [hudi] satishkotha commented on issue #1872: [SUPPORT]Getting 503s from S3 during upserts

2020-07-24 Thread GitBox
satishkotha commented on issue #1872: URL: https://github.com/apache/hudi/issues/1872#issuecomment-663788217 Hi, Number of 'createMarkerFile' calls = (number of partitions) + (number of file groups) *touched* by upsert operation. What is the partition for your workload? What
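
The counting rule in the comment above can be worked through with a minimal sketch. The function name and the sample numbers below are hypothetical and only illustrate the arithmetic; this is not Hudi code, and real call counts may differ by version and configuration.

```python
def estimate_marker_file_calls(partitions_touched: int, file_groups_touched: int) -> int:
    """Rough estimate of 'createMarkerFile' calls for one upsert, following
    the rule quoted above: (number of partitions) + (number of file groups)
    touched by the operation. Hypothetical helper, not part of Hudi."""
    if partitions_touched < 0 or file_groups_touched < 0:
        raise ValueError("counts must be non-negative")
    return partitions_touched + file_groups_touched


# Assumed workload: an upsert touching 200 partitions and 1500 file groups.
calls = estimate_marker_file_calls(partitions_touched=200, file_groups_touched=1500)
print(calls)  # 200 + 1500 = 1700
```

Under this rule, an upsert that spreads over fewer partitions and file groups would issue fewer marker-file calls, which is one reason the partitioning of the workload is asked about.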

[GitHub] [hudi] garyli1019 opened a new pull request #1877: [MINOR] Add Databricks File System to StorageSchemes

2020-07-24 Thread GitBox
garyli1019 opened a new pull request #1877: URL: https://github.com/apache/hudi/pull/1877 ## What is the purpose of the pull request *Add support for the Databricks File System as a mount point on top of Azure Data Lake* ## Brief change log Add dbfs to StorageSchemes

[GitHub] [hudi] vinothchandar commented on pull request #1876: [HUDI-242] Support for RFC-12/Bootstrapping of external datasets

2020-07-24 Thread GitBox
vinothchandar commented on pull request #1876: URL: https://github.com/apache/hudi/pull/1876#issuecomment-663787403 @bvaradar @umehrot2 after many valiant efforts, finally rebased the original #1678 here. Will be working on getting the code review comments addressed and tests passing

[GitHub] [hudi] vinothchandar opened a new pull request #1876: [HUDI-242] Support for RFC-12/Bootstrapping of external datasets

2020-07-24 Thread GitBox
vinothchandar opened a new pull request #1876: URL: https://github.com/apache/hudi/pull/1876 - [HUDI-418] Bootstrap Index Implementation using HFile with unit-test - [HUDI-421] FileSystem View Changes to support Bootstrap with unit-tests - [HUDI-424] Implement Query Side

[GitHub] [hudi] vinothchandar commented on a change in pull request #1858: [WIP] [1014] Part 1: Adding Upgrade or downgrade infra

2020-07-24 Thread GitBox
vinothchandar commented on a change in pull request #1858: URL: https://github.com/apache/hudi/pull/1858#discussion_r460343368 ## File path: hudi-common/src/main/java/org/apache/hudi/common/table/HoodieTableConfig.java ## @@ -151,6 +154,27 @@ public HoodieTableType

[GitHub] [hudi] FelixKJose opened a new issue #1875: EMR + Spark Batch job + HUDI + Hive external Metastore (MySQL RDS Instance) failed with No Suitable Driver

2020-07-24 Thread GitBox
FelixKJose opened a new issue #1875: URL: https://github.com/apache/hudi/issues/1875 Hello, I am getting following error while I am using external RDS instance as Hive Metastore. **My configuration:** 'hoodie.datasource.hive_sync.enable': 'true',

[GitHub] [hudi] satishkotha commented on issue #1866: [SUPPORT]Clean up does not seem to happen on MOR table

2020-07-24 Thread GitBox
satishkotha commented on issue #1866: URL: https://github.com/apache/hudi/issues/1866#issuecomment-663751174 Sounds good. Please try it and let me know if you see any issues.

[GitHub] [hudi] satishkotha edited a comment on issue #1866: [SUPPORT]Clean up does not seem to happen on MOR table

2020-07-24 Thread GitBox
satishkotha edited a comment on issue #1866: URL: https://github.com/apache/hudi/issues/1866#issuecomment-663683323 > Is there a possibility that commits get archived before clean job is resulting in a noop. I will continue to monitor. clean and archival are somewhat independent

[GitHub] [hudi] luffyd edited a comment on issue #1866: [SUPPORT]Clean up does not seem to happen on MOR table

2020-07-24 Thread GitBox
luffyd edited a comment on issue #1866: URL: https://github.com/apache/hudi/issues/1866#issuecomment-663741729 Ok thanks. No, I was not thinking of running it as a separate process continuously, but I wanted to execute "clean commands" from the CLI so that my streaming tests progress faster.

[GitHub] [hudi] luffyd commented on issue #1866: [SUPPORT]Clean up does not seem to happen on MOR table

2020-07-24 Thread GitBox
luffyd commented on issue #1866: URL: https://github.com/apache/hudi/issues/1866#issuecomment-663741729 Ok thanks, I will be running "clean commands" from hudi cli so that my tests progress faster for streaming.

[GitHub] [hudi] umehrot2 commented on issue #1867: [SUPPORT] hudi is incurring emrfs eTag inconsistency issue with s3 and emrfs consistent view

2020-07-24 Thread GitBox
umehrot2 commented on issue #1867: URL: https://github.com/apache/hudi/issues/1867#issuecomment-663734319 Also on a side note, we always recommend using the latest EMR releases as they have the latest fixes and application versions. So you may want to use `emr-5.30.1` instead.

[GitHub] [hudi] umehrot2 commented on issue #1867: [SUPPORT] hudi is incurring emrfs eTag inconsistency issue with s3 and emrfs consistent view

2020-07-24 Thread GitBox
umehrot2 commented on issue #1867: URL: https://github.com/apache/hudi/issues/1867#issuecomment-663732545 @tsolanki95 have you tried using `hoodie.consistency.check.enabled` which is Hudi's in-built mechanism for avoiding `eventual consistency` issues instead ? As for this
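
As a minimal sketch of the suggestion above, the consistency check is switched on through a writer option. The table name here is a hypothetical placeholder, and the dictionary would be passed to whatever Hudi writer options the job already uses; defaults and related settings depend on the Hudi version.

```python
# Hudi writer options as a plain dictionary (a sketch, not a complete config).
hudi_options = {
    "hoodie.table.name": "example_table",  # hypothetical table name
    # Option named in the comment above: Hudi's built-in consistency check.
    "hoodie.consistency.check.enabled": "true",
}

for key, value in sorted(hudi_options.items()):
    print(f"{key}={value}")
```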

[GitHub] [hudi] xushiyan opened a new pull request #1874: [MINOR] Use HoodieActiveTimeline.COMMIT_FORMATTER

2020-07-24 Thread GitBox
xushiyan opened a new pull request #1874: URL: https://github.com/apache/hudi/pull/1874 To avoid repeated datetime format "MMddHHmmss". ## Committer checklist - [ ] Has a corresponding JIRA in PR title & commit - [ ] Commit message is descriptive of the change

[GitHub] [hudi] xushiyan commented on a change in pull request #1873: [HUDI-995] Move TestRawTripPayload and HoodieTestDataGenerator to hudi-common

2020-07-24 Thread GitBox
xushiyan commented on a change in pull request #1873: URL: https://github.com/apache/hudi/pull/1873#discussion_r460283968 ## File path: hudi-common/src/test/java/org/apache/hudi/common/testutils/RawTripTestPayload.java ## @@ -141,59 +138,4 @@ private String

[GitHub] [hudi] xushiyan commented on a change in pull request #1873: [HUDI-995] Move TestRawTripPayload and HoodieTestDataGenerator to hudi-common

2020-07-24 Thread GitBox
xushiyan commented on a change in pull request #1873: URL: https://github.com/apache/hudi/pull/1873#discussion_r460283968 ## File path: hudi-common/src/test/java/org/apache/hudi/common/testutils/RawTripTestPayload.java ## @@ -141,59 +138,4 @@ private String

[jira] [Updated] (HUDI-995) Organize test utils methods and classes

2020-07-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-995: Labels: pull-request-available (was: ) > Organize test utils methods and classes >

[GitHub] [hudi] xushiyan opened a new pull request #1873: [HUDI-995] Move TestRawTripPayload and HoodieTestDataGenerator to hudi-common

2020-07-24 Thread GitBox
xushiyan opened a new pull request #1873: URL: https://github.com/apache/hudi/pull/1873 To allow wider access to these classes ## Committer checklist - [ ] Has a corresponding JIRA in PR title & commit - [ ] Commit message is descriptive of the change - [ ]

[GitHub] [hudi] umehrot2 commented on issue #1847: [SUPPORT] querying MoR tables on S3 becomes slow with number of files growing

2020-07-24 Thread GitBox
umehrot2 commented on issue #1847: URL: https://github.com/apache/hudi/issues/1847#issuecomment-663728553 @bvaradar EMR only overrides the `getLen()` if the customer has explicitly enabled `Client Side Encryption` using the EmrFS property `fs.s3.cse.enabled`. In that case I see that EmrFS

[GitHub] [hudi] tsolanki95 commented on issue #1867: [SUPPORT] hudi is incurring emrfs eTag inconsistency issue with s3 and emrfs consistent view

2020-07-24 Thread GitBox
tsolanki95 commented on issue #1867: URL: https://github.com/apache/hudi/issues/1867#issuecomment-663698781 This is also a field where data quality, precision, and accuracy are important. EMRFS consistent view helps keep us from having issues with S3 consistency, some of the features that

[GitHub] [hudi] tsolanki95 edited a comment on issue #1867: [SUPPORT] hudi is incurring emrfs eTag inconsistency issue with s3 and emrfs consistent view

2020-07-24 Thread GitBox
tsolanki95 edited a comment on issue #1867: URL: https://github.com/apache/hudi/issues/1867#issuecomment-663688633 @luffyd We put in consistent view as a solution earlier, based on AWS support, to solve issues with using spark with S3 eventual consistency model causing duplicates in our

[GitHub] [hudi] luffyd commented on issue #1872: [SUPPORT]Getting 503s from S3 during upserts

2020-07-24 Thread GitBox
luffyd commented on issue #1872: URL: https://github.com/apache/hudi/issues/1872#issuecomment-663690493 I have noticed that slowing down ingestion worked. It seems like each call to "HoodieWriteHandle.createMarkerFile" is resulting in an S3 call. But can you give any hints on 1.

[GitHub] [hudi] asheeshgarg commented on issue #1787: Exception During Insert

2020-07-24 Thread GitBox
asheeshgarg commented on issue #1787: URL: https://github.com/apache/hudi/issues/1787#issuecomment-663688727 @bvaradar I am getting the same exception. I had added the jars to the --jars option of submit so it's available to both the driver and executors.

[GitHub] [hudi] tsolanki95 edited a comment on issue #1867: [SUPPORT] hudi is incurring emrfs eTag inconsistency issue with s3 and emrfs consistent view

2020-07-24 Thread GitBox
tsolanki95 edited a comment on issue #1867: URL: https://github.com/apache/hudi/issues/1867#issuecomment-663688633 @luffyd We put in consistent view as a solution earlier, based on AWS support, to solve issues with using spark with S3 eventual consistency model. We are now looking towards

[GitHub] [hudi] tsolanki95 commented on issue #1867: [SUPPORT] hudi is incurring emrfs eTag inconsistency issue with s3 and emrfs consistent view

2020-07-24 Thread GitBox
tsolanki95 commented on issue #1867: URL: https://github.com/apache/hudi/issues/1867#issuecomment-663688633 We put in consistent view as a solution earlier, based on AWS support, to solve issues with using spark with S3 eventual consistency model. We are now looking towards changing some

[GitHub] [hudi] satishkotha commented on issue #1872: [SUPPORT]Getting 503s from S3 during upserts

2020-07-24 Thread GitBox
satishkotha commented on issue #1872: URL: https://github.com/apache/hudi/issues/1872#issuecomment-663688088 This is likely more of an AWS support question. A quick search shows https://aws.amazon.com/premiumsupport/knowledge-center/emr-s3-503-slow-down/ Can you see if any of the

[GitHub] [hudi] satishkotha edited a comment on issue #1866: [SUPPORT]Clean up does not seem to happen on MOR table

2020-07-24 Thread GitBox
satishkotha edited a comment on issue #1866: URL: https://github.com/apache/hudi/issues/1866#issuecomment-663683323 > Is there a possibility that commits get archived before clean job is resulting in a noop. I will continue to monitor. clean and archival are somewhat independent

[GitHub] [hudi] satishkotha commented on issue #1866: [SUPPORT]Clean up does not seem to happen on MOR table

2020-07-24 Thread GitBox
satishkotha commented on issue #1866: URL: https://github.com/apache/hudi/issues/1866#issuecomment-663683323 > Is there a possibility that commits get archived before clean job is resulting in a noop. I will continue to monitor. clean and archival are somewhat independent. So noop

[GitHub] [hudi] satishkotha edited a comment on issue #1866: [SUPPORT]Clean up does not seem to happen on MOR table

2020-07-24 Thread GitBox
satishkotha edited a comment on issue #1866: URL: https://github.com/apache/hudi/issues/1866#issuecomment-663683323 > Is there a possibility that commits get archived before clean job is resulting in a noop. I will continue to monitor. clean and archival are somewhat independent. So

[GitHub] [hudi] bvaradar commented on issue #1787: Exception During Insert

2020-07-24 Thread GitBox
bvaradar commented on issue #1787: URL: https://github.com/apache/hudi/issues/1787#issuecomment-663677973 @asheeshgarg : I may have accidentally deleted a comment from. Has the issue been resolved ?

[GitHub] [hudi] bvaradar closed issue #1856: [SUPPORT] HiveSyncTool fails on alter table cascade

2020-07-24 Thread GitBox
bvaradar closed issue #1856: URL: https://github.com/apache/hudi/issues/1856

[GitHub] [hudi] bvaradar commented on issue #1856: [SUPPORT] HiveSyncTool fails on alter table cascade

2020-07-24 Thread GitBox
bvaradar commented on issue #1856: URL: https://github.com/apache/hudi/issues/1856#issuecomment-663677430 Please reopen if you need further clarifications.

[GitHub] [hudi] bvaradar commented on issue #1864: Spark 2.2.0 is compatible?

2020-07-24 Thread GitBox
bvaradar commented on issue #1864: URL: https://github.com/apache/hudi/issues/1864#issuecomment-663677139 Closing this ticket. Please reach out in slack or open a new ticket if you find any issues

[GitHub] [hudi] bvaradar closed issue #1864: Spark 2.2.0 is compatible?

2020-07-24 Thread GitBox
bvaradar closed issue #1864: URL: https://github.com/apache/hudi/issues/1864

[GitHub] [hudi] luffyd commented on issue #1867: [SUPPORT] hudi is incurring emrfs eTag inconsistency issue with s3 and emrfs consistent view

2020-07-24 Thread GitBox
luffyd commented on issue #1867: URL: https://github.com/apache/hudi/issues/1867#issuecomment-663659622 @tsolanki95 Does this happen at the time of read? In my tests, I noticed etags are not in sync for the .hoodie folder. Also what are your reasons to enable consistent view when using

[GitHub] [hudi] luffyd opened a new issue #1872: [SUPPORT]Getting 503s from S3 during upserts

2020-07-24 Thread GitBox
luffyd opened a new issue #1872: URL: https://github.com/apache/hudi/issues/1872 **_Tips before filing an issue_** - Have you gone through our [FAQs](https://cwiki.apache.org/confluence/display/HUDI/FAQ)? - Join the mailing list to engage in conversations and get faster

[jira] [Assigned] (HUDI-995) Organize test utils methods and classes

2020-07-24 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu reassigned HUDI-995: --- Assignee: Raymond Xu > Organize test utils methods and classes >

[jira] [Updated] (HUDI-995) Organize test utils methods and classes

2020-07-24 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-995: Description: * Move test utils classes to hudi-common where appropriate, e.g. TestRawTripPayload,

[jira] [Updated] (HUDI-995) Organize test utils methods and classes

2020-07-24 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-995: Summary: Organize test utils methods and classes (was: Add hudi-testutils module) > Organize test utils

[GitHub] [hudi] luffyd commented on issue #1866: [SUPPORT]Clean up does not seem to happen on MOR table

2020-07-24 Thread GitBox
luffyd commented on issue #1866: URL: https://github.com/apache/hudi/issues/1866#issuecomment-663648589 Thanks satish, I have inline turned on by default. Now I see cleans did happen! Is there a possibility that commits get archived before clean job is resulting in a noop. I will

[GitHub] [hudi] ssomuah commented on issue #1852: [SUPPORT]

2020-07-24 Thread GitBox
ssomuah commented on issue #1852: URL: https://github.com/apache/hudi/issues/1852#issuecomment-663646201 Hi Balaji, I think I've narrowed down my issue somewhat for my MOR table. I started again with a fresh table and the initial commits make sense, but after a time I notice It's

[GitHub] [hudi] bvaradar commented on issue #1847: [SUPPORT] querying MoR tables on S3 becomes slow with number of files growing

2020-07-24 Thread GitBox
bvaradar commented on issue #1847: URL: https://github.com/apache/hudi/issues/1847#issuecomment-663450319 @bschell : Thanks for the information. As getLen() is used extensively on both the read and write side, can you elaborate more on which cases actually result in RPC calls?

[GitHub] [hudi] lw309637554 commented on pull request #1810: [HUDI-875] Abstract hudi-sync-common, and support hudi-hive-sync

2020-07-24 Thread GitBox
lw309637554 commented on pull request #1810: URL: https://github.com/apache/hudi/pull/1810#issuecomment-663442107 > couple okay, thanks

[GitHub] [hudi] bvaradar commented on issue #1852: [SUPPORT]

2020-07-24 Thread GitBox
bvaradar commented on issue #1852: URL: https://github.com/apache/hudi/issues/1852#issuecomment-663427905 What do you mean by "runs serially with ingestion"? My understanding was that inline compaction happened in the same flow as writing so an inline compaction would simply slow down

[GitHub] [hudi] bvaradar commented on issue #1867: [SUPPORT] hudi is incurring emrfs eTag inconsistency issue with s3 and emrfs consistent view

2020-07-24 Thread GitBox
bvaradar commented on issue #1867: URL: https://github.com/apache/hudi/issues/1867#issuecomment-663414676 @umehrot2 : Can you help answer this question. Thanks. Balaji.V

[GitHub] [hudi] bvaradar commented on issue #1860: [SUPPORT] Issue when querying from Spark Datasource if COW table is being written to at the same time

2020-07-24 Thread GitBox
bvaradar commented on issue #1860: URL: https://github.com/apache/hudi/issues/1860#issuecomment-663413173 I would expect the data to be same across query engines unless there is some caching or GS is not giving consistent listing view. With Hudi's Spark datasource integration, Hudi

[GitHub] [hudi] sbernauer commented on issue #1845: [SUPPORT] Support for Schema evolution. Facing an error

2020-07-24 Thread GitBox
sbernauer commented on issue #1845: URL: https://github.com/apache/hudi/issues/1845#issuecomment-663373241 > 4. We ingest old events again (there are some upserts). ?? What schema is being used here? At this step I used SCHEMA_V2 We use Deltastreamer in continuous mode and

[jira] [Commented] (HUDI-1123) Document the usage of user define metrics reporter

2020-07-24 Thread Zheren Yu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17164179#comment-17164179 ] Zheren Yu commented on HUDI-1123: - @leesf Thank you for assigning  > Document the usage of user define

[jira] [Commented] (HUDI-1124) Document the usage of Tencent COSN

2020-07-24 Thread deyzhong (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17164181#comment-17164181 ] deyzhong commented on HUDI-1124: ok, I will finish the work as soon as possible.   > Document the usage

[jira] [Commented] (HUDI-1124) Document the usage of Tencent COSN

2020-07-24 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17164176#comment-17164176 ] leesf commented on HUDI-1124: - [~meimile] Assign the ticket to you and feel free to open a new PR, thanks >

[jira] [Created] (HUDI-1124) Document the usage of Tencent COSN

2020-07-24 Thread leesf (Jira)
leesf created HUDI-1124: --- Summary: Document the usage of Tencent COSN Key: HUDI-1124 URL: https://issues.apache.org/jira/browse/HUDI-1124 Project: Apache Hudi Issue Type: Improvement

[jira] [Commented] (HUDI-1123) Document the usage of user define metrics reporter

2020-07-24 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17164174#comment-17164174 ] leesf commented on HUDI-1123: - [~york831] . Assign the ticket to you and feel free to open a new PR, thanks >

[jira] [Assigned] (HUDI-1123) Document the usage of user define metrics reporter

2020-07-24 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] leesf reassigned HUDI-1123: --- Assignee: Zheren Yu > Document the usage of user define metrics reporter >

[jira] [Updated] (HUDI-1123) Document the usage of user define metrics reporter

2020-07-24 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] leesf updated HUDI-1123: Description: (was: [~york831] . Assign the ticket to you and feel free to open a new PR, thanks) > Document

[jira] [Created] (HUDI-1123) Document the usage of user define metrics reporter

2020-07-24 Thread leesf (Jira)
leesf created HUDI-1123: --- Summary: Document the usage of user define metrics reporter Key: HUDI-1123 URL: https://issues.apache.org/jira/browse/HUDI-1123 Project: Apache Hudi Issue Type: Improvement

[jira] [Closed] (HUDI-1113) Support user defined metrics reporter

2020-07-24 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] leesf closed HUDI-1113. --- > Support user defined metrics reporter > - > > Key: HUDI-1113 >

[jira] [Resolved] (HUDI-1113) Support user defined metrics reporter

2020-07-24 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] leesf resolved HUDI-1113. - Resolution: Fixed > Support user defined metrics reporter > - > >

[jira] [Updated] (HUDI-1113) Support user defined metrics reporter

2020-07-24 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] leesf updated HUDI-1113: Fix Version/s: (was: 0.5.3) 0.60 > Support user defined metrics reporter >

[jira] [Updated] (HUDI-1113) Support user defined metrics reporter

2020-07-24 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] leesf updated HUDI-1113: Fix Version/s: (was: 0.60) 0.6.0 > Support user defined metrics reporter >

[jira] [Updated] (HUDI-1113) Support user defined metrics reporter

2020-07-24 Thread leesf (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] leesf updated HUDI-1113: Status: Open (was: New) > Support user defined metrics reporter > - > >

[GitHub] [hudi] xushiyan opened a new pull request #1871: [WIP] [HUDI-781] Introduce HoodieDataPrep for test preparation

2020-07-24 Thread GitBox
xushiyan opened a new pull request #1871: URL: https://github.com/apache/hudi/pull/1871 - Consolidate relevant util methods to `HoodieDataPrep` - Make `HoodieDataPrep` the sole class for creating hoodie data/metadata files for testing ## Committer checklist - [ ] Has a