[GitHub] [hudi] danny0405 commented on pull request #2867: [HUDI-1836] Logging consuming instant to StreamReadOperator#processSplits

2021-04-26 Thread GitBox


danny0405 commented on pull request #2867:
URL: https://github.com/apache/hudi/pull/2867#issuecomment-827329997


   Hi @yanghua ,can you take a look for this PR again, thanks ~


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[jira] [Updated] (HUDI-1853) Add flink index and compaction options to document

2021-04-26 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-1853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HUDI-1853:
-
Labels: pull-request-available  (was: )

> Add flink index and compaction options to document
> --
>
> Key: HUDI-1853
> URL: https://issues.apache.org/jira/browse/HUDI-1853
> Project: Apache Hudi
>  Issue Type: Task
>  Components: Flink Integration
>Reporter: Danny Chen
>Assignee: Danny Chen
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.9.0
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [hudi] danny0405 opened a new pull request #2885: [HUDI-1853] Add flink index and compaction options to document

2021-04-26 Thread GitBox


danny0405 opened a new pull request #2885:
URL: https://github.com/apache/hudi/pull/2885


   ## *Tips*
   - *Thank you very much for contributing to Apache Hudi.*
   - *Please review https://hudi.apache.org/contributing.html before opening a 
pull request.*
   
   ## What is the purpose of the pull request
   
   *(For example: This pull request adds quick-start document.)*
   
   ## Brief change log
   
   *(for example:)*
 - *Modify AnnotationLocation checkstyle rule in checkstyle.xml*
   
   ## Verify this pull request
   
   *(Please pick either of the following options)*
   
   This pull request is a trivial rework / code cleanup without any test 
coverage.
   
   *(or)*
   
   This pull request is already covered by existing tests, such as *(please 
describe tests)*.
   
   (or)
   
   This change added tests and can be verified as follows:
   
   *(example:)*
   
 - *Added integration tests for end-to-end.*
 - *Added HoodieClientWriteTest to verify the change.*
 - *Manually verified the change by running a job locally.*
   
   ## Committer checklist
   
- [ ] Has a corresponding JIRA in PR title & commit

- [ ] Commit message is descriptive of the change

- [ ] CI is green
   
- [ ] Necessary doc changes done or have another open PR
  
- [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] hudi-bot edited a comment on pull request #2643: DO NOT MERGE (Azure CI) test branch ci

2021-04-26 Thread GitBox


hudi-bot edited a comment on pull request #2643:
URL: https://github.com/apache/hudi/pull/2643#issuecomment-792368481


   
   ## CI report:
   
   * 9831a6c50e9f49f8a71c02fc6ac50ae1446f7c1f UNKNOWN
   * a569dbe9409910fbb83b3764b300574c0e52612e Azure: 
[FAILURE](https://dev.azure.com/XUSH0012/0ef433cc-d4b4-47cc-b6a1-03d032ef546c/_build/results?buildId=142)
 
   * e6e9f1f1554a1474dd6c20338215030cad23a2e0 UNKNOWN
   * 2a6690a256c8cd8efe9ed2b1984b896fb27ef077 UNKNOWN
   * d8b7cca55e057a52a2e229d81e8cb52b60dc275f UNKNOWN
   * 3bce301333cc78194d13a702598b46e04fe9f85f UNKNOWN
   * f07f345baa450f3fec7eab59caa76b0fbda1e132 UNKNOWN
   * 869d2ce3fad330af93c1bb3b576824f519c6e68b UNKNOWN
   * fa86907f7522bc8dbe512d48b5a87e4a6b13f035 UNKNOWN
   * 4ebe53016ce3e0648992dbe14d04f71a92f116e6 UNKNOWN
   * 682ae9985f591f6d0c30ee2ef9b159403c1e46de UNKNOWN
   * d80397fcfeaa2996ab550bcdab4524be7420a364 UNKNOWN
   * bfe3a803e19540578b94f778f7ba7551db0f86f1 UNKNOWN
   * a632e58390eb94fcc7e757bd7580780cf184f9a8 UNKNOWN
   * 2e413d601c80b123269c2fc3fc6aa9a8bd0d746a UNKNOWN
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run travis` re-run the last Travis build
- `@hudi-bot run azure` re-run the last Azure build
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] n3nash commented on issue #2609: [SUPPORT] Presto hudi query slow when compared to parquet

2021-04-26 Thread GitBox


n3nash commented on issue #2609:
URL: https://github.com/apache/hudi/issues/2609#issuecomment-827323716


   I've filed a ticket here -> https://issues.apache.org/jira/browse/HUDI-1856. 
Let's move the conversation to the JIRA. Happy to guide you @GaruGaru or 
@njalan if you would like to upstream the change in Trino. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] n3nash closed issue #2609: [SUPPORT] Presto hudi query slow when compared to parquet

2021-04-26 Thread GitBox


n3nash closed issue #2609:
URL: https://github.com/apache/hudi/issues/2609


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[jira] [Created] (HUDI-1856) Upstream changes made in PrestoDB to eliminate file listing to Trino

2021-04-26 Thread Nishith Agarwal (Jira)
Nishith Agarwal created HUDI-1856:
-

 Summary: Upstream changes made in PrestoDB to eliminate file 
listing to Trino
 Key: HUDI-1856
 URL: https://issues.apache.org/jira/browse/HUDI-1856
 Project: Apache Hudi
  Issue Type: Wish
Reporter: Nishith Agarwal


inputFormat.getSplits() code was optimized for PrestoDB code base. This change 
is not implemented / upstreamed in Trino.

 

Additionally, there are other changes that need to be upstreamed in Trino. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HUDI-1856) Upstream changes made in PrestoDB to eliminate file listing to Trino

2021-04-26 Thread Nishith Agarwal (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-1856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nishith Agarwal updated HUDI-1856:
--
Labels: sev:high sev:triage  (was: )

> Upstream changes made in PrestoDB to eliminate file listing to Trino
> 
>
> Key: HUDI-1856
> URL: https://issues.apache.org/jira/browse/HUDI-1856
> Project: Apache Hudi
>  Issue Type: Wish
>Reporter: Nishith Agarwal
>Priority: Major
>  Labels: sev:high, sev:triage
>
> inputFormat.getSplits() code was optimized for PrestoDB code base. This 
> change is not implemented / upstreamed in Trino.
>  
> Additionally, there are other changes that need to be upstreamed in Trino. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [hudi] n3nash commented on issue #2620: [SUPPORT] Performance Tuning: Slow stages (Building Workload Profile & Getting Small files from partitions) during Hudi Writes

2021-04-26 Thread GitBox


n3nash commented on issue #2620:
URL: https://github.com/apache/hudi/issues/2620#issuecomment-827321400


   @kimberlyamandalu @njalan @codejoyan There are a few problems when using 
BLOOM_INDEX
   
   1. Depending on the number of entries in the parquet file, if the 
BLOOM_INDEX num_entries is not configured correctly, it will lead to lots of 
false positives that results in bloom index spending more time looking up data. 
You can check the default bloom index entries here -> 
https://github.com/apache/hudi/blob/5be3997f70415e1752a0b5214f9398880fc8fd1f/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieIndexConfig.java#L47.
 You can either increase this or use dynamic bloom filter. We are working on 
adding metrics to emit how many such false positives happened. 
   2. The BLOOM_INDEX step needs to perform a "listing" of the partitions to 
find the candidate files. On S3 without `hoodie.metadata.table` being enabled, 
this listing can take time. Enable the config to eliminate these file listings.
   3. Depending on your workload, BLOOM_INDEX could, in some cases not be the 
ideal choice. For example, if you have updates across all your partitions, then 
using SIMPLE_INDEX is better since bloom will just do some extra work and then 
do the work that SIMPLE_INDEX would have done anyways. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] n3nash commented on issue #2623: org.apache.hudi.exception.HoodieDependentSystemUnavailableException:System HBASE unavailable.

2021-04-26 Thread GitBox


n3nash commented on issue #2623:
URL: https://github.com/apache/hudi/issues/2623#issuecomment-827317890


   @hj2016 Thanks for the update. @root18039532923 Is your issue resolved now ? 
Were you able to package the jar and try it out


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] n3nash closed issue #2639: [SUPPORT] Spark 3.0.1 upgrade cause severe increase in Hudi write time

2021-04-26 Thread GitBox


n3nash closed issue #2639:
URL: https://github.com/apache/hudi/issues/2639


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] n3nash commented on issue #2639: [SUPPORT] Spark 3.0.1 upgrade cause severe increase in Hudi write time

2021-04-26 Thread GitBox


n3nash commented on issue #2639:
URL: https://github.com/apache/hudi/issues/2639#issuecomment-827313973


   @afeldman1 I think this issue is related to the AWS EMR changes for Hudi 
0.6. As @rubenssoto described, the Hudi 0.7 works fine. Closing this ticket, 
feel free to re-open if issue still persists for you.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] n3nash commented on issue #2641: [SUPPORT]Build Hudi source with spark 3.1.1 compile error

2021-04-26 Thread GitBox


n3nash commented on issue #2641:
URL: https://github.com/apache/hudi/issues/2641#issuecomment-827313469


   @garyli1019 Are you able to take a look at this one as well ?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] n3nash commented on issue #2647: [SUPPORT] hive2.1.1 MR cannot be queried based on hive partition

2021-04-26 Thread GitBox


n3nash commented on issue #2647:
URL: https://github.com/apache/hudi/issues/2647#issuecomment-827313315


   @liujinhui1994 Are you able to provide information that @nsivabalan and 
@bvaradar have asked ? We can help triage this issue, if not, I will close this 
ticket soon.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[jira] [Commented] (HUDI-1855) RejectedExecutionException FutureTask rejected from ThreadPoolExecutor[Terminated...]

2021-04-26 Thread Jira


[ 
https://issues.apache.org/jira/browse/HUDI-1855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17332927#comment-17332927
 ] 

李建康 commented on HUDI-1855:
---

[[SUPPORT]RejectedExecutionException FutureTask rejected from 
ThreadPoolExecutor[Terminated...] · Issue #2723 · apache/hudi 
(github.com)|https://github.com/apache/hudi/issues/2723]

> RejectedExecutionException FutureTask rejected from 
> ThreadPoolExecutor[Terminated...] 
> --
>
> Key: HUDI-1855
> URL: https://issues.apache.org/jira/browse/HUDI-1855
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: Cleaner
>Affects Versions: 0.6.0
>Reporter: 李建康
>Priority: Major
>
> *Describe the problem you faced*
> We use Structured Streaming to subscribe to the data in Kafka, and then write 
> the data to the hoodie, the program will stop abnormally after running for a 
> period of time.
> *Environment Description*
>  * Hudi version :0.6.0
>  * Spark version :2.4.0
>  * Hive version :2.1.1
>  * Hadoop version :3.0.0
>  * Storage (HDFS/S3/GCS..) :HDFS
>  * Running on Docker? (yes/no) :no



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Issue Comment Deleted] (HUDI-1855) RejectedExecutionException FutureTask rejected from ThreadPoolExecutor[Terminated...]

2021-04-26 Thread Jira


 [ 
https://issues.apache.org/jira/browse/HUDI-1855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

李建康 updated HUDI-1855:
--
Comment: was deleted

(was: [[SUPPORT]RejectedExecutionException FutureTask rejected from 
ThreadPoolExecutor[Terminated...] · Issue #2723 · apache/hudi 
(github.com)|https://github.com/apache/hudi/issues/2723])

> RejectedExecutionException FutureTask rejected from 
> ThreadPoolExecutor[Terminated...] 
> --
>
> Key: HUDI-1855
> URL: https://issues.apache.org/jira/browse/HUDI-1855
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: Cleaner
>Affects Versions: 0.6.0
>Reporter: 李建康
>Priority: Major
>
> *Describe the problem you faced*
> We use Structured Streaming to subscribe to the data in Kafka, and then write 
> the data to the hoodie, the program will stop abnormally after running for a 
> period of time.
> *Environment Description*
>  * Hudi version :0.6.0
>  * Spark version :2.4.0
>  * Hive version :2.1.1
>  * Hadoop version :3.0.0
>  * Storage (HDFS/S3/GCS..) :HDFS
>  * Running on Docker? (yes/no) :no



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [hudi] n3nash commented on issue #2648: [SUPPORT] a NPE error when reading MOR table in spark datasource

2021-04-26 Thread GitBox


n3nash commented on issue #2648:
URL: https://github.com/apache/hudi/issues/2648#issuecomment-827312511


   @hushenmin Thanks for the cross comment. Yes, it looks like for _rt view, 
for indexType = global when there are no base data files, this exception is 
thrown. We will take this up soon.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[jira] [Commented] (HUDI-1855) RejectedExecutionException FutureTask rejected from ThreadPoolExecutor[Terminated...]

2021-04-26 Thread Jira


[ 
https://issues.apache.org/jira/browse/HUDI-1855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17332926#comment-17332926
 ] 

李建康 commented on HUDI-1855:
---

[[SUPPORT]RejectedExecutionException FutureTask rejected from 
ThreadPoolExecutor[Terminated...] · Issue #2723 · apache/hudi 
(github.com)|https://github.com/apache/hudi/issues/2723]

> RejectedExecutionException FutureTask rejected from 
> ThreadPoolExecutor[Terminated...] 
> --
>
> Key: HUDI-1855
> URL: https://issues.apache.org/jira/browse/HUDI-1855
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: Cleaner
>Affects Versions: 0.6.0
>Reporter: 李建康
>Priority: Major
>
> *Describe the problem you faced*
> We use Structured Streaming to subscribe to the data in Kafka, and then write 
> the data to the hoodie, the program will stop abnormally after running for a 
> period of time.
> *Environment Description*
>  * Hudi version :0.6.0
>  * Spark version :2.4.0
>  * Hive version :2.1.1
>  * Hadoop version :3.0.0
>  * Storage (HDFS/S3/GCS..) :HDFS
>  * Running on Docker? (yes/no) :no



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Issue Comment Deleted] (HUDI-1855) RejectedExecutionException FutureTask rejected from ThreadPoolExecutor[Terminated...]

2021-04-26 Thread Jira


 [ 
https://issues.apache.org/jira/browse/HUDI-1855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

李建康 updated HUDI-1855:
--
Comment: was deleted

(was: [[SUPPORT]RejectedExecutionException FutureTask rejected from 
ThreadPoolExecutor[Terminated...] · Issue #2723 · apache/hudi 
(github.com)|https://github.com/apache/hudi/issues/2723])

> RejectedExecutionException FutureTask rejected from 
> ThreadPoolExecutor[Terminated...] 
> --
>
> Key: HUDI-1855
> URL: https://issues.apache.org/jira/browse/HUDI-1855
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: Cleaner
>Affects Versions: 0.6.0
>Reporter: 李建康
>Priority: Major
>
> *Describe the problem you faced*
> We use Structured Streaming to subscribe to the data in Kafka, and then write 
> the data to the hoodie, the program will stop abnormally after running for a 
> period of time.
> *Environment Description*
>  * Hudi version :0.6.0
>  * Spark version :2.4.0
>  * Hive version :2.1.1
>  * Hadoop version :3.0.0
>  * Storage (HDFS/S3/GCS..) :HDFS
>  * Running on Docker? (yes/no) :no



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [hudi] n3nash commented on issue #2657: [SUPPORT] SparkSQL/Hive query fails if there are two or more record array fields in MOR table.

2021-04-26 Thread GitBox


n3nash commented on issue #2657:
URL: https://github.com/apache/hudi/issues/2657#issuecomment-827312061


   @garyli1019 Are you able to pick this up now ?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] liijiankang commented on issue #2723: [SUPPORT]RejectedExecutionException FutureTask rejected from ThreadPoolExecutor[Terminated...]

2021-04-26 Thread GitBox


liijiankang commented on issue #2723:
URL: https://github.com/apache/hudi/issues/2723#issuecomment-827312076


   @n3nash https://issues.apache.org/jira/browse/HUDI-1855


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[jira] [Created] (HUDI-1855) RejectedExecutionException FutureTask rejected from ThreadPoolExecutor[Terminated...]

2021-04-26 Thread Jira
李建康 created HUDI-1855:
-

 Summary: RejectedExecutionException FutureTask rejected from 
ThreadPoolExecutor[Terminated...] 
 Key: HUDI-1855
 URL: https://issues.apache.org/jira/browse/HUDI-1855
 Project: Apache Hudi
  Issue Type: Bug
  Components: Cleaner
Affects Versions: 0.6.0
Reporter: 李建康


*Describe the problem you faced*
We use Structured Streaming to subscribe to the data in Kafka, and then write 
the data to the hoodie, the program will stop abnormally after running for a 
period of time.

*Environment Description*
 * Hudi version :0.6.0

 * Spark version :2.4.0

 * Hive version :2.1.1

 * Hadoop version :3.0.0

 * Storage (HDFS/S3/GCS..) :HDFS

 * Running on Docker? (yes/no) :no



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [hudi] n3nash commented on issue #2680: [SUPPORT]Hive sync error by using run_sync_tool.sh

2021-04-26 Thread GitBox


n3nash commented on issue #2680:
URL: https://github.com/apache/hudi/issues/2680#issuecomment-827311874


   @ztcheck Since this issue is not reported by other users, my guess is that 
this has something to do with your environment. If you are able to pin point 
which jar helped resolved the issue, that will be helpful to provide a longer 
term fix, if not, we can close this ticket. Let me know


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] n3nash commented on issue #2688: [SUPPORT] Sync to Hive using Metastore

2021-04-26 Thread GitBox


n3nash commented on issue #2688:
URL: https://github.com/apache/hudi/issues/2688#issuecomment-827309433


   @rubenssoto Gentle ping 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] codecov-commenter edited a comment on pull request #2822: [Hotfix][hudi-sync] Refactor method up to parent-class

2021-04-26 Thread GitBox


codecov-commenter edited a comment on pull request #2822:
URL: https://github.com/apache/hudi/pull/2822#issuecomment-821787385


   # 
[Codecov](https://codecov.io/gh/apache/hudi/pull/2822?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 Report
   > Merging 
[#2822](https://codecov.io/gh/apache/hudi/pull/2822?src=pr=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (afd686c) into 
[master](https://codecov.io/gh/apache/hudi/commit/ab4a7b0b4afc66d2123c7f63fdab77b925a8a7f1?el=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (ab4a7b0) will **decrease** coverage by `8.97%`.
   > The diff coverage is `50.00%`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/hudi/pull/2822/graphs/tree.svg?width=650=150=pr=VTTXabwbs2_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)](https://codecov.io/gh/apache/hudi/pull/2822?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
   
   ```diff
   @@ Coverage Diff  @@
   ## master#2822  +/-   ##
   
   - Coverage 61.96%   52.99%   -8.98% 
   - Complexity  335 3744+3409 
   
 Files54  488 +434 
 Lines  199323525   +21532 
 Branches235 2501+2266 
   
   + Hits   123512467   +11232 
   - Misses  638 9957+9319 
   - Partials120 1101 +981 
   ```
   
   | Flag | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | hudicli | `39.53% <ø> (?)` | `220.00 <ø> (?)` | |
   | hudiclient | `∅ <ø> (?)` | `0.00 <ø> (?)` | |
   | hudicommon | `50.37% <ø> (?)` | `1975.00 <ø> (?)` | |
   | hudiflink | `59.65% <ø> (?)` | `536.00 <ø> (?)` | |
   | hudihadoopmr | `33.33% <ø> (?)` | `198.00 <ø> (?)` | |
   | hudisparkdatasource | `73.33% <ø> (?)` | `237.00 <ø> (?)` | |
   | hudisync | `46.39% <50.00%> (?)` | `142.00 <0.00> (?)` | |
   | huditimelineservice | `64.36% <ø> (?)` | `62.00 <ø> (?)` | |
   | hudiutilities | `69.70% <ø> (+7.73%)` | `374.00 <ø> (+39.00)` | |
   
   Flags with carried forward coverage won't be shown. [Click 
here](https://docs.codecov.io/docs/carryforward-flags?utm_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#carryforward-flags-in-the-pull-request-comment)
 to find out more.
   
   | [Impacted 
Files](https://codecov.io/gh/apache/hudi/pull/2822?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...main/java/org/apache/hudi/dla/HoodieDLAClient.java](https://codecov.io/gh/apache/hudi/pull/2822/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1zeW5jL2h1ZGktZGxhLXN5bmMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvZGxhL0hvb2RpZURMQUNsaWVudC5qYXZh)
 | `0.00% <0.00%> (ø)` | `0.00 <0.00> (?)` | |
   | 
[...in/java/org/apache/hudi/hive/HoodieHiveClient.java](https://codecov.io/gh/apache/hudi/pull/2822/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1zeW5jL2h1ZGktaGl2ZS1zeW5jL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9odWRpL2hpdmUvSG9vZGllSGl2ZUNsaWVudC5qYXZh)
 | `71.25% <100.00%> (ø)` | `41.00 <0.00> (?)` | |
   | 
[...org/apache/hudi/utilities/HoodieClusteringJob.java](https://codecov.io/gh/apache/hudi/pull/2822/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL0hvb2RpZUNsdXN0ZXJpbmdKb2IuamF2YQ==)
 | `62.50% <0.00%> (-2.72%)` | `9.00% <0.00%> (ø%)` | |
   | 
[...apache/hudi/utilities/deltastreamer/DeltaSync.java](https://codecov.io/gh/apache/hudi/pull/2822/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL2RlbHRhc3RyZWFtZXIvRGVsdGFTeW5jLmphdmE=)
 | `71.08% <0.00%> (-0.35%)` | `55.00% <0.00%> (-1.00%)` | |
   | 
[...org/apache/hudi/common/model/HoodieFileFormat.java](https://codecov.io/gh/apache/hudi/pull/2822/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1jb21tb24vc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvY29tbW9uL21vZGVsL0hvb2RpZUZpbGVGb3JtYXQuamF2YQ==)
 | `100.00% <0.00%> (ø)` | `3.00% <0.00%> (?%)` | |
   | 

[GitHub] [hudi] codecov-commenter edited a comment on pull request #2822: [Hotfix][hudi-sync] Refactor method up to parent-class

2021-04-26 Thread GitBox


codecov-commenter edited a comment on pull request #2822:
URL: https://github.com/apache/hudi/pull/2822#issuecomment-821787385


   # 
[Codecov](https://codecov.io/gh/apache/hudi/pull/2822?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 Report
   > Merging 
[#2822](https://codecov.io/gh/apache/hudi/pull/2822?src=pr=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (afd686c) into 
[master](https://codecov.io/gh/apache/hudi/commit/ab4a7b0b4afc66d2123c7f63fdab77b925a8a7f1?el=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (ab4a7b0) will **decrease** coverage by `1.04%`.
   > The diff coverage is `50.00%`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/hudi/pull/2822/graphs/tree.svg?width=650=150=pr=VTTXabwbs2_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)](https://codecov.io/gh/apache/hudi/pull/2822?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
   
   ```diff
   @@ Coverage Diff  @@
   ## master#2822  +/-   ##
   
   - Coverage 61.96%   60.92%   -1.05% 
   - Complexity  335  516 +181 
   
 Files54   72  +18 
 Lines  1993 3204+1211 
 Branches235  377 +142 
   
   + Hits   1235 1952 +717 
   - Misses  638 1058 +420 
   - Partials120  194  +74 
   ```
   
   | Flag | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | hudisync | `46.39% <50.00%> (?)` | `142.00 <0.00> (?)` | |
   | hudiutilities | `69.70% <ø> (+7.73%)` | `374.00 <ø> (+39.00)` | |
   
   Flags with carried forward coverage won't be shown. [Click 
here](https://docs.codecov.io/docs/carryforward-flags?utm_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#carryforward-flags-in-the-pull-request-comment)
 to find out more.
   
   | [Impacted 
Files](https://codecov.io/gh/apache/hudi/pull/2822?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...main/java/org/apache/hudi/dla/HoodieDLAClient.java](https://codecov.io/gh/apache/hudi/pull/2822/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1zeW5jL2h1ZGktZGxhLXN5bmMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvZGxhL0hvb2RpZURMQUNsaWVudC5qYXZh)
 | `0.00% <0.00%> (ø)` | `0.00 <0.00> (?)` | |
   | 
[...in/java/org/apache/hudi/hive/HoodieHiveClient.java](https://codecov.io/gh/apache/hudi/pull/2822/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1zeW5jL2h1ZGktaGl2ZS1zeW5jL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9odWRpL2hpdmUvSG9vZGllSGl2ZUNsaWVudC5qYXZh)
 | `71.25% <100.00%> (ø)` | `41.00 <0.00> (?)` | |
   | 
[...org/apache/hudi/utilities/HoodieClusteringJob.java](https://codecov.io/gh/apache/hudi/pull/2822/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL0hvb2RpZUNsdXN0ZXJpbmdKb2IuamF2YQ==)
 | `62.50% <0.00%> (-2.72%)` | `9.00% <0.00%> (ø%)` | |
   | 
[...apache/hudi/utilities/deltastreamer/DeltaSync.java](https://codecov.io/gh/apache/hudi/pull/2822/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL2RlbHRhc3RyZWFtZXIvRGVsdGFTeW5jLmphdmE=)
 | `71.08% <0.00%> (-0.35%)` | `55.00% <0.00%> (-1.00%)` | |
   | 
[.../org/apache/hudi/hive/NonPartitionedExtractor.java](https://codecov.io/gh/apache/hudi/pull/2822/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1zeW5jL2h1ZGktaGl2ZS1zeW5jL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9odWRpL2hpdmUvTm9uUGFydGl0aW9uZWRFeHRyYWN0b3IuamF2YQ==)
 | `100.00% <0.00%> (ø)` | `2.00% <0.00%> (?%)` | |
   | 
[...ache/hudi/hive/HiveMetastoreBasedLockProvider.java](https://codecov.io/gh/apache/hudi/pull/2822/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1zeW5jL2h1ZGktaGl2ZS1zeW5jL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9odWRpL2hpdmUvSGl2ZU1ldGFzdG9yZUJhc2VkTG9ja1Byb3ZpZGVyLmphdmE=)
 | `0.00% <0.00%> (ø)` | `0.00% <0.00%> (?%)` | |
   | 

[jira] [Updated] (HUDI-1854) Corrupt blocks in GCS log files

2021-04-26 Thread Nishith Agarwal (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nishith Agarwal updated HUDI-1854:
--
Labels: sev:critical sev:triage  (was: )

> Corrupt blocks in GCS log files
> ---
>
> Key: HUDI-1854
> URL: https://issues.apache.org/jira/browse/HUDI-1854
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: Common Core
>Reporter: Nishith Agarwal
>Priority: Major
>  Labels: sev:critical, sev:triage
>
> Details on how to reproduce this can be found here -> 
> [https://github.com/apache/hudi/issues/2692]
>  
> We need a GCS, google data proc environment to reproduce this. 
>  
> [~vburenin] Would you be able to help try out hudi 0.7 and follow the steps 
> mentioned in this ticket to help reproduce this issue and find the root cause 
> ?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HUDI-1854) Corrupt blocks in GCS log files

2021-04-26 Thread Nishith Agarwal (Jira)
Nishith Agarwal created HUDI-1854:
-

 Summary: Corrupt blocks in GCS log files
 Key: HUDI-1854
 URL: https://issues.apache.org/jira/browse/HUDI-1854
 Project: Apache Hudi
  Issue Type: Bug
  Components: Common Core
Reporter: Nishith Agarwal


Details on how to reproduce this can be found here -> 
[https://github.com/apache/hudi/issues/2692]

 

We need a GCS, google data proc environment to reproduce this. 

 

[~vburenin] Would you be able to help try out hudi 0.7 and follow the steps 
mentioned in this ticket to help reproduce this issue and find the root cause ?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [hudi] n3nash commented on issue #2723: [SUPPORT]RejectedExecutionException FutureTask rejected from ThreadPoolExecutor[Terminated...]

2021-04-26 Thread GitBox


n3nash commented on issue #2723:
URL: https://github.com/apache/hudi/issues/2723#issuecomment-827307848


   @liijiankang Does this issue come back when you turn on async cleaning ? If 
yes, can you file a JIRA ticket and ping back the ticket here so we can look 
into this ?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] n3nash commented on issue #2774: [SUPPORT] NoSuchMethodError on exposing metrics

2021-04-26 Thread GitBox


n3nash commented on issue #2774:
URL: https://github.com/apache/hudi/issues/2774#issuecomment-827306636


   @vobukh Closing this ticket since this seems to have resolved your issue. 
Feel free to re-open if otherwise


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] n3nash closed issue #2774: [SUPPORT] NoSuchMethodError on exposing metrics

2021-04-26 Thread GitBox


n3nash closed issue #2774:
URL: https://github.com/apache/hudi/issues/2774


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] codecov-commenter edited a comment on pull request #2822: [Hotfix][hudi-sync] Refactor method up to parent-class

2021-04-26 Thread GitBox


codecov-commenter edited a comment on pull request #2822:
URL: https://github.com/apache/hudi/pull/2822#issuecomment-821787385


   # 
[Codecov](https://codecov.io/gh/apache/hudi/pull/2822?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 Report
   > Merging 
[#2822](https://codecov.io/gh/apache/hudi/pull/2822?src=pr=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (afd686c) into 
[master](https://codecov.io/gh/apache/hudi/commit/ab4a7b0b4afc66d2123c7f63fdab77b925a8a7f1?el=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (ab4a7b0) will **increase** coverage by `7.73%`.
   > The diff coverage is `n/a`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/hudi/pull/2822/graphs/tree.svg?width=650=150=pr=VTTXabwbs2_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)](https://codecov.io/gh/apache/hudi/pull/2822?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
   
   ```diff
   @@ Coverage Diff  @@
   ## master#2822  +/-   ##
   
   + Coverage 61.96%   69.70%   +7.73% 
   - Complexity  335  374  +39 
   
 Files54   54  
 Lines  1993 1997   +4 
 Branches235  236   +1 
   
   + Hits   1235 1392 +157 
   + Misses  638  473 -165 
   - Partials120  132  +12 
   ```
   
   | Flag | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | hudiutilities | `69.70% <ø> (+7.73%)` | `374.00 <ø> (+39.00)` | |
   
   Flags with carried forward coverage won't be shown. [Click 
here](https://docs.codecov.io/docs/carryforward-flags?utm_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#carryforward-flags-in-the-pull-request-comment)
 to find out more.
   
   | [Impacted 
Files](https://codecov.io/gh/apache/hudi/pull/2822?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...org/apache/hudi/utilities/HoodieClusteringJob.java](https://codecov.io/gh/apache/hudi/pull/2822/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL0hvb2RpZUNsdXN0ZXJpbmdKb2IuamF2YQ==)
 | `62.50% <0.00%> (-2.72%)` | `9.00% <0.00%> (ø%)` | |
   | 
[...apache/hudi/utilities/deltastreamer/DeltaSync.java](https://codecov.io/gh/apache/hudi/pull/2822/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL2RlbHRhc3RyZWFtZXIvRGVsdGFTeW5jLmphdmE=)
 | `71.08% <0.00%> (-0.35%)` | `55.00% <0.00%> (-1.00%)` | |
   | 
[...s/deltastreamer/HoodieMultiTableDeltaStreamer.java](https://codecov.io/gh/apache/hudi/pull/2822/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL2RlbHRhc3RyZWFtZXIvSG9vZGllTXVsdGlUYWJsZURlbHRhU3RyZWFtZXIuamF2YQ==)
 | `78.52% <0.00%> (+0.13%)` | `19.00% <0.00%> (+1.00%)` | |
   | 
[...in/java/org/apache/hudi/utilities/UtilHelpers.java](https://codecov.io/gh/apache/hudi/pull/2822/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL1V0aWxIZWxwZXJzLmphdmE=)
 | `64.53% <0.00%> (+23.83%)` | `33.00% <0.00%> (+6.00%)` | |
   | 
[...he/hudi/utilities/transform/AWSDmsTransformer.java](https://codecov.io/gh/apache/hudi/pull/2822/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL3RyYW5zZm9ybS9BV1NEbXNUcmFuc2Zvcm1lci5qYXZh)
 | `66.66% <0.00%> (+66.66%)` | `2.00% <0.00%> (+2.00%)` | |
   | 
[...hudi/utilities/schema/JdbcbasedSchemaProvider.java](https://codecov.io/gh/apache/hudi/pull/2822/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL3NjaGVtYS9KZGJjYmFzZWRTY2hlbWFQcm92aWRlci5qYXZh)
 | `72.22% <0.00%> (+72.22%)` | `2.00% <0.00%> (+2.00%)` | |
   | 

[jira] [Updated] (HUDI-1853) Add flink index and compaction options to document

2021-04-26 Thread Danny Chen (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-1853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Danny Chen updated HUDI-1853:
-
Fix Version/s: 0.9.0

> Add flink index and compaction options to document
> --
>
> Key: HUDI-1853
> URL: https://issues.apache.org/jira/browse/HUDI-1853
> Project: Apache Hudi
>  Issue Type: Task
>  Components: Flink Integration
>Reporter: Danny Chen
>Assignee: Danny Chen
>Priority: Major
> Fix For: 0.9.0
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HUDI-1853) Add flink index and compaction options to document

2021-04-26 Thread Danny Chen (Jira)
Danny Chen created HUDI-1853:


 Summary: Add flink index and compaction options to document
 Key: HUDI-1853
 URL: https://issues.apache.org/jira/browse/HUDI-1853
 Project: Apache Hudi
  Issue Type: Task
  Components: Flink Integration
Reporter: Danny Chen






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (HUDI-1853) Add flink index and compaction options to document

2021-04-26 Thread Danny Chen (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-1853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Danny Chen reassigned HUDI-1853:


Assignee: Danny Chen

> Add flink index and compaction options to document
> --
>
> Key: HUDI-1853
> URL: https://issues.apache.org/jira/browse/HUDI-1853
> Project: Apache Hudi
>  Issue Type: Task
>  Components: Flink Integration
>Reporter: Danny Chen
>Assignee: Danny Chen
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [hudi] n3nash edited a comment on issue #2874: [SUPPORT] Can not query mor `_rt` table by FlinkSQL Writing .

2021-04-26 Thread GitBox


n3nash edited a comment on issue #2874:
URL: https://github.com/apache/hudi/issues/2874#issuecomment-827298091


   @danny0405 Thanks for that explanation. This is a hive reader not the spark 
reader, did you mean hive ? Is there already a JIRA filed for this ? 
   Does all implementation of flink use global index or is there a non-global 
index implementation for merge on read tables ?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] n3nash edited a comment on issue #2874: [SUPPORT] Can not query mor `_rt` table by FlinkSQL Writing .

2021-04-26 Thread GitBox


n3nash edited a comment on issue #2874:
URL: https://github.com/apache/hudi/issues/2874#issuecomment-827298091


   @danny0405 Thanks for that explanation. This is a hive path not the spark 
reader, did you mean hive ? Is there already a JIRA filed for this ? 
   Does all implementation of flink use global index ? Or is there a non-global 
index implementation for merge on read tables ?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] n3nash commented on issue #2874: [SUPPORT] Can not query mor `_rt` table by FlinkSQL Writing .

2021-04-26 Thread GitBox


n3nash commented on issue #2874:
URL: https://github.com/apache/hudi/issues/2874#issuecomment-827298091


   @danny0405 Thanks for that explanation. Is there already a JIRA filed for 
this ? Does all implementation of flink use global index ? Or is there a 
non-global index implementation for merge on read tables ?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] RocMarshal commented on a change in pull request #2822: [Hotfix][hudi-sync] Refactor method up to parent-class

2021-04-26 Thread GitBox


RocMarshal commented on a change in pull request #2822:
URL: https://github.com/apache/hudi/pull/2822#discussion_r620836739



##
File path: 
hudi-sync/hudi-sync-common/src/main/java/org/apache/hudi/sync/common/AbstractSyncHoodieClient.java
##
@@ -29,20 +29,25 @@
 
 import org.apache.hadoop.fs.FileSystem;
 import org.apache.hadoop.fs.Path;
+import org.apache.hudi.common.util.ValidationUtils;
 import org.apache.log4j.LogManager;
 import org.apache.log4j.Logger;
 import org.apache.parquet.schema.MessageType;
 
+import java.io.Serializable;
 import java.sql.ResultSet;
 import java.sql.SQLException;
 import java.sql.Statement;
 import java.util.List;
 import java.util.Map;
+import java.util.Objects;
 
 public abstract class AbstractSyncHoodieClient {
 
   private static final Logger LOG = 
LogManager.getLogger(AbstractSyncHoodieClient.class);
 
+  public static final TypeConverter DECIMAL_TYPE_CONVERTOR = new 
TypeConverter() {};

Review comment:
   @leesf Done. Thank you for your suggestions.




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] RocMarshal commented on a change in pull request #2822: [Hotfix][hudi-sync] Refactor method up to parent-class

2021-04-26 Thread GitBox


RocMarshal commented on a change in pull request #2822:
URL: https://github.com/apache/hudi/pull/2822#discussion_r620411008



##
File path: 
hudi-sync/hudi-sync-common/src/main/java/org/apache/hudi/sync/common/AbstractSyncHoodieClient.java
##
@@ -29,20 +29,25 @@
 
 import org.apache.hadoop.fs.FileSystem;
 import org.apache.hadoop.fs.Path;
+import org.apache.hudi.common.util.ValidationUtils;
 import org.apache.log4j.LogManager;
 import org.apache.log4j.Logger;
 import org.apache.parquet.schema.MessageType;
 
+import java.io.Serializable;
 import java.sql.ResultSet;
 import java.sql.SQLException;
 import java.sql.Statement;
 import java.util.List;
 import java.util.Map;
+import java.util.Objects;
 
 public abstract class AbstractSyncHoodieClient {
 
   private static final Logger LOG = 
LogManager.getLogger(AbstractSyncHoodieClient.class);
 
+  public static final TypeConverter DECIMAL_TYPE_CONVERTOR = new 
TypeConverter() {};

Review comment:
   IMO, The current default implementation is the default implementation of 
the decimal type convertor. And we would do a special implementation by 
extending `TypeConverter` in the specific scene. So, keeping 
`DECIMAL_TYPE_CONVERTOR` would be a fine choice in the case...




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] codecov-commenter edited a comment on pull request #2884: [HUDI-1852]Add SCHEMA_REGISTRY_SOURCE_URL_SUFFIX and SCHEMA_REGISTRY_TARGET_URL_SUFFIX property

2021-04-26 Thread GitBox


codecov-commenter edited a comment on pull request #2884:
URL: https://github.com/apache/hudi/pull/2884#issuecomment-827288106


   # 
[Codecov](https://codecov.io/gh/apache/hudi/pull/2884?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 Report
   > Merging 
[#2884](https://codecov.io/gh/apache/hudi/pull/2884?src=pr=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (6f9f3ec) into 
[master](https://codecov.io/gh/apache/hudi/commit/9bbb458e888273af95b04f72cc5b428524b41487?el=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (9bbb458) will **increase** coverage by `0.04%`.
   > The diff coverage is `0.00%`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/hudi/pull/2884/graphs/tree.svg?width=650=150=pr=VTTXabwbs2_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)](https://codecov.io/gh/apache/hudi/pull/2884?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
   
   ```diff
   @@ Coverage Diff  @@
   ## master#2884  +/-   ##
   
   + Coverage 52.93%   52.97%   +0.04% 
   - Complexity 3742 3745   +3 
   
 Files   488  488  
 Lines 2351123544  +33 
 Branches   2501 2504   +3 
   
   + Hits  1244612473  +27 
   - Misses 9966 9971   +5 
   - Partials   1099 1100   +1 
   ```
   
   | Flag | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | hudicli | `39.53% <ø> (ø)` | `220.00 <ø> (ø)` | |
   | hudiclient | `∅ <ø> (∅)` | `0.00 <ø> (ø)` | |
   | hudicommon | `50.38% <ø> (+0.01%)` | `1975.00 <ø> (ø)` | |
   | hudiflink | `59.65% <ø> (+0.19%)` | `536.00 <ø> (+2.00)` | |
   | hudihadoopmr | `33.33% <ø> (ø)` | `198.00 <ø> (ø)` | |
   | hudisparkdatasource | `73.33% <ø> (ø)` | `237.00 <ø> (ø)` | |
   | hudisync | `46.10% <ø> (ø)` | `142.00 <ø> (ø)` | |
   | huditimelineservice | `64.36% <ø> (ø)` | `62.00 <ø> (ø)` | |
   | hudiutilities | `69.58% <0.00%> (-0.13%)` | `375.00 <0.00> (+1.00)` | 
:arrow_down: |
   
   Flags with carried forward coverage won't be shown. [Click 
here](https://docs.codecov.io/docs/carryforward-flags?utm_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#carryforward-flags-in-the-pull-request-comment)
 to find out more.
   
   | [Impacted 
Files](https://codecov.io/gh/apache/hudi/pull/2884?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...s/deltastreamer/HoodieMultiTableDeltaStreamer.java](https://codecov.io/gh/apache/hudi/pull/2884/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL2RlbHRhc3RyZWFtZXIvSG9vZGllTXVsdGlUYWJsZURlbHRhU3RyZWFtZXIuamF2YQ==)
 | `76.19% <0.00%> (-2.34%)` | `19.00 <0.00> (ø)` | |
   | 
[...java/org/apache/hudi/sink/StreamWriteFunction.java](https://codecov.io/gh/apache/hudi/pull/2884/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1mbGluay9zcmMvbWFpbi9qYXZhL29yZy9hcGFjaGUvaHVkaS9zaW5rL1N0cmVhbVdyaXRlRnVuY3Rpb24uamF2YQ==)
 | `79.24% <0.00%> (-1.68%)` | `24.00% <0.00%> (ø%)` | |
   | 
[...va/org/apache/hudi/configuration/FlinkOptions.java](https://codecov.io/gh/apache/hudi/pull/2884/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1mbGluay9zcmMvbWFpbi9qYXZhL29yZy9hcGFjaGUvaHVkaS9jb25maWd1cmF0aW9uL0ZsaW5rT3B0aW9ucy5qYXZh)
 | `90.48% <0.00%> (+0.28%)` | `11.00% <0.00%> (ø%)` | |
   | 
[...apache/hudi/utilities/deltastreamer/DeltaSync.java](https://codecov.io/gh/apache/hudi/pull/2884/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL2RlbHRhc3RyZWFtZXIvRGVsdGFTeW5jLmphdmE=)
 | `71.42% <0.00%> (+0.34%)` | `56.00% <0.00%> (+1.00%)` | |
   | 
[...c/main/java/org/apache/hudi/util/StreamerUtil.java](https://codecov.io/gh/apache/hudi/pull/2884/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1mbGluay9zcmMvbWFpbi9qYXZhL29yZy9hcGFjaGUvaHVkaS91dGlsL1N0cmVhbWVyVXRpbC5qYXZh)
 | `58.33% <0.00%> (+0.35%)` | `18.00% <0.00%> (ø%)` | |
   | 

[GitHub] [hudi] codecov-commenter edited a comment on pull request #2884: [HUDI-1852]Add SCHEMA_REGISTRY_SOURCE_URL_SUFFIX and SCHEMA_REGISTRY_TARGET_URL_SUFFIX property

2021-04-26 Thread GitBox


codecov-commenter edited a comment on pull request #2884:
URL: https://github.com/apache/hudi/pull/2884#issuecomment-827288106


   # 
[Codecov](https://codecov.io/gh/apache/hudi/pull/2884?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 Report
   > Merging 
[#2884](https://codecov.io/gh/apache/hudi/pull/2884?src=pr=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (6f9f3ec) into 
[master](https://codecov.io/gh/apache/hudi/commit/9bbb458e888273af95b04f72cc5b428524b41487?el=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (9bbb458) will **increase** coverage by `0.04%`.
   > The diff coverage is `0.00%`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/hudi/pull/2884/graphs/tree.svg?width=650=150=pr=VTTXabwbs2_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)](https://codecov.io/gh/apache/hudi/pull/2884?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
   
   ```diff
   @@ Coverage Diff  @@
   ## master#2884  +/-   ##
   
   + Coverage 52.93%   52.97%   +0.04% 
   - Complexity 3742 3745   +3 
   
 Files   488  488  
 Lines 2351123544  +33 
 Branches   2501 2504   +3 
   
   + Hits  1244612473  +27 
   - Misses 9966 9971   +5 
   - Partials   1099 1100   +1 
   ```
   
   | Flag | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | hudicli | `39.53% <ø> (ø)` | `220.00 <ø> (ø)` | |
   | hudiclient | `∅ <ø> (∅)` | `0.00 <ø> (ø)` | |
   | hudicommon | `50.38% <ø> (+0.01%)` | `1975.00 <ø> (ø)` | |
   | hudiflink | `59.65% <ø> (+0.19%)` | `536.00 <ø> (+2.00)` | |
   | hudihadoopmr | `33.33% <ø> (ø)` | `198.00 <ø> (ø)` | |
   | hudisparkdatasource | `73.33% <ø> (ø)` | `237.00 <ø> (ø)` | |
   | hudisync | `46.10% <ø> (ø)` | `142.00 <ø> (ø)` | |
   | huditimelineservice | `64.36% <ø> (ø)` | `62.00 <ø> (ø)` | |
   | hudiutilities | `69.58% <0.00%> (-0.13%)` | `375.00 <0.00> (+1.00)` | 
:arrow_down: |
   
   Flags with carried forward coverage won't be shown. [Click 
here](https://docs.codecov.io/docs/carryforward-flags?utm_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#carryforward-flags-in-the-pull-request-comment)
 to find out more.
   
   | [Impacted 
Files](https://codecov.io/gh/apache/hudi/pull/2884?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...s/deltastreamer/HoodieMultiTableDeltaStreamer.java](https://codecov.io/gh/apache/hudi/pull/2884/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL2RlbHRhc3RyZWFtZXIvSG9vZGllTXVsdGlUYWJsZURlbHRhU3RyZWFtZXIuamF2YQ==)
 | `76.19% <0.00%> (-2.34%)` | `19.00 <0.00> (ø)` | |
   | 
[...java/org/apache/hudi/sink/StreamWriteFunction.java](https://codecov.io/gh/apache/hudi/pull/2884/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1mbGluay9zcmMvbWFpbi9qYXZhL29yZy9hcGFjaGUvaHVkaS9zaW5rL1N0cmVhbVdyaXRlRnVuY3Rpb24uamF2YQ==)
 | `79.24% <0.00%> (-1.68%)` | `24.00% <0.00%> (ø%)` | |
   | 
[...va/org/apache/hudi/configuration/FlinkOptions.java](https://codecov.io/gh/apache/hudi/pull/2884/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1mbGluay9zcmMvbWFpbi9qYXZhL29yZy9hcGFjaGUvaHVkaS9jb25maWd1cmF0aW9uL0ZsaW5rT3B0aW9ucy5qYXZh)
 | `90.48% <0.00%> (+0.28%)` | `11.00% <0.00%> (ø%)` | |
   | 
[...apache/hudi/utilities/deltastreamer/DeltaSync.java](https://codecov.io/gh/apache/hudi/pull/2884/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL2RlbHRhc3RyZWFtZXIvRGVsdGFTeW5jLmphdmE=)
 | `71.42% <0.00%> (+0.34%)` | `56.00% <0.00%> (+1.00%)` | |
   | 
[...c/main/java/org/apache/hudi/util/StreamerUtil.java](https://codecov.io/gh/apache/hudi/pull/2884/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1mbGluay9zcmMvbWFpbi9qYXZhL29yZy9hcGFjaGUvaHVkaS91dGlsL1N0cmVhbWVyVXRpbC5qYXZh)
 | `58.33% <0.00%> (+0.35%)` | `18.00% <0.00%> (ø%)` | |
   | 

[GitHub] [hudi] codecov-commenter edited a comment on pull request #2884: [HUDI-1852]Add SCHEMA_REGISTRY_SOURCE_URL_SUFFIX and SCHEMA_REGISTRY_TARGET_URL_SUFFIX property

2021-04-26 Thread GitBox


codecov-commenter edited a comment on pull request #2884:
URL: https://github.com/apache/hudi/pull/2884#issuecomment-827288106


   # 
[Codecov](https://codecov.io/gh/apache/hudi/pull/2884?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 Report
   > Merging 
[#2884](https://codecov.io/gh/apache/hudi/pull/2884?src=pr=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (6f9f3ec) into 
[master](https://codecov.io/gh/apache/hudi/commit/9bbb458e888273af95b04f72cc5b428524b41487?el=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (9bbb458) will **decrease** coverage by `1.64%`.
   > The diff coverage is `0.00%`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/hudi/pull/2884/graphs/tree.svg?width=650=150=pr=VTTXabwbs2_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)](https://codecov.io/gh/apache/hudi/pull/2884?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
   
   ```diff
   @@ Coverage Diff  @@
   ## master#2884  +/-   ##
   
   - Coverage 52.93%   51.29%   -1.65% 
   + Complexity 3742 3304 -438 
   
 Files   488  425  -63 
 Lines 2351120055-3456 
 Branches   2501 2084 -417 
   
   - Hits  1244610287-2159 
   + Misses 9966 8914-1052 
   + Partials   1099  854 -245 
   ```
   
   | Flag | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | hudicli | `39.53% <ø> (ø)` | `220.00 <ø> (ø)` | |
   | hudiclient | `∅ <ø> (∅)` | `0.00 <ø> (ø)` | |
   | hudicommon | `50.38% <ø> (+0.01%)` | `1975.00 <ø> (ø)` | |
   | hudiflink | `59.65% <ø> (+0.19%)` | `536.00 <ø> (+2.00)` | |
   | hudihadoopmr | `33.33% <ø> (ø)` | `198.00 <ø> (ø)` | |
   | hudisparkdatasource | `?` | `?` | |
   | hudisync | `?` | `?` | |
   | huditimelineservice | `?` | `?` | |
   | hudiutilities | `69.58% <0.00%> (-0.13%)` | `375.00 <0.00> (+1.00)` | 
:arrow_down: |
   
   Flags with carried forward coverage won't be shown. [Click 
here](https://docs.codecov.io/docs/carryforward-flags?utm_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#carryforward-flags-in-the-pull-request-comment)
 to find out more.
   
   | [Impacted 
Files](https://codecov.io/gh/apache/hudi/pull/2884?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...s/deltastreamer/HoodieMultiTableDeltaStreamer.java](https://codecov.io/gh/apache/hudi/pull/2884/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL2RlbHRhc3RyZWFtZXIvSG9vZGllTXVsdGlUYWJsZURlbHRhU3RyZWFtZXIuamF2YQ==)
 | `76.19% <0.00%> (-2.34%)` | `19.00 <0.00> (ø)` | |
   | 
[...java/org/apache/hudi/sink/StreamWriteFunction.java](https://codecov.io/gh/apache/hudi/pull/2884/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1mbGluay9zcmMvbWFpbi9qYXZhL29yZy9hcGFjaGUvaHVkaS9zaW5rL1N0cmVhbVdyaXRlRnVuY3Rpb24uamF2YQ==)
 | `79.24% <0.00%> (-1.68%)` | `24.00% <0.00%> (ø%)` | |
   | 
[.../src/main/java/org/apache/hudi/dla/util/Utils.java](https://codecov.io/gh/apache/hudi/pull/2884/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1zeW5jL2h1ZGktZGxhLXN5bmMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvZGxhL3V0aWwvVXRpbHMuamF2YQ==)
 | | | |
   | 
[...g/apache/hudi/timeline/service/RequestHandler.java](https://codecov.io/gh/apache/hudi/pull/2884/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS10aW1lbGluZS1zZXJ2aWNlL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9odWRpL3RpbWVsaW5lL3NlcnZpY2UvUmVxdWVzdEhhbmRsZXIuamF2YQ==)
 | | | |
   | 
[...src/main/java/org/apache/hudi/dla/DLASyncTool.java](https://codecov.io/gh/apache/hudi/pull/2884/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1zeW5jL2h1ZGktZGxhLXN5bmMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvZGxhL0RMQVN5bmNUb29sLmphdmE=)
 | | | |
   | 

[jira] [Resolved] (HUDI-1415) Read Hoodie Table As Spark DataSource Table

2021-04-26 Thread pengzhiwei (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-1415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

pengzhiwei resolved HUDI-1415.
--
Resolution: Fixed

> Read Hoodie Table As Spark DataSource Table 
> 
>
> Key: HUDI-1415
> URL: https://issues.apache.org/jira/browse/HUDI-1415
> Project: Apache Hudi
>  Issue Type: Improvement
>  Components: Spark Integration
>Affects Versions: 0.9.0
>Reporter: pengzhiwei
>Assignee: pengzhiwei
>Priority: Major
>  Labels: pull-request-available, user-support-issues
> Fix For: 0.9.0
>
>
>  Currently hudi can sync the meta data to hive meta store using HiveSyncTool. 
> The table description  synced to hive  just like this:
> {code:java}
> CREATE EXTERNAL TABLE `tbl_price_insert0`(
>   `_hoodie_commit_time` string, 
>   `_hoodie_commit_seqno` string, 
>   `_hoodie_record_key` string, 
>   `_hoodie_partition_path` string, 
>   `_hoodie_file_name` string, 
>   `id` int, 
>   `name` string, 
>   `price` double,
>   `version` int, 
>   `dt` string)
> ROW FORMAT SERDE 
>   'org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe' 
> STORED AS INPUTFORMAT 
>   'org.apache.hudi.hadoop.HoodieParquetInputFormat' 
> OUTPUTFORMAT 
>   'org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat'
> LOCATION
>   'file:/tmp/hudi/tbl_price_insert0'
> TBLPROPERTIES (
>   'last_commit_time_sync'='20201124105009', 
>   'transient_lastDdlTime'='1606186222')
> {code}
> When we query this table using spark sql, it trait it as a Hive Table, not a 
> spark data source table and convert it to parquet LogicalRelation in 
> HiveStrategies#RelationConversions. As a result, spark sql read the hudi 
> table just like a parquet data source.  This lead to an incorrect query 
> result if user missing set the spark.sql.hive.convertMetastoreParquet=false.
> Inorder to query hudi table as data source table in spark, more table 
> properties and serde properties must be added to the hive meta,just like the 
> follow:
> {code:java}
> CREATE EXTERNAL TABLE `tbl_price_cow0`(
>   `_hoodie_commit_time` string, 
>   `_hoodie_commit_seqno` string, 
>   `_hoodie_record_key` string, 
>   `_hoodie_partition_path` string, 
>   `_hoodie_file_name` string, 
>   `id` int, 
>   `name` string, 
>   `price` double,
>   `version` int)
> ROW FORMAT SERDE 
>   'org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe' 
> WITH SERDEPROPERTIES ( 
>   'path'='/tmp/hudi/tbl_price_cow0') 
> STORED AS INPUTFORMAT 
>   'org.apache.hudi.hadoop.HoodieParquetInputFormat' 
> OUTPUTFORMAT 
>   'org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat'
> LOCATION
>   'file:/tmp/hudi/tbl_price_cow0'
> TBLPROPERTIES (
>   'last_commit_time_sync'='20201124120532', 
>   'spark.sql.sources.provider'='hudi', 
>   'spark.sql.sources.schema.numParts'='1', 
>   
> 'spark.sql.sources.schema.part.0'='{"type":"struct","fields":[{"name":"id","type":"integer","nullable":false,"metadata":{}},{"name":"name","type":"string","nullable":true,"metadata":{}},{"name":"price","type":"double","nullable":false,"metadata":{}},{"name":"version","type":"integer","nullable":false,"metadata":{}}]}',
>  
>   'transient_lastDdlTime'='1606190729')
> {code}
> These are the missing table properties:
> {code:java}
> spark.sql.sources.provider= 'hudi'
> spark.sql.sources.schema.numParts = 'xx'
> spark.sql.sources.schema.part.{num} ='xx'
> spark.sql.sources.schema.numPartCols = 'xx'
> spark.sql.sources.schema.partCol.{num} = 'xx'{code}
> and serde property:
> {code:java}
> 'path'='/path/to/hudi'
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [hudi] codecov-commenter commented on pull request #2884: [HUDI-1852]Add SCHEMA_REGISTRY_SOURCE_URL_SUFFIX and SCHEMA_REGISTRY_TARGET_URL_SUFFIX property

2021-04-26 Thread GitBox


codecov-commenter commented on pull request #2884:
URL: https://github.com/apache/hudi/pull/2884#issuecomment-827288106


   # 
[Codecov](https://codecov.io/gh/apache/hudi/pull/2884?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 Report
   > Merging 
[#2884](https://codecov.io/gh/apache/hudi/pull/2884?src=pr=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (6f9f3ec) into 
[master](https://codecov.io/gh/apache/hudi/commit/9bbb458e888273af95b04f72cc5b428524b41487?el=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (9bbb458) will **increase** coverage by `16.64%`.
   > The diff coverage is `0.00%`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/hudi/pull/2884/graphs/tree.svg?width=650=150=pr=VTTXabwbs2_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)](https://codecov.io/gh/apache/hudi/pull/2884?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
   
   ```diff
   @@  Coverage Diff  @@
   ## master#2884   +/-   ##
   =
   + Coverage 52.93%   69.58%   +16.64% 
   + Complexity 3742  375 -3367 
   =
 Files   488   54  -434 
 Lines 23511 2002-21509 
 Branches   2501  237 -2264 
   =
   - Hits  12446 1393-11053 
   + Misses 9966  478 -9488 
   + Partials   1099  131  -968 
   ```
   
   | Flag | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | hudicli | `?` | `?` | |
   | hudiclient | `?` | `?` | |
   | hudicommon | `?` | `?` | |
   | hudiflink | `?` | `?` | |
   | hudihadoopmr | `?` | `?` | |
   | hudisparkdatasource | `?` | `?` | |
   | hudisync | `?` | `?` | |
   | huditimelineservice | `?` | `?` | |
   | hudiutilities | `69.58% <0.00%> (-0.13%)` | `375.00 <0.00> (+1.00)` | 
:arrow_down: |
   
   Flags with carried forward coverage won't be shown. [Click 
here](https://docs.codecov.io/docs/carryforward-flags?utm_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#carryforward-flags-in-the-pull-request-comment)
 to find out more.
   
   | [Impacted 
Files](https://codecov.io/gh/apache/hudi/pull/2884?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...s/deltastreamer/HoodieMultiTableDeltaStreamer.java](https://codecov.io/gh/apache/hudi/pull/2884/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL2RlbHRhc3RyZWFtZXIvSG9vZGllTXVsdGlUYWJsZURlbHRhU3RyZWFtZXIuamF2YQ==)
 | `76.19% <0.00%> (-2.34%)` | `19.00 <0.00> (ø)` | |
   | 
[.../java/org/apache/hudi/HoodieDataSourceHelpers.java](https://codecov.io/gh/apache/hudi/pull/2884/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1zcGFyay1kYXRhc291cmNlL2h1ZGktc3Bhcmsvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvSG9vZGllRGF0YVNvdXJjZUhlbHBlcnMuamF2YQ==)
 | | | |
   | 
[.../org/apache/hudi/hadoop/utils/HoodieHiveUtils.java](https://codecov.io/gh/apache/hudi/pull/2884/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1oYWRvb3AtbXIvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvaGFkb29wL3V0aWxzL0hvb2RpZUhpdmVVdGlscy5qYXZh)
 | | | |
   | 
[.../hudi/common/bloom/InternalDynamicBloomFilter.java](https://codecov.io/gh/apache/hudi/pull/2884/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1jb21tb24vc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvY29tbW9uL2Jsb29tL0ludGVybmFsRHluYW1pY0Jsb29tRmlsdGVyLmphdmE=)
 | | | |
   | 
[.../java/org/apache/hudi/common/util/RateLimiter.java](https://codecov.io/gh/apache/hudi/pull/2884/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1jb21tb24vc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvY29tbW9uL3V0aWwvUmF0ZUxpbWl0ZXIuamF2YQ==)
 | | | |
   | 

[GitHub] [hudi] hudi-bot edited a comment on pull request #2643: DO NOT MERGE (Azure CI) test branch ci

2021-04-26 Thread GitBox


hudi-bot edited a comment on pull request #2643:
URL: https://github.com/apache/hudi/pull/2643#issuecomment-792368481


   
   ## CI report:
   
   * 9831a6c50e9f49f8a71c02fc6ac50ae1446f7c1f UNKNOWN
   * a569dbe9409910fbb83b3764b300574c0e52612e Azure: 
[FAILURE](https://dev.azure.com/XUSH0012/0ef433cc-d4b4-47cc-b6a1-03d032ef546c/_build/results?buildId=142)
 
   * e6e9f1f1554a1474dd6c20338215030cad23a2e0 UNKNOWN
   * 2a6690a256c8cd8efe9ed2b1984b896fb27ef077 UNKNOWN
   * d8b7cca55e057a52a2e229d81e8cb52b60dc275f UNKNOWN
   * 3bce301333cc78194d13a702598b46e04fe9f85f UNKNOWN
   * f07f345baa450f3fec7eab59caa76b0fbda1e132 UNKNOWN
   * 869d2ce3fad330af93c1bb3b576824f519c6e68b UNKNOWN
   * fa86907f7522bc8dbe512d48b5a87e4a6b13f035 UNKNOWN
   * 4ebe53016ce3e0648992dbe14d04f71a92f116e6 UNKNOWN
   * 682ae9985f591f6d0c30ee2ef9b159403c1e46de UNKNOWN
   * d80397fcfeaa2996ab550bcdab4524be7420a364 UNKNOWN
   * bfe3a803e19540578b94f778f7ba7551db0f86f1 UNKNOWN
   * a632e58390eb94fcc7e757bd7580780cf184f9a8 UNKNOWN
   
   
   Bot commands
 @hudi-bot supports the following commands:
   
- `@hudi-bot run travis` re-run the last Travis build
- `@hudi-bot run azure` re-run the last Azure build
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[jira] [Updated] (HUDI-1852) Add SCHEMA_REGISTRY_SOURCE_URL_SUFFIX SCHEMA_REGISTRY_TARGET_URL_SUFFIX property

2021-04-26 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HUDI-1852:
-
Labels: pull-request-available  (was: )

> Add SCHEMA_REGISTRY_SOURCE_URL_SUFFIX SCHEMA_REGISTRY_TARGET_URL_SUFFIX 
> property
> 
>
> Key: HUDI-1852
> URL: https://issues.apache.org/jira/browse/HUDI-1852
> Project: Apache Hudi
>  Issue Type: New Feature
>  Components: Utilities
>Reporter: NickYoung
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.9.0
>
>
> Before, only schemaRegistrySuffix did not distinguish between 
> targetSchemaRegistrySuffix and sourceSchemaRegistrySuffix, which is not 
> suitable for our needs.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [hudi] NickYoungPeng opened a new pull request #2884: [HUDI-1852]Add SCHEMA_REGISTRY_SOURCE_URL_SUFFIX SCHEMA_REGISTRY_TARGET_URL_SUFFIX property

2021-04-26 Thread GitBox


NickYoungPeng opened a new pull request #2884:
URL: https://github.com/apache/hudi/pull/2884


   
   ## What is the purpose of the pull request
   
   *(For example: This pull request adds quick-start document.)*
   
   ## Brief change log
   Before, only schemaRegistrySuffix did not distinguish between 
targetSchemaRegistrySuffix and sourceSchemaRegistrySuffix, which is not 
suitable for our needs.
   
   ## Verify this pull request
   
   
   This pull request is a trivial rework / code cleanup without any test 
coverage.
   
   
   ## Committer checklist
   
- [ ] Has a corresponding JIRA in PR title & commit

- [ ] Commit message is descriptive of the change

- [ ] CI is green
   
- [ ] Necessary doc changes done or have another open PR
  
- [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[jira] [Closed] (HUDI-1841) Tweak the min max commits to keep when setting up cleaning retain commits for Flink

2021-04-26 Thread vinoyang (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-1841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

vinoyang closed HUDI-1841.
--
Resolution: Done

5be3997f70415e1752a0b5214f9398880fc8fd1f

> Tweak the min max commits to keep when setting up cleaning retain commits for 
> Flink
> ---
>
> Key: HUDI-1841
> URL: https://issues.apache.org/jira/browse/HUDI-1841
> Project: Apache Hudi
>  Issue Type: Improvement
>  Components: Flink Integration
>Reporter: Danny Chen
>Assignee: Danny Chen
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.9.0
>
>
> If the retain commits is greater that min commits to keep, reset the options:
> 1. min commits to keep: retain commits + 10
> 2. max commits to keep: retain commits + 20



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [hudi] yanghua merged pull request #2875: [HUDI-1841] Tweak the min max commits to keep when setting up cleanin…

2021-04-26 Thread GitBox


yanghua merged pull request #2875:
URL: https://github.com/apache/hudi/pull/2875


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[hudi] branch master updated (9bbb458 -> 5be3997)

2021-04-26 Thread vinoyang
This is an automated email from the ASF dual-hosted git repository.

vinoyang pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/hudi.git.


from 9bbb458  [MINOR] Remove redundant method-calling. (#2881)
 add 5be3997  [HUDI-1841] Tweak the min max commits to keep when setting up 
cleaning retain commits for Flink (#2875)

No new revisions were added by this update.

Summary of changes:
 .../apache/hudi/configuration/FlinkOptions.java| 12 +
 .../org/apache/hudi/sink/StreamWriteFunction.java  | 36 +-
 .../org/apache/hudi/table/HoodieTableFactory.java  | 18 +++
 .../java/org/apache/hudi/util/StreamerUtil.java|  1 +
 .../apache/hudi/table/TestHoodieTableFactory.java  | 56 ++
 5 files changed, 112 insertions(+), 11 deletions(-)


[jira] [Updated] (HUDI-1852) Add SCHEMA_REGISTRY_SOURCE_URL_SUFFIX SCHEMA_REGISTRY_TARGET_URL_SUFFIX property

2021-04-26 Thread NickYoung (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

NickYoung updated HUDI-1852:

Summary: Add SCHEMA_REGISTRY_SOURCE_URL_SUFFIX 
SCHEMA_REGISTRY_TARGET_URL_SUFFIX property  (was: Add 
SCHEMA_REGISTRY_SOURCE_URL_SUFFIX SCHEMA_REGISTRY_TARGET_URL_SUFFIX variable)

> Add SCHEMA_REGISTRY_SOURCE_URL_SUFFIX SCHEMA_REGISTRY_TARGET_URL_SUFFIX 
> property
> 
>
> Key: HUDI-1852
> URL: https://issues.apache.org/jira/browse/HUDI-1852
> Project: Apache Hudi
>  Issue Type: New Feature
>  Components: Utilities
>Reporter: NickYoung
>Priority: Major
> Fix For: 0.9.0
>
>
> Before, only schemaRegistrySuffix did not distinguish between 
> targetSchemaRegistrySuffix and sourceSchemaRegistrySuffix, which is not 
> suitable for our needs.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HUDI-1852) Add SCHEMA_REGISTRY_SOURCE_URL_SUFFIX SCHEMA_REGISTRY_TARGET_URL_SUFFIX variable

2021-04-26 Thread NickYoung (Jira)
NickYoung created HUDI-1852:
---

 Summary: Add SCHEMA_REGISTRY_SOURCE_URL_SUFFIX 
SCHEMA_REGISTRY_TARGET_URL_SUFFIX variable
 Key: HUDI-1852
 URL: https://issues.apache.org/jira/browse/HUDI-1852
 Project: Apache Hudi
  Issue Type: New Feature
  Components: Utilities
Reporter: NickYoung
 Fix For: 0.9.0


Before, only schemaRegistrySuffix did not distinguish between 
targetSchemaRegistrySuffix and sourceSchemaRegistrySuffix, which is not 
suitable for our needs.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (HUDI-1742) improve table level config priority in HoodieMultiTableDeltaStreamer

2021-04-26 Thread NickYoung (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-1742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

NickYoung resolved HUDI-1742.
-
Fix Version/s: 0.9.0
   Resolution: Done

> improve table level config priority in HoodieMultiTableDeltaStreamer
> 
>
> Key: HUDI-1742
> URL: https://issues.apache.org/jira/browse/HUDI-1742
> Project: Apache Hudi
>  Issue Type: Wish
>  Components: DeltaStreamer
>Reporter: NickYoung
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.9.0
>
>
> I hope that when the table-level configuration file and the public l 
> configuration file have the same configuration, the table-level configuration 
> file configuration is used。
> But now if the table-level configuration file and the public configuration 
> file have the same configuration, the configuration in the public 
> configuration file will be adopted。
> https://hudi.apache.org/blog/ingest-multiple-tables-using-hudi/



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [hudi] codecov-commenter edited a comment on pull request #2882: [HUDI-1806] Honoring skipROSuffix in spark ds

2021-04-26 Thread GitBox


codecov-commenter edited a comment on pull request #2882:
URL: https://github.com/apache/hudi/pull/2882#issuecomment-827054323


   # 
[Codecov](https://codecov.io/gh/apache/hudi/pull/2882?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 Report
   > Merging 
[#2882](https://codecov.io/gh/apache/hudi/pull/2882?src=pr=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (7999033) into 
[master](https://codecov.io/gh/apache/hudi/commit/3e4fa170cfd2c198599c3bed3982f2f643c7fbe8?el=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (3e4fa17) will **increase** coverage by `8.77%`.
   > The diff coverage is `n/a`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/hudi/pull/2882/graphs/tree.svg?width=650=150=pr=VTTXabwbs2_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)](https://codecov.io/gh/apache/hudi/pull/2882?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
   
   ```diff
   @@ Coverage Diff  @@
   ## master#2882  +/-   ##
   
   + Coverage 53.06%   61.84%   +8.77% 
   + Complexity 3746  335-3411 
   
 Files   488   54 -434 
 Lines 23466 1997   -21469 
 Branches   2503  236-2267 
   
   - Hits  12453 1235   -11218 
   + Misses 9914  640-9274 
   + Partials   1099  122 -977 
   ```
   
   | Flag | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | hudicli | `?` | `?` | |
   | hudiclient | `∅ <ø> (∅)` | `0.00 <ø> (ø)` | |
   | hudicommon | `?` | `?` | |
   | hudiflink | `?` | `?` | |
   | hudihadoopmr | `?` | `?` | |
   | hudisparkdatasource | `?` | `?` | |
   | hudisync | `?` | `?` | |
   | huditimelineservice | `?` | `?` | |
   | hudiutilities | `61.84% <ø> (-7.90%)` | `335.00 <ø> (-39.00)` | |
   
   Flags with carried forward coverage won't be shown. [Click 
here](https://docs.codecov.io/docs/carryforward-flags?utm_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#carryforward-flags-in-the-pull-request-comment)
 to find out more.
   
   | [Impacted 
Files](https://codecov.io/gh/apache/hudi/pull/2882?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...ies/exception/HoodieSnapshotExporterException.java](https://codecov.io/gh/apache/hudi/pull/2882/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL2V4Y2VwdGlvbi9Ib29kaWVTbmFwc2hvdEV4cG9ydGVyRXhjZXB0aW9uLmphdmE=)
 | `0.00% <0.00%> (-100.00%)` | `0.00% <0.00%> (-1.00%)` | |
   | 
[.../apache/hudi/utilities/HoodieSnapshotExporter.java](https://codecov.io/gh/apache/hudi/pull/2882/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL0hvb2RpZVNuYXBzaG90RXhwb3J0ZXIuamF2YQ==)
 | `5.17% <0.00%> (-83.63%)` | `0.00% <0.00%> (-28.00%)` | |
   | 
[...hudi/utilities/schema/JdbcbasedSchemaProvider.java](https://codecov.io/gh/apache/hudi/pull/2882/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL3NjaGVtYS9KZGJjYmFzZWRTY2hlbWFQcm92aWRlci5qYXZh)
 | `0.00% <0.00%> (-72.23%)` | `0.00% <0.00%> (-2.00%)` | |
   | 
[...he/hudi/utilities/transform/AWSDmsTransformer.java](https://codecov.io/gh/apache/hudi/pull/2882/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL3RyYW5zZm9ybS9BV1NEbXNUcmFuc2Zvcm1lci5qYXZh)
 | `0.00% <0.00%> (-66.67%)` | `0.00% <0.00%> (-2.00%)` | |
   | 
[...in/java/org/apache/hudi/utilities/UtilHelpers.java](https://codecov.io/gh/apache/hudi/pull/2882/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL1V0aWxIZWxwZXJzLmphdmE=)
 | `40.69% <0.00%> (-23.84%)` | `27.00% <0.00%> (-6.00%)` | |
   | 

[hudi] branch master updated (f4e3b94 -> 9bbb458)

2021-04-26 Thread vinoyang
This is an automated email from the ASF dual-hosted git repository.

vinoyang pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/hudi.git.


from f4e3b94  [HUDI-1742] Improve table level config priority for 
HoodieMultiTableDeltaStreamer (#2744)
 add 9bbb458  [MINOR] Remove redundant method-calling. (#2881)

No new revisions were added by this update.

Summary of changes:
 .../org/apache/hudi/table/action/commit/BaseCommitActionExecutor.java   | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)


[GitHub] [hudi] yanghua merged pull request #2881: [MINOR] Remove redundant method-calling.

2021-04-26 Thread GitBox


yanghua merged pull request #2881:
URL: https://github.com/apache/hudi/pull/2881


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[jira] [Updated] (HUDI-1817) when query incr view of hudi table by using spark-sql. the result is wrong

2021-04-26 Thread sivabalan narayanan (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-1817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sivabalan narayanan updated HUDI-1817:
--
Labels: pull-request-available sev:critical  (was: pull-request-available)

> when query incr view of hudi table by using spark-sql. the result is wrong
> --
>
> Key: HUDI-1817
> URL: https://issues.apache.org/jira/browse/HUDI-1817
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: Hive Integration
>Affects Versions: 0.8.0
> Environment: spark2.4.5   hive 3.1.1   hadoop 3.1.1
>Reporter: tao meng
>Priority: Major
>  Labels: pull-request-available, sev:critical
> Fix For: 0.9.0
>
>
> create hudi table (mor or cow)
>  
> val base_data = spark.read.parquet("/tmp/tb_base")
> val upsert_data = spark.read.parquet("/tmp/tb_upsert")
> base_data.write.format("hudi").option(TABLE_TYPE_OPT_KEY, 
> MOR_TABLE_TYPE_OPT_VAL).option(PRECOMBINE_FIELD_OPT_KEY, 
> "col2").option(RECORDKEY_FIELD_OPT_KEY, 
> "primary_key").option(PARTITIONPATH_FIELD_OPT_KEY, 
> "col0").option(KEYGENERATOR_CLASS_OPT_KEY, 
> "org.apache.hudi.keygen.SimpleKeyGenerator").option(OPERATION_OPT_KEY, 
> "bulk_insert").option(HIVE_SYNC_ENABLED_OPT_KEY, 
> "true").option(HIVE_PARTITION_FIELDS_OPT_KEY, 
> "col0").option(HIVE_PARTITION_EXTRACTOR_CLASS_OPT_KEY, 
> "org.apache.hudi.hive.MultiPartKeysValueExtractor").option(HIVE_DATABASE_OPT_KEY,
>  "testdb").option(HIVE_TABLE_OPT_KEY, 
> "tb_test_mor_par").option(HIVE_USE_JDBC_OPT_KEY, 
> "false").option("hoodie.bulkinsert.shuffle.parallelism", 
> 4).option("hoodie.insert.shuffle.parallelism", 
> 4).option("hoodie.upsert.shuffle.parallelism", 
> 4).option("hoodie.delete.shuffle.parallelism", 
> 4).option("hoodie.datasource.write.hive_style_partitioning", 
> "true").option(TABLE_NAME, 
> "tb_test_mor_par").mode(Overwrite).save(s"/tmp/testdb/tb_test_mor_par")
> upsert_data.write.format("hudi").option(TABLE_TYPE_OPT_KEY, 
> MOR_TABLE_TYPE_OPT_VAL).option(PRECOMBINE_FIELD_OPT_KEY, 
> "col2").option(RECORDKEY_FIELD_OPT_KEY, 
> "primary_key").option(PARTITIONPATH_FIELD_OPT_KEY, 
> "col0").option(KEYGENERATOR_CLASS_OPT_KEY, 
> "org.apache.hudi.keygen.SimpleKeyGenerator").option(OPERATION_OPT_KEY, 
> "upsert").option(HIVE_SYNC_ENABLED_OPT_KEY, 
> "true").option(HIVE_PARTITION_FIELDS_OPT_KEY, 
> "col0").option(HIVE_PARTITION_EXTRACTOR_CLASS_OPT_KEY, 
> "org.apache.hudi.hive.MultiPartKeysValueExtractor").option(HIVE_DATABASE_OPT_KEY,
>  "testdb").option(HIVE_TABLE_OPT_KEY, 
> "tb_test_mor_par").option(HIVE_USE_JDBC_OPT_KEY, 
> "false").option("hoodie.bulkinsert.shuffle.parallelism", 
> 4).option("hoodie.insert.shuffle.parallelism", 
> 4).option("hoodie.upsert.shuffle.parallelism", 
> 4).option("hoodie.delete.shuffle.parallelism", 
> 4).option("hoodie.datasource.write.hive_style_partitioning", 
> "true").option(TABLE_NAME, 
> "tb_test_mor_par").mode(Append).save(s"/tmp/testdb/tb_test_mor_par")
> query incr view by sparksql:
> set hoodie.tb_test_mor_par.consume.mode=INCREMENTAL;
> set hoodie.tb_test_mor_par.consume.start.timestamp=20210420145330;
> set hoodie.tb_test_mor_par.consume.max.commits=3;
> select 
> _hoodie_commit_time,primary_key,col0,col1,col2,col3,col4,col5,col6,col7 from 
> testdb.tb_test_mor_par_rt where _hoodie_commit_time > '20210420145330' order 
> by primary_key;
> +---+---+++++
> |_hoodie_commit_time|primary_key|col0|col1|col6 |col7|
> +---+---+++++
> |20210420155738 |20 |77 |sC |158788760400|739 |
> |20210420155738 |21 |66 |ps |160979049700|61 |
> |20210420155738 |22 |47 |1P |158460042900|835 |
> |20210420155738 |23 |36 |5K |160763480800|538 |
> |20210420155738 |24 |1 |BA |160685711300|775 |
> |20210420155738 |24 |101 |BA |160685711300|775 |
> |20210420155738 |24 |100 |BA |160685711300|775 |
> |20210420155738 |24 |102 |BA |160685711300|775 |
> +---+---+++++
>  
> the primary_key is repeated.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HUDI-1833) pending cluster rollback when there are greater commits

2021-04-26 Thread sivabalan narayanan (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-1833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sivabalan narayanan updated HUDI-1833:
--
Labels: pull-request-available sev:high  (was: pull-request-available)

> pending cluster rollback when there are greater commits
> ---
>
> Key: HUDI-1833
> URL: https://issues.apache.org/jira/browse/HUDI-1833
> Project: Apache Hudi
>  Issue Type: Sub-task
>Reporter: satish
>Assignee: satish
>Priority: Major
>  Labels: pull-request-available, sev:high
>
> pending clustering can be rolled back even if there are greater commits. Fix 
> rollback logic



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HUDI-1845) Exception Throws When Sync Non-Partitioned Table To Hive With MultiPartKeysValueExtractor

2021-04-26 Thread sivabalan narayanan (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-1845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sivabalan narayanan updated HUDI-1845:
--
Labels: pull-request-available sev:critical  (was: pull-request-available)

> Exception Throws When Sync Non-Partitioned Table To Hive With  
> MultiPartKeysValueExtractor 
> ---
>
> Key: HUDI-1845
> URL: https://issues.apache.org/jira/browse/HUDI-1845
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: Hive Integration
>Reporter: pengzhiwei
>Assignee: pengzhiwei
>Priority: Major
>  Labels: pull-request-available, sev:critical
>
> When sync a non-partitioned table to hive with MultiPartKeysValueExtractor, 
> There is an exception will throw out.
> {code:java}
> IllegalArgumentException: Partition key parts [] does not match with 
> partition values{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HUDI-1850) Read on table fails if the first write to table failed

2021-04-26 Thread sivabalan narayanan (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-1850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sivabalan narayanan updated HUDI-1850:
--
Labels: sev:critical sev:high spark  (was: sev:critical sev:triage spark)

> Read on table fails if the first write to table failed
> --
>
> Key: HUDI-1850
> URL: https://issues.apache.org/jira/browse/HUDI-1850
> Project: Apache Hudi
>  Issue Type: Bug
>Affects Versions: 0.8.0
>Reporter: Vaibhav Sinha
>Priority: Major
>  Labels: sev:critical, sev:high, spark
> Attachments: Screenshot 2021-04-24 at 7.53.22 PM.png
>
>
> {code:java}
> ava.util.NoSuchElementException: No value present in Option
>   at org.apache.hudi.common.util.Option.get(Option.java:88) 
> ~[hudi-spark3-bundle_2.12-0.8.0.jar:0.8.0]
>   at 
> org.apache.hudi.common.table.TableSchemaResolver.getTableSchemaFromCommitMetadata(TableSchemaResolver.java:215)
>  ~[hudi-spark3-bundle_2.12-0.8.0.jar:0.8.0]
>   at 
> org.apache.hudi.common.table.TableSchemaResolver.getTableAvroSchema(TableSchemaResolver.java:166)
>  ~[hudi-spark3-bundle_2.12-0.8.0.jar:0.8.0]
>   at 
> org.apache.hudi.common.table.TableSchemaResolver.getTableAvroSchema(TableSchemaResolver.java:155)
>  ~[hudi-spark3-bundle_2.12-0.8.0.jar:0.8.0]
>   at 
> org.apache.hudi.MergeOnReadSnapshotRelation.(MergeOnReadSnapshotRelation.scala:65)
>  ~[hudi-spark3-bundle_2.12-0.8.0.jar:0.8.0]
>   at org.apache.hudi.DefaultSource.createRelation(DefaultSource.scala:99) 
> ~[hudi-spark3-bundle_2.12-0.8.0.jar:0.8.0]
>   at org.apache.hudi.DefaultSource.createRelation(DefaultSource.scala:63) 
> ~[hudi-spark3-bundle_2.12-0.8.0.jar:0.8.0]
>   at 
> org.apache.spark.sql.execution.datasources.DataSource.resolveRelation(DataSource.scala:354)
>  ~[spark-sql_2.12-3.1.1.jar:3.1.1]
>   at 
> org.apache.spark.sql.DataFrameReader.loadV1Source(DataFrameReader.scala:326) 
> ~[spark-sql_2.12-3.1.1.jar:3.1.1]
>   at 
> org.apache.spark.sql.DataFrameReader.$anonfun$load$3(DataFrameReader.scala:308)
>  ~[spark-sql_2.12-3.1.1.jar:3.1.1]
>   at scala.Option.getOrElse(Option.scala:189) 
> ~[scala-library-2.12.10.jar:?]
>   at org.apache.spark.sql.DataFrameReader.load(DataFrameReader.scala:308) 
> ~[spark-sql_2.12-3.1.1.jar:3.1.1]
>   at org.apache.spark.sql.DataFrameReader.load(DataFrameReader.scala:240) 
> ~[spark-sql_2.12-3.1.1.jar:3.1.1]
> {code}
> The screenshot shows the files that got created before the write had failed.
>  
> !Screenshot 2021-04-24 at 7.53.22 PM.png!



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HUDI-1806) Honor "skipROSuffix" in spark ds

2021-04-26 Thread sivabalan narayanan (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-1806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sivabalan narayanan updated HUDI-1806:
--
Labels: pull-request-available sev:normal  (was: pull-request-available 
sev:critical)

> Honor "skipROSuffix" in spark ds
> 
>
> Key: HUDI-1806
> URL: https://issues.apache.org/jira/browse/HUDI-1806
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: Hive Integration
>Reporter: sivabalan narayanan
>Priority: Major
>  Labels: pull-request-available, sev:normal
> Fix For: 0.9.0
>
>   Original Estimate: 1m
>  Remaining Estimate: 1m
>
> In HoodieSparkSqlWriter#buildSyncConfig(), we don't set skipROSuffix based on 
> configs. This needs fixing. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [hudi] cdmikechen commented on issue #2544: [SUPPORT]failed to read timestamp column in version 0.7.0 even when HIVE_SUPPORT_TIMESTAMP is enabled

2021-04-26 Thread GitBox


cdmikechen commented on issue #2544:
URL: https://github.com/apache/hudi/issues/2544#issuecomment-827219576


   @nsivabalan Sure ~ I'll deal with it next week.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[jira] [Comment Edited] (HUDI-1138) Re-implement marker files via timeline server

2021-04-26 Thread Vinoth Chandar (Jira)


[ 
https://issues.apache.org/jira/browse/HUDI-1138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17332798#comment-17332798
 ] 

Vinoth Chandar edited comment on HUDI-1138 at 4/26/21, 10:48 PM:
-

[~balajeeUber] For the issue you faced with tasks still running during 
finalization, we also block any running tasks from creating new files, once a 
commit is about to be finalized. This will be a more general solution that is 
very nice, since we will be using timeline server anyway.

cc [~nagarwal] [~nagarwal] 

[~guoyihua] also expressed interested in doing this btw. 

 

So lets please decide soon, who is going to take this :) 

 


was (Author: vc):
[~balajeeUber] We also block any running tasks from creating new files, once a 
commit is about to be finalized. This will be a more general solution that is 
very nice, since we will be using timeline server anyway.

cc [~nagarwal] [~nagarwal] 

[~guoyihua] also expressed interested in doing this btw. 

 

So lets please decide soon, who is going to take this :) 

 

> Re-implement marker files via timeline server
> -
>
> Key: HUDI-1138
> URL: https://issues.apache.org/jira/browse/HUDI-1138
> Project: Apache Hudi
>  Issue Type: Improvement
>  Components: Writer Core
>Affects Versions: 0.9.0
>Reporter: Vinoth Chandar
>Priority: Blocker
> Fix For: 0.9.0
>
>
> Even as you can argue that RFC-15/consolidated metadata, removes the need for 
> deleting partial files written due to spark task failures/stage retries. It 
> will still leave extra files inside the table (and users will pay for it 
> every month) and we need the marker mechanism to be able to delete these 
> partial files. 
> Here we explore if we can improve the current marker file mechanism, that 
> creates one marker file per data file written, by 
> Delegating the createMarker() call to the driver/timeline server, and have it 
> create marker metadata into a single file handle, that is flushed for 
> durability guarantees
>  
> P.S: I was tempted to think Spark listener mechanism can help us deal with 
> failed tasks, but it has no guarantees. the writer job could die without 
> deleting a partial file. i.e it can improve things, but cant provide 
> guarantees 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (HUDI-1138) Re-implement marker files via timeline server

2021-04-26 Thread Vinoth Chandar (Jira)


[ 
https://issues.apache.org/jira/browse/HUDI-1138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17332798#comment-17332798
 ] 

Vinoth Chandar commented on HUDI-1138:
--

[~balajeeUber] We also block any running tasks from creating new files, once a 
commit is about to be finalized. This will be a more general solution that is 
very nice, since we will be using timeline server anyway.

cc [~nagarwal] [~nagarwal] 

[~guoyihua] also expressed interested in doing this btw. 

 

So lets please decide soon, who is going to take this :) 

 

> Re-implement marker files via timeline server
> -
>
> Key: HUDI-1138
> URL: https://issues.apache.org/jira/browse/HUDI-1138
> Project: Apache Hudi
>  Issue Type: Improvement
>  Components: Writer Core
>Affects Versions: 0.9.0
>Reporter: Vinoth Chandar
>Priority: Blocker
> Fix For: 0.9.0
>
>
> Even as you can argue that RFC-15/consolidated metadata, removes the need for 
> deleting partial files written due to spark task failures/stage retries. It 
> will still leave extra files inside the table (and users will pay for it 
> every month) and we need the marker mechanism to be able to delete these 
> partial files. 
> Here we explore if we can improve the current marker file mechanism, that 
> creates one marker file per data file written, by 
> Delegating the createMarker() call to the driver/timeline server, and have it 
> create marker metadata into a single file handle, that is flushed for 
> durability guarantees
>  
> P.S: I was tempted to think Spark listener mechanism can help us deal with 
> failed tasks, but it has no guarantees. the writer job could die without 
> deleting a partial file. i.e it can improve things, but cant provide 
> guarantees 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HUDI-1850) Read on table fails if the first write to table failed

2021-04-26 Thread Nishith Agarwal (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-1850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nishith Agarwal updated HUDI-1850:
--
Labels: sev:critical sev:triage spark  (was: )

> Read on table fails if the first write to table failed
> --
>
> Key: HUDI-1850
> URL: https://issues.apache.org/jira/browse/HUDI-1850
> Project: Apache Hudi
>  Issue Type: Bug
>Affects Versions: 0.8.0
>Reporter: Vaibhav Sinha
>Priority: Major
>  Labels: sev:critical, sev:triage, spark
> Attachments: Screenshot 2021-04-24 at 7.53.22 PM.png
>
>
> {code:java}
> ava.util.NoSuchElementException: No value present in Option
>   at org.apache.hudi.common.util.Option.get(Option.java:88) 
> ~[hudi-spark3-bundle_2.12-0.8.0.jar:0.8.0]
>   at 
> org.apache.hudi.common.table.TableSchemaResolver.getTableSchemaFromCommitMetadata(TableSchemaResolver.java:215)
>  ~[hudi-spark3-bundle_2.12-0.8.0.jar:0.8.0]
>   at 
> org.apache.hudi.common.table.TableSchemaResolver.getTableAvroSchema(TableSchemaResolver.java:166)
>  ~[hudi-spark3-bundle_2.12-0.8.0.jar:0.8.0]
>   at 
> org.apache.hudi.common.table.TableSchemaResolver.getTableAvroSchema(TableSchemaResolver.java:155)
>  ~[hudi-spark3-bundle_2.12-0.8.0.jar:0.8.0]
>   at 
> org.apache.hudi.MergeOnReadSnapshotRelation.(MergeOnReadSnapshotRelation.scala:65)
>  ~[hudi-spark3-bundle_2.12-0.8.0.jar:0.8.0]
>   at org.apache.hudi.DefaultSource.createRelation(DefaultSource.scala:99) 
> ~[hudi-spark3-bundle_2.12-0.8.0.jar:0.8.0]
>   at org.apache.hudi.DefaultSource.createRelation(DefaultSource.scala:63) 
> ~[hudi-spark3-bundle_2.12-0.8.0.jar:0.8.0]
>   at 
> org.apache.spark.sql.execution.datasources.DataSource.resolveRelation(DataSource.scala:354)
>  ~[spark-sql_2.12-3.1.1.jar:3.1.1]
>   at 
> org.apache.spark.sql.DataFrameReader.loadV1Source(DataFrameReader.scala:326) 
> ~[spark-sql_2.12-3.1.1.jar:3.1.1]
>   at 
> org.apache.spark.sql.DataFrameReader.$anonfun$load$3(DataFrameReader.scala:308)
>  ~[spark-sql_2.12-3.1.1.jar:3.1.1]
>   at scala.Option.getOrElse(Option.scala:189) 
> ~[scala-library-2.12.10.jar:?]
>   at org.apache.spark.sql.DataFrameReader.load(DataFrameReader.scala:308) 
> ~[spark-sql_2.12-3.1.1.jar:3.1.1]
>   at org.apache.spark.sql.DataFrameReader.load(DataFrameReader.scala:240) 
> ~[spark-sql_2.12-3.1.1.jar:3.1.1]
> {code}
> The screenshot shows the files that got created before the write had failed.
>  
> !Screenshot 2021-04-24 at 7.53.22 PM.png!



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [hudi] jiegzhan opened a new issue #2883: [SUPPORT] Does Hudi support flink DataStream API?

2021-04-26 Thread GitBox


jiegzhan opened a new issue #2883:
URL: https://github.com/apache/hudi/issues/2883


   Besides 
[flink-sql-client](https://hudi.apache.org/docs/flink-quick-start-guide.html#setup),
 does hudi support [flink DataStream 
API](https://ci.apache.org/projects/flink/flink-docs-stable/dev/connectors/kafka.html#kafka-consumer)?
   
   ```
   // Read kafka topic with DataStream API
   val props = new Properties()
   props.put("bootstrap.servers", "localhost:9092")
   props.put("group.id", "flink-kafka-consumer")
   
   val consumer = new FlinkKafkaConsumer[String]("device_logs", new 
SimpleStringSchema(), props)
   val stream = senv.addSource(consumer)
   ```
   Given DataStream[String] above ^^^, is there a way to publish it to hudi 
table? If yes, any scala/java code snippet? Thanks for your help.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] codecov-commenter commented on pull request #2882: [HUDI-1806] Honoring skipROSuffix in spark ds

2021-04-26 Thread GitBox


codecov-commenter commented on pull request #2882:
URL: https://github.com/apache/hudi/pull/2882#issuecomment-827054323


   # 
[Codecov](https://codecov.io/gh/apache/hudi/pull/2882?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 Report
   > Merging 
[#2882](https://codecov.io/gh/apache/hudi/pull/2882?src=pr=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (7999033) into 
[master](https://codecov.io/gh/apache/hudi/commit/3e4fa170cfd2c198599c3bed3982f2f643c7fbe8?el=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (3e4fa17) will **increase** coverage by `8.77%`.
   > The diff coverage is `n/a`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/hudi/pull/2882/graphs/tree.svg?width=650=150=pr=VTTXabwbs2_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)](https://codecov.io/gh/apache/hudi/pull/2882?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
   
   ```diff
   @@ Coverage Diff  @@
   ## master#2882  +/-   ##
   
   + Coverage 53.06%   61.84%   +8.77% 
   + Complexity 3746  335-3411 
   
 Files   488   54 -434 
 Lines 23466 1997   -21469 
 Branches   2503  236-2267 
   
   - Hits  12453 1235   -11218 
   + Misses 9914  640-9274 
   + Partials   1099  122 -977 
   ```
   
   | Flag | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | hudicli | `?` | `?` | |
   | hudiclient | `?` | `?` | |
   | hudicommon | `?` | `?` | |
   | hudiflink | `?` | `?` | |
   | hudihadoopmr | `?` | `?` | |
   | hudisparkdatasource | `?` | `?` | |
   | hudisync | `?` | `?` | |
   | huditimelineservice | `?` | `?` | |
   | hudiutilities | `61.84% <ø> (-7.90%)` | `335.00 <ø> (-39.00)` | |
   
   Flags with carried forward coverage won't be shown. [Click 
here](https://docs.codecov.io/docs/carryforward-flags?utm_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#carryforward-flags-in-the-pull-request-comment)
 to find out more.
   
   | [Impacted 
Files](https://codecov.io/gh/apache/hudi/pull/2882?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...ies/exception/HoodieSnapshotExporterException.java](https://codecov.io/gh/apache/hudi/pull/2882/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL2V4Y2VwdGlvbi9Ib29kaWVTbmFwc2hvdEV4cG9ydGVyRXhjZXB0aW9uLmphdmE=)
 | `0.00% <0.00%> (-100.00%)` | `0.00% <0.00%> (-1.00%)` | |
   | 
[.../apache/hudi/utilities/HoodieSnapshotExporter.java](https://codecov.io/gh/apache/hudi/pull/2882/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL0hvb2RpZVNuYXBzaG90RXhwb3J0ZXIuamF2YQ==)
 | `5.17% <0.00%> (-83.63%)` | `0.00% <0.00%> (-28.00%)` | |
   | 
[...hudi/utilities/schema/JdbcbasedSchemaProvider.java](https://codecov.io/gh/apache/hudi/pull/2882/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL3NjaGVtYS9KZGJjYmFzZWRTY2hlbWFQcm92aWRlci5qYXZh)
 | `0.00% <0.00%> (-72.23%)` | `0.00% <0.00%> (-2.00%)` | |
   | 
[...he/hudi/utilities/transform/AWSDmsTransformer.java](https://codecov.io/gh/apache/hudi/pull/2882/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL3RyYW5zZm9ybS9BV1NEbXNUcmFuc2Zvcm1lci5qYXZh)
 | `0.00% <0.00%> (-66.67%)` | `0.00% <0.00%> (-2.00%)` | |
   | 
[...in/java/org/apache/hudi/utilities/UtilHelpers.java](https://codecov.io/gh/apache/hudi/pull/2882/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL1V0aWxIZWxwZXJzLmphdmE=)
 | `40.69% <0.00%> (-23.84%)` | `27.00% <0.00%> (-6.00%)` | |
   | 

[GitHub] [hudi] satishkotha commented on a change in pull request #2809: [HUDI-1789] Support reading older snapshots

2021-04-26 Thread GitBox


satishkotha commented on a change in pull request #2809:
URL: https://github.com/apache/hudi/pull/2809#discussion_r620508454



##
File path: 
hudi-hadoop-mr/src/main/java/org/apache/hudi/hadoop/utils/HoodieHiveUtils.java
##
@@ -122,19 +135,48 @@ public static Path getNthParent(Path path, int n) {
 return result;
   }
 
+  /**
+   * Depending on the configs hoodie.%s.consume.pending.commits and 
hoodie.%s.consume.commit of job
+   *
+   * (hoodie..consume.pending.commits, 
hoodie..consume.commit) ->
+   *  (true, validCommit)   -> returns activeTimeline filtered until 
validCommit
+   *  (true, InValidCommit) -> Raises HoodieIOException
+   *  (true, notSet)-> Raises HoodieIOException
+   *  (false, validCommit)  -> returns compeltedTimeline filtered 
until validCommit
+   *  (false, InValidCommit)-> Raises HoodieIOException
+   *  (false or notSet, notSet) -> returns completedTimeline unfiltered
+   *
+   *  validCommit is one which exists in the timeline being checked and 
vice versa
+   *
+   * @param tableName
+   * @param job
+   * @param metaClient
+   * @return
+   */
   public static HoodieTimeline getTableTimeline(final String tableName, final 
JobConf job, final HoodieTableMetaClient metaClient) {
+HoodieTimeline timeline = 
metaClient.getActiveTimeline().getCommitsTimeline();
+
 boolean includePendingCommits = 
job.getBoolean(String.format(HOODIE_CONSUME_PENDING_COMMITS, tableName), false);
+String maxCommit = job.get(String.format(HOODIE_CONSUME_COMMIT, 
tableName));
+
 if (includePendingCommits) {
-  HoodieTimeline timeline = 
metaClient.getActiveTimeline().getCommitsTimeline();
-  String maxCommit = job.get(String.format(HOODIE_CONSUME_COMMIT, 
tableName));
-  if (maxCommit == null || !timeline.containsInstant(maxCommit)) {
-LOG.info("Timestamp configured for validation: " + maxCommit + " 
commits timeline:" + timeline + " table: " + tableName);
-throw new HoodieIOException("Valid timestamp is required for " + 
HOODIE_CONSUME_COMMIT + " in validate mode");
-  }
-  return timeline.findInstantsBeforeOrEquals(maxCommit);
+  return filterIfInstantExists(tableName, timeline, maxCommit);
+}
+
+timeline = timeline.filterCompletedInstants();
+if (maxCommit != null) {
+  return filterIfInstantExists(tableName, timeline, maxCommit);
 }
 
 // by default return all completed commits.
-return 
metaClient.getActiveTimeline().getCommitsTimeline().filterCompletedInstants();
+return timeline;
+  }
+
+  private static HoodieTimeline filterIfInstantExists(String tableName, 
HoodieTimeline timeline, String maxCommit) {
+if (maxCommit == null || !timeline.containsInstant(maxCommit)) {
+  LOG.info("Timestamp configured for validation: " + maxCommit + " commits 
timeline:" + timeline + " table: " + tableName);

Review comment:
   minor: could you update log message and below error message (This is no 
longer related to validate?)

##
File path: 
hudi-hadoop-mr/src/main/java/org/apache/hudi/hadoop/utils/HoodieInputFormatUtils.java
##
@@ -438,11 +437,20 @@ public static HoodieMetadataConfig 
buildMetadataConfig(Configuration conf) {
 if (LOG.isDebugEnabled()) {
   LOG.debug("Hoodie Metadata initialized with completed commit instant 
as :" + metaClient);
 }
-
 HoodieTimeline timeline = 
HoodieHiveUtils.getTableTimeline(metaClient.getTableConfig().getTableName(), 
job, metaClient);
+

Review comment:
   minor: this file seems to only have new line changes, can we reset this 
file to look like master (unless you have strong opinion on new lines here)?

##
File path: 
hudi-hadoop-mr/src/main/java/org/apache/hudi/hadoop/utils/HoodieHiveUtils.java
##
@@ -122,19 +135,48 @@ public static Path getNthParent(Path path, int n) {
 return result;
   }
 
+  /**
+   * Depending on the configs hoodie.%s.consume.pending.commits and 
hoodie.%s.consume.commit of job
+   *
+   * (hoodie..consume.pending.commits, 
hoodie..consume.commit) ->
+   *  (true, validCommit)   -> returns activeTimeline filtered until 
validCommit
+   *  (true, InValidCommit) -> Raises HoodieIOException
+   *  (true, notSet)-> Raises HoodieIOException
+   *  (false, validCommit)  -> returns compeltedTimeline filtered 
until validCommit
+   *  (false, InValidCommit)-> Raises HoodieIOException
+   *  (false or notSet, notSet) -> returns completedTimeline unfiltered
+   *
+   *  validCommit is one which exists in the timeline being checked and 
vice versa
+   *
+   * @param tableName
+   * @param job
+   * @param metaClient
+   * @return
+   */
   public static HoodieTimeline getTableTimeline(final String tableName, final 
JobConf job, final HoodieTableMetaClient metaClient) {
+HoodieTimeline timeline = 
metaClient.getActiveTimeline().getCommitsTimeline();
+

[jira] [Updated] (HUDI-1806) Honor "skipROSuffix" in spark ds

2021-04-26 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-1806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HUDI-1806:
-
Labels: pull-request-available sev:critical  (was: sev:critical)

> Honor "skipROSuffix" in spark ds
> 
>
> Key: HUDI-1806
> URL: https://issues.apache.org/jira/browse/HUDI-1806
> Project: Apache Hudi
>  Issue Type: Bug
>  Components: Hive Integration
>Reporter: sivabalan narayanan
>Priority: Major
>  Labels: pull-request-available, sev:critical
> Fix For: 0.9.0
>
>   Original Estimate: 1m
>  Remaining Estimate: 1m
>
> In HoodieSparkSqlWriter#buildSyncConfig(), we don't set skipROSuffix based on 
> configs. This needs fixing. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [hudi] nsivabalan opened a new pull request #2882: [HUDI-1806] Honoring skipROSuffix in spark ds

2021-04-26 Thread GitBox


nsivabalan opened a new pull request #2882:
URL: https://github.com/apache/hudi/pull/2882


   # What is the purpose of the pull request
   
   *Honoring skipROSuffix in spark ds
   
   ## Brief change log
   
 - Honoring skipROSuffix in spark ds

   ## Verify this pull request
   
   This change added tests and can be verified as follows:
   
 - Tested via HoodieSparkSqlWriterSuite.Test build sync config for spark sql
   
   ## Committer checklist
   
- [ ] Has a corresponding JIRA in PR title & commit

- [ ] Commit message is descriptive of the change

- [ ] CI is green
   
- [ ] Necessary doc changes done or have another open PR
  
- [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] vinothchandar commented on pull request #2645: [HUDI-1659] Basic Implementation Of Spark Sql Support

2021-04-26 Thread GitBox


vinothchandar commented on pull request #2645:
URL: https://github.com/apache/hudi/pull/2645#issuecomment-826999206


   @pengzhiwei2018 Will re-review this again today. Also have a test env now. 
So will start running some tests as well and report results here.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] codecov-commenter edited a comment on pull request #2881: [Hotfix][minor] Remove redundant method-calling.

2021-04-26 Thread GitBox


codecov-commenter edited a comment on pull request #2881:
URL: https://github.com/apache/hudi/pull/2881#issuecomment-826968742


   # 
[Codecov](https://codecov.io/gh/apache/hudi/pull/2881?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 Report
   > Merging 
[#2881](https://codecov.io/gh/apache/hudi/pull/2881?src=pr=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (4632790) into 
[master](https://codecov.io/gh/apache/hudi/commit/f4e3b949714aceaf8823fd1659e44d3b7e98089a?el=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (f4e3b94) will **increase** coverage by `16.75%`.
   > The diff coverage is `n/a`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/hudi/pull/2881/graphs/tree.svg?width=650=150=pr=VTTXabwbs2_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)](https://codecov.io/gh/apache/hudi/pull/2881?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
   
   ```diff
   @@  Coverage Diff  @@
   ## master#2881   +/-   ##
   =
   + Coverage 52.94%   69.70%   +16.75% 
   + Complexity 3743  374 -3369 
   =
 Files   488   54  -434 
 Lines 23511 1997-21514 
 Branches   2501  236 -2265 
   =
   - Hits  12449 1392-11057 
   + Misses 9965  473 -9492 
   + Partials   1097  132  -965 
   ```
   
   | Flag | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | hudicli | `?` | `?` | |
   | hudiclient | `?` | `?` | |
   | hudicommon | `?` | `?` | |
   | hudiflink | `?` | `?` | |
   | hudihadoopmr | `?` | `?` | |
   | hudisparkdatasource | `?` | `?` | |
   | hudisync | `?` | `?` | |
   | huditimelineservice | `?` | `?` | |
   | hudiutilities | `69.70% <ø> (-0.06%)` | `374.00 <ø> (-1.00)` | |
   
   Flags with carried forward coverage won't be shown. [Click 
here](https://docs.codecov.io/docs/carryforward-flags?utm_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#carryforward-flags-in-the-pull-request-comment)
 to find out more.
   
   | [Impacted 
Files](https://codecov.io/gh/apache/hudi/pull/2881?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...apache/hudi/utilities/deltastreamer/DeltaSync.java](https://codecov.io/gh/apache/hudi/pull/2881/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL2RlbHRhc3RyZWFtZXIvRGVsdGFTeW5jLmphdmE=)
 | `71.08% <0.00%> (-0.35%)` | `55.00% <0.00%> (-1.00%)` | |
   | 
[.../apache/hudi/hadoop/RecordReaderValueIterator.java](https://codecov.io/gh/apache/hudi/pull/2881/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1oYWRvb3AtbXIvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvaGFkb29wL1JlY29yZFJlYWRlclZhbHVlSXRlcmF0b3IuamF2YQ==)
 | | | |
   | 
[...main/java/org/apache/hudi/hive/HiveSyncConfig.java](https://codecov.io/gh/apache/hudi/pull/2881/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1zeW5jL2h1ZGktaGl2ZS1zeW5jL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9odWRpL2hpdmUvSGl2ZVN5bmNDb25maWcuamF2YQ==)
 | | | |
   | 
[...i/common/table/view/FileSystemViewStorageType.java](https://codecov.io/gh/apache/hudi/pull/2881/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1jb21tb24vc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvY29tbW9uL3RhYmxlL3ZpZXcvRmlsZVN5c3RlbVZpZXdTdG9yYWdlVHlwZS5qYXZh)
 | | | |
   | 
[.../hadoop/realtime/RealtimeUnmergedRecordReader.java](https://codecov.io/gh/apache/hudi/pull/2881/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1oYWRvb3AtbXIvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvaGFkb29wL3JlYWx0aW1lL1JlYWx0aW1lVW5tZXJnZWRSZWNvcmRSZWFkZXIuamF2YQ==)
 | | | |
   | 
[.../org/apache/hudi/common/model/HoodieFileGroup.java](https://codecov.io/gh/apache/hudi/pull/2881/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1jb21tb24vc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvY29tbW9uL21vZGVsL0hvb2RpZUZpbGVHcm91cC5qYXZh)
 | | | |
   | 

[GitHub] [hudi] codecov-commenter edited a comment on pull request #2880: [HUDI-1851] Automate suite of tests suites w/ docker and integ test

2021-04-26 Thread GitBox


codecov-commenter edited a comment on pull request #2880:
URL: https://github.com/apache/hudi/pull/2880#issuecomment-826968162


   # 
[Codecov](https://codecov.io/gh/apache/hudi/pull/2880?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 Report
   > Merging 
[#2880](https://codecov.io/gh/apache/hudi/pull/2880?src=pr=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (718fa28) into 
[master](https://codecov.io/gh/apache/hudi/commit/a5789c40673b36d40adab696706acc1446a286f8?el=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (a5789c4) will **decrease** coverage by `0.06%`.
   > The diff coverage is `n/a`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/hudi/pull/2880/graphs/tree.svg?width=650=150=pr=VTTXabwbs2_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)](https://codecov.io/gh/apache/hudi/pull/2880?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
   
   ```diff
   @@ Coverage Diff  @@
   ## master#2880  +/-   ##
   
   - Coverage 53.00%   52.93%   -0.07% 
   + Complexity 3743 3742   -1 
   
 Files   488  488  
 Lines 2343523511  +76 
 Branches   2500 2501   +1 
   
   + Hits  1242212446  +24 
   - Misses 9913 9966  +53 
   + Partials   1100 1099   -1 
   ```
   
   | Flag | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | hudicli | `39.53% <ø> (ø)` | `220.00 <ø> (ø)` | |
   | hudiclient | `∅ <ø> (∅)` | `0.00 <ø> (ø)` | |
   | hudicommon | `50.37% <ø> (-0.29%)` | `1975.00 <ø> (-1.00)` | |
   | hudiflink | `59.45% <ø> (+0.33%)` | `534.00 <ø> (-1.00)` | :arrow_up: |
   | hudihadoopmr | `33.33% <ø> (ø)` | `198.00 <ø> (ø)` | |
   | hudisparkdatasource | `73.33% <ø> (ø)` | `237.00 <ø> (ø)` | |
   | hudisync | `46.10% <ø> (ø)` | `142.00 <ø> (ø)` | |
   | huditimelineservice | `64.36% <ø> (ø)` | `62.00 <ø> (ø)` | |
   | hudiutilities | `69.70% <ø> (+0.01%)` | `374.00 <ø> (+1.00)` | |
   
   Flags with carried forward coverage won't be shown. [Click 
here](https://docs.codecov.io/docs/carryforward-flags?utm_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#carryforward-flags-in-the-pull-request-comment)
 to find out more.
   
   | [Impacted 
Files](https://codecov.io/gh/apache/hudi/pull/2880?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...common/table/log/HoodieMergedLogRecordScanner.java](https://codecov.io/gh/apache/hudi/pull/2880/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1jb21tb24vc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvY29tbW9uL3RhYmxlL2xvZy9Ib29kaWVNZXJnZWRMb2dSZWNvcmRTY2FubmVyLmphdmE=)
 | `82.35% <0.00%> (-5.15%)` | `9.00% <0.00%> (-1.00%)` | |
   | 
[...mmon/table/log/AbstractHoodieLogRecordScanner.java](https://codecov.io/gh/apache/hudi/pull/2880/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1jb21tb24vc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvY29tbW9uL3RhYmxlL2xvZy9BYnN0cmFjdEhvb2RpZUxvZ1JlY29yZFNjYW5uZXIuamF2YQ==)
 | `80.00% <0.00%> (-1.62%)` | `34.00% <0.00%> (ø%)` | |
   | 
[...pache/hudi/metadata/HoodieBackedTableMetadata.java](https://codecov.io/gh/apache/hudi/pull/2880/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1jb21tb24vc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvbWV0YWRhdGEvSG9vZGllQmFja2VkVGFibGVNZXRhZGF0YS5qYXZh)
 | `0.00% <0.00%> (ø)` | `0.00% <0.00%> (ø%)` | |
   | 
[...e/hudi/table/format/mor/MergeOnReadInputSplit.java](https://codecov.io/gh/apache/hudi/pull/2880/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1mbGluay9zcmMvbWFpbi9qYXZhL29yZy9hcGFjaGUvaHVkaS90YWJsZS9mb3JtYXQvbW9yL01lcmdlT25SZWFkSW5wdXRTcGxpdC5qYXZh)
 | `94.44% <0.00%> (ø)` | `8.00% <0.00%> (ø%)` | |
   | 
[...metadata/HoodieMetadataMergedLogRecordScanner.java](https://codecov.io/gh/apache/hudi/pull/2880/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1jb21tb24vc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvbWV0YWRhdGEvSG9vZGllTWV0YWRhdGFNZXJnZWRMb2dSZWNvcmRTY2FubmVyLmphdmE=)
 | `0.00% 

[GitHub] [hudi] codecov-commenter commented on pull request #2881: [Hotfix][minor] Remove redundant method-calling.

2021-04-26 Thread GitBox


codecov-commenter commented on pull request #2881:
URL: https://github.com/apache/hudi/pull/2881#issuecomment-826968742


   # 
[Codecov](https://codecov.io/gh/apache/hudi/pull/2881?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 Report
   > Merging 
[#2881](https://codecov.io/gh/apache/hudi/pull/2881?src=pr=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (4632790) into 
[master](https://codecov.io/gh/apache/hudi/commit/f4e3b949714aceaf8823fd1659e44d3b7e98089a?el=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (f4e3b94) will **decrease** coverage by `43.58%`.
   > The diff coverage is `n/a`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/hudi/pull/2881/graphs/tree.svg?width=650=150=pr=VTTXabwbs2_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)](https://codecov.io/gh/apache/hudi/pull/2881?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
   
   ```diff
   @@ Coverage Diff  @@
   ## master   #2881   +/-   ##
   
   - Coverage 52.94%   9.36%   -43.59% 
   + Complexity 3743  48 -3695 
   
 Files   488  54  -434 
 Lines 235111997-21514 
 Branches   2501 236 -2265 
   
   - Hits  12449 187-12262 
   + Misses 99651797 -8168 
   + Partials   1097  13 -1084 
   ```
   
   | Flag | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | hudicli | `?` | `?` | |
   | hudiclient | `?` | `?` | |
   | hudicommon | `?` | `?` | |
   | hudiflink | `?` | `?` | |
   | hudihadoopmr | `?` | `?` | |
   | hudisparkdatasource | `?` | `?` | |
   | hudisync | `?` | `?` | |
   | huditimelineservice | `?` | `?` | |
   | hudiutilities | `9.36% <ø> (-60.40%)` | `48.00 <ø> (-327.00)` | |
   
   Flags with carried forward coverage won't be shown. [Click 
here](https://docs.codecov.io/docs/carryforward-flags?utm_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#carryforward-flags-in-the-pull-request-comment)
 to find out more.
   
   | [Impacted 
Files](https://codecov.io/gh/apache/hudi/pull/2881?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...va/org/apache/hudi/utilities/IdentitySplitter.java](https://codecov.io/gh/apache/hudi/pull/2881/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL0lkZW50aXR5U3BsaXR0ZXIuamF2YQ==)
 | `0.00% <0.00%> (-100.00%)` | `0.00% <0.00%> (-2.00%)` | |
   | 
[...va/org/apache/hudi/utilities/schema/SchemaSet.java](https://codecov.io/gh/apache/hudi/pull/2881/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL3NjaGVtYS9TY2hlbWFTZXQuamF2YQ==)
 | `0.00% <0.00%> (-100.00%)` | `0.00% <0.00%> (-3.00%)` | |
   | 
[...a/org/apache/hudi/utilities/sources/RowSource.java](https://codecov.io/gh/apache/hudi/pull/2881/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL3NvdXJjZXMvUm93U291cmNlLmphdmE=)
 | `0.00% <0.00%> (-100.00%)` | `0.00% <0.00%> (-4.00%)` | |
   | 
[.../org/apache/hudi/utilities/sources/AvroSource.java](https://codecov.io/gh/apache/hudi/pull/2881/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL3NvdXJjZXMvQXZyb1NvdXJjZS5qYXZh)
 | `0.00% <0.00%> (-100.00%)` | `0.00% <0.00%> (-1.00%)` | |
   | 
[.../org/apache/hudi/utilities/sources/JsonSource.java](https://codecov.io/gh/apache/hudi/pull/2881/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL3NvdXJjZXMvSnNvblNvdXJjZS5qYXZh)
 | `0.00% <0.00%> (-100.00%)` | `0.00% <0.00%> (-1.00%)` | |
   | 

[GitHub] [hudi] codecov-commenter edited a comment on pull request #2880: [HUDI-1851] Automate suite of tests suites w/ docker and integ test

2021-04-26 Thread GitBox


codecov-commenter edited a comment on pull request #2880:
URL: https://github.com/apache/hudi/pull/2880#issuecomment-826968162


   # 
[Codecov](https://codecov.io/gh/apache/hudi/pull/2880?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 Report
   > Merging 
[#2880](https://codecov.io/gh/apache/hudi/pull/2880?src=pr=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (718fa28) into 
[master](https://codecov.io/gh/apache/hudi/commit/a5789c40673b36d40adab696706acc1446a286f8?el=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (a5789c4) will **decrease** coverage by `0.06%`.
   > The diff coverage is `n/a`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/hudi/pull/2880/graphs/tree.svg?width=650=150=pr=VTTXabwbs2_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)](https://codecov.io/gh/apache/hudi/pull/2880?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
   
   ```diff
   @@ Coverage Diff  @@
   ## master#2880  +/-   ##
   
   - Coverage 53.00%   52.93%   -0.07% 
   + Complexity 3743 3742   -1 
   
 Files   488  488  
 Lines 2343523511  +76 
 Branches   2500 2501   +1 
   
   + Hits  1242212446  +24 
   - Misses 9913 9966  +53 
   + Partials   1100 1099   -1 
   ```
   
   | Flag | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | hudicli | `39.53% <ø> (ø)` | `220.00 <ø> (ø)` | |
   | hudiclient | `∅ <ø> (∅)` | `0.00 <ø> (ø)` | |
   | hudicommon | `50.37% <ø> (-0.29%)` | `1975.00 <ø> (-1.00)` | |
   | hudiflink | `59.45% <ø> (+0.33%)` | `534.00 <ø> (-1.00)` | :arrow_up: |
   | hudihadoopmr | `33.33% <ø> (ø)` | `198.00 <ø> (ø)` | |
   | hudisparkdatasource | `73.33% <ø> (ø)` | `237.00 <ø> (ø)` | |
   | hudisync | `46.10% <ø> (ø)` | `142.00 <ø> (ø)` | |
   | huditimelineservice | `64.36% <ø> (ø)` | `62.00 <ø> (ø)` | |
   | hudiutilities | `69.70% <ø> (+0.01%)` | `374.00 <ø> (+1.00)` | |
   
   Flags with carried forward coverage won't be shown. [Click 
here](https://docs.codecov.io/docs/carryforward-flags?utm_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#carryforward-flags-in-the-pull-request-comment)
 to find out more.
   
   | [Impacted 
Files](https://codecov.io/gh/apache/hudi/pull/2880?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...common/table/log/HoodieMergedLogRecordScanner.java](https://codecov.io/gh/apache/hudi/pull/2880/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1jb21tb24vc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvY29tbW9uL3RhYmxlL2xvZy9Ib29kaWVNZXJnZWRMb2dSZWNvcmRTY2FubmVyLmphdmE=)
 | `82.35% <0.00%> (-5.15%)` | `9.00% <0.00%> (-1.00%)` | |
   | 
[...mmon/table/log/AbstractHoodieLogRecordScanner.java](https://codecov.io/gh/apache/hudi/pull/2880/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1jb21tb24vc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvY29tbW9uL3RhYmxlL2xvZy9BYnN0cmFjdEhvb2RpZUxvZ1JlY29yZFNjYW5uZXIuamF2YQ==)
 | `80.00% <0.00%> (-1.62%)` | `34.00% <0.00%> (ø%)` | |
   | 
[...pache/hudi/metadata/HoodieBackedTableMetadata.java](https://codecov.io/gh/apache/hudi/pull/2880/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1jb21tb24vc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvbWV0YWRhdGEvSG9vZGllQmFja2VkVGFibGVNZXRhZGF0YS5qYXZh)
 | `0.00% <0.00%> (ø)` | `0.00% <0.00%> (ø%)` | |
   | 
[...e/hudi/table/format/mor/MergeOnReadInputSplit.java](https://codecov.io/gh/apache/hudi/pull/2880/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1mbGluay9zcmMvbWFpbi9qYXZhL29yZy9hcGFjaGUvaHVkaS90YWJsZS9mb3JtYXQvbW9yL01lcmdlT25SZWFkSW5wdXRTcGxpdC5qYXZh)
 | `94.44% <0.00%> (ø)` | `8.00% <0.00%> (ø%)` | |
   | 
[...metadata/HoodieMetadataMergedLogRecordScanner.java](https://codecov.io/gh/apache/hudi/pull/2880/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1jb21tb24vc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvbWV0YWRhdGEvSG9vZGllTWV0YWRhdGFNZXJnZWRMb2dSZWNvcmRTY2FubmVyLmphdmE=)
 | `0.00% 

[GitHub] [hudi] codecov-commenter commented on pull request #2880: [HUDI-1851] Automate suite of tests suites w/ docker and integ test

2021-04-26 Thread GitBox


codecov-commenter commented on pull request #2880:
URL: https://github.com/apache/hudi/pull/2880#issuecomment-826968162


   # 
[Codecov](https://codecov.io/gh/apache/hudi/pull/2880?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 Report
   > Merging 
[#2880](https://codecov.io/gh/apache/hudi/pull/2880?src=pr=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (718fa28) into 
[master](https://codecov.io/gh/apache/hudi/commit/a5789c40673b36d40adab696706acc1446a286f8?el=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (a5789c4) will **decrease** coverage by `0.06%`.
   > The diff coverage is `n/a`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/hudi/pull/2880/graphs/tree.svg?width=650=150=pr=VTTXabwbs2_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)](https://codecov.io/gh/apache/hudi/pull/2880?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
   
   ```diff
   @@ Coverage Diff  @@
   ## master#2880  +/-   ##
   
   - Coverage 53.00%   52.93%   -0.07% 
   + Complexity 3743 3742   -1 
   
 Files   488  488  
 Lines 2343523511  +76 
 Branches   2500 2501   +1 
   
   + Hits  1242212446  +24 
   - Misses 9913 9966  +53 
   + Partials   1100 1099   -1 
   ```
   
   | Flag | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | hudicli | `39.53% <ø> (ø)` | `220.00 <ø> (ø)` | |
   | hudiclient | `∅ <ø> (∅)` | `0.00 <ø> (ø)` | |
   | hudicommon | `50.37% <ø> (-0.29%)` | `1975.00 <ø> (-1.00)` | |
   | hudiflink | `59.45% <ø> (+0.33%)` | `534.00 <ø> (-1.00)` | :arrow_up: |
   | hudihadoopmr | `33.33% <ø> (ø)` | `198.00 <ø> (ø)` | |
   | hudisparkdatasource | `73.33% <ø> (ø)` | `237.00 <ø> (ø)` | |
   | hudisync | `46.10% <ø> (ø)` | `142.00 <ø> (ø)` | |
   | huditimelineservice | `64.36% <ø> (ø)` | `62.00 <ø> (ø)` | |
   | hudiutilities | `69.70% <ø> (+0.01%)` | `374.00 <ø> (+1.00)` | |
   
   Flags with carried forward coverage won't be shown. [Click 
here](https://docs.codecov.io/docs/carryforward-flags?utm_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#carryforward-flags-in-the-pull-request-comment)
 to find out more.
   
   | [Impacted 
Files](https://codecov.io/gh/apache/hudi/pull/2880?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...common/table/log/HoodieMergedLogRecordScanner.java](https://codecov.io/gh/apache/hudi/pull/2880/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1jb21tb24vc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvY29tbW9uL3RhYmxlL2xvZy9Ib29kaWVNZXJnZWRMb2dSZWNvcmRTY2FubmVyLmphdmE=)
 | `82.35% <0.00%> (-5.15%)` | `9.00% <0.00%> (-1.00%)` | |
   | 
[...mmon/table/log/AbstractHoodieLogRecordScanner.java](https://codecov.io/gh/apache/hudi/pull/2880/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1jb21tb24vc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvY29tbW9uL3RhYmxlL2xvZy9BYnN0cmFjdEhvb2RpZUxvZ1JlY29yZFNjYW5uZXIuamF2YQ==)
 | `80.00% <0.00%> (-1.62%)` | `34.00% <0.00%> (ø%)` | |
   | 
[...pache/hudi/metadata/HoodieBackedTableMetadata.java](https://codecov.io/gh/apache/hudi/pull/2880/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1jb21tb24vc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvbWV0YWRhdGEvSG9vZGllQmFja2VkVGFibGVNZXRhZGF0YS5qYXZh)
 | `0.00% <0.00%> (ø)` | `0.00% <0.00%> (ø%)` | |
   | 
[...e/hudi/table/format/mor/MergeOnReadInputSplit.java](https://codecov.io/gh/apache/hudi/pull/2880/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1mbGluay9zcmMvbWFpbi9qYXZhL29yZy9hcGFjaGUvaHVkaS90YWJsZS9mb3JtYXQvbW9yL01lcmdlT25SZWFkSW5wdXRTcGxpdC5qYXZh)
 | `94.44% <0.00%> (ø)` | `8.00% <0.00%> (ø%)` | |
   | 
[...metadata/HoodieMetadataMergedLogRecordScanner.java](https://codecov.io/gh/apache/hudi/pull/2880/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1jb21tb24vc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvbWV0YWRhdGEvSG9vZGllTWV0YWRhdGFNZXJnZWRMb2dSZWNvcmRTY2FubmVyLmphdmE=)
 | `0.00% <0.00%> 

[GitHub] [hudi] RocMarshal opened a new pull request #2881: [Hotfix][minor] Remove redundant method-calling.

2021-04-26 Thread GitBox


RocMarshal opened a new pull request #2881:
URL: https://github.com/apache/hudi/pull/2881


   ## *Tips*
   - *Thank you very much for contributing to Apache Hudi.*
   - *Please review https://hudi.apache.org/contributing.html before opening a 
pull request.*
   
   ## What is the purpose of the pull request
   
   *(For example: This pull request adds quick-start document.)*
   
   ## Brief change log
   
   *(for example:)*
 - *Modify AnnotationLocation checkstyle rule in checkstyle.xml*
   
   ## Verify this pull request
   
   *(Please pick either of the following options)*
   
   This pull request is a trivial rework / code cleanup without any test 
coverage.
   
   *(or)*
   
   This pull request is already covered by existing tests, such as *(please 
describe tests)*.
   
   (or)
   
   This change added tests and can be verified as follows:
   
   *(example:)*
   
 - *Added integration tests for end-to-end.*
 - *Added HoodieClientWriteTest to verify the change.*
 - *Manually verified the change by running a job locally.*
   
   ## Committer checklist
   
- [ ] Has a corresponding JIRA in PR title & commit

- [ ] Commit message is descriptive of the change

- [ ] CI is green
   
- [ ] Necessary doc changes done or have another open PR
  
- [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] RocMarshal commented on a change in pull request #2822: [Hotfix][hudi-sync] Refactor method up to parent-class

2021-04-26 Thread GitBox


RocMarshal commented on a change in pull request #2822:
URL: https://github.com/apache/hudi/pull/2822#discussion_r620411008



##
File path: 
hudi-sync/hudi-sync-common/src/main/java/org/apache/hudi/sync/common/AbstractSyncHoodieClient.java
##
@@ -29,20 +29,25 @@
 
 import org.apache.hadoop.fs.FileSystem;
 import org.apache.hadoop.fs.Path;
+import org.apache.hudi.common.util.ValidationUtils;
 import org.apache.log4j.LogManager;
 import org.apache.log4j.Logger;
 import org.apache.parquet.schema.MessageType;
 
+import java.io.Serializable;
 import java.sql.ResultSet;
 import java.sql.SQLException;
 import java.sql.Statement;
 import java.util.List;
 import java.util.Map;
+import java.util.Objects;
 
 public abstract class AbstractSyncHoodieClient {
 
   private static final Logger LOG = 
LogManager.getLogger(AbstractSyncHoodieClient.class);
 
+  public static final TypeConverter DECIMAL_TYPE_CONVERTOR = new 
TypeConverter() {};

Review comment:
   IMO, The current default implementation is the default implementation of 
the decimal type convertor. And we would do a special implementation by 
extending `TypeConverter` in the specific scene. So, keeping 
`DECIMAL_TYPE_CONVERTOR` would be a fine choice in the case...




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[jira] [Updated] (HUDI-1851) Automate suite of tests suites w/ docker and integ test

2021-04-26 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HUDI-1851:
-
Labels: pull-request-available  (was: )

> Automate suite of tests suites w/ docker and integ test
> ---
>
> Key: HUDI-1851
> URL: https://issues.apache.org/jira/browse/HUDI-1851
> Project: Apache Hudi
>  Issue Type: Improvement
>  Components: Testing
>Reporter: sivabalan narayanan
>Priority: Major
>  Labels: pull-request-available
>
> Automate suite of tests suites w/ docker and integ test



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [hudi] nsivabalan opened a new pull request #2880: [HUDI-1851] Automate suite of tests suites w/ docker and integ test

2021-04-26 Thread GitBox


nsivabalan opened a new pull request #2880:
URL: https://github.com/apache/hudi/pull/2880


   ## What is the purpose of the pull request
   
   *Automate suite of tests suites w/ docker and integ test
   
   ## Brief change log
   
 - *Added templates and script to automate tests w/ integ test suite job in 
docker*
   
   ## Verify this pull request
   
   Ran locally to verify
   ./generate_test_suite.sh 
   ./generate_test_suite.sh --include_cluster_yaml true
   ./generate_test_suite.sh --num_iterations 50 --intermittent_delay_mins 1 
--table_type MERGE_ON_READ
   
   ## Committer checklist
   
- [ ] Has a corresponding JIRA in PR title & commit

- [ ] Commit message is descriptive of the change

- [ ] CI is green
   
- [ ] Necessary doc changes done or have another open PR
  
- [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[jira] [Created] (HUDI-1850) Read on table fails if the first write to table failed

2021-04-26 Thread Vaibhav Sinha (Jira)
Vaibhav Sinha created HUDI-1850:
---

 Summary: Read on table fails if the first write to table failed
 Key: HUDI-1850
 URL: https://issues.apache.org/jira/browse/HUDI-1850
 Project: Apache Hudi
  Issue Type: Bug
Affects Versions: 0.8.0
Reporter: Vaibhav Sinha
 Attachments: Screenshot 2021-04-24 at 7.53.22 PM.png

{code:java}
ava.util.NoSuchElementException: No value present in Option
at org.apache.hudi.common.util.Option.get(Option.java:88) 
~[hudi-spark3-bundle_2.12-0.8.0.jar:0.8.0]
at 
org.apache.hudi.common.table.TableSchemaResolver.getTableSchemaFromCommitMetadata(TableSchemaResolver.java:215)
 ~[hudi-spark3-bundle_2.12-0.8.0.jar:0.8.0]
at 
org.apache.hudi.common.table.TableSchemaResolver.getTableAvroSchema(TableSchemaResolver.java:166)
 ~[hudi-spark3-bundle_2.12-0.8.0.jar:0.8.0]
at 
org.apache.hudi.common.table.TableSchemaResolver.getTableAvroSchema(TableSchemaResolver.java:155)
 ~[hudi-spark3-bundle_2.12-0.8.0.jar:0.8.0]
at 
org.apache.hudi.MergeOnReadSnapshotRelation.(MergeOnReadSnapshotRelation.scala:65)
 ~[hudi-spark3-bundle_2.12-0.8.0.jar:0.8.0]
at org.apache.hudi.DefaultSource.createRelation(DefaultSource.scala:99) 
~[hudi-spark3-bundle_2.12-0.8.0.jar:0.8.0]
at org.apache.hudi.DefaultSource.createRelation(DefaultSource.scala:63) 
~[hudi-spark3-bundle_2.12-0.8.0.jar:0.8.0]
at 
org.apache.spark.sql.execution.datasources.DataSource.resolveRelation(DataSource.scala:354)
 ~[spark-sql_2.12-3.1.1.jar:3.1.1]
at 
org.apache.spark.sql.DataFrameReader.loadV1Source(DataFrameReader.scala:326) 
~[spark-sql_2.12-3.1.1.jar:3.1.1]
at 
org.apache.spark.sql.DataFrameReader.$anonfun$load$3(DataFrameReader.scala:308) 
~[spark-sql_2.12-3.1.1.jar:3.1.1]
at scala.Option.getOrElse(Option.scala:189) 
~[scala-library-2.12.10.jar:?]
at org.apache.spark.sql.DataFrameReader.load(DataFrameReader.scala:308) 
~[spark-sql_2.12-3.1.1.jar:3.1.1]
at org.apache.spark.sql.DataFrameReader.load(DataFrameReader.scala:240) 
~[spark-sql_2.12-3.1.1.jar:3.1.1]
{code}
The screenshot shows the files that got created before the write had failed.

 

!Screenshot 2021-04-24 at 7.53.22 PM.png!



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [hudi] abhijeetkushe commented on issue #2850: [SUPPORT] S3 files skipped by HoodieDeltaStreamer on s3 bucket in continuous mode

2021-04-26 Thread GitBox


abhijeetkushe commented on issue #2850:
URL: https://github.com/apache/hudi/issues/2850#issuecomment-826883701


   Thanks your help.Closing this issue


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] leesf commented on a change in pull request #2822: [Hotfix][hudi-sync] Refactor method up to parent-class

2021-04-26 Thread GitBox


leesf commented on a change in pull request #2822:
URL: https://github.com/apache/hudi/pull/2822#discussion_r620331369



##
File path: 
hudi-sync/hudi-sync-common/src/main/java/org/apache/hudi/sync/common/AbstractSyncHoodieClient.java
##
@@ -29,20 +29,25 @@
 
 import org.apache.hadoop.fs.FileSystem;
 import org.apache.hadoop.fs.Path;
+import org.apache.hudi.common.util.ValidationUtils;
 import org.apache.log4j.LogManager;
 import org.apache.log4j.Logger;
 import org.apache.parquet.schema.MessageType;
 
+import java.io.Serializable;
 import java.sql.ResultSet;
 import java.sql.SQLException;
 import java.sql.Statement;
 import java.util.List;
 import java.util.Map;
+import java.util.Objects;
 
 public abstract class AbstractSyncHoodieClient {
 
   private static final Logger LOG = 
LogManager.getLogger(AbstractSyncHoodieClient.class);
 
+  public static final TypeConverter DECIMAL_TYPE_CONVERTOR = new 
TypeConverter() {};

Review comment:
   rename to `TYPE_CONVERTOR`?




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[hudi] branch master updated: [HUDI-1742] Improve table level config priority for HoodieMultiTableDeltaStreamer (#2744)

2021-04-26 Thread leesf
This is an automated email from the ASF dual-hosted git repository.

leesf pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/hudi.git


The following commit(s) were added to refs/heads/master by this push:
 new f4e3b94  [HUDI-1742] Improve table level config priority for 
HoodieMultiTableDeltaStreamer (#2744)
f4e3b94 is described below

commit f4e3b949714aceaf8823fd1659e44d3b7e98089a
Author: Nick Young <72905543+nickyoungp...@users.noreply.github.com>
AuthorDate: Mon Apr 26 22:05:06 2021 +0800

[HUDI-1742] Improve table level config priority for 
HoodieMultiTableDeltaStreamer (#2744)
---
 .../deltastreamer/HoodieMultiTableDeltaStreamer.java   |  4 +++-
 .../utilities/functional/TestHoodieDeltaStreamer.java  |  7 +++
 .../functional/TestHoodieMultiTableDeltaStreamer.java  | 18 ++
 .../short_trip_uber_config.properties  |  3 ++-
 4 files changed, 30 insertions(+), 2 deletions(-)

diff --git 
a/hudi-utilities/src/main/java/org/apache/hudi/utilities/deltastreamer/HoodieMultiTableDeltaStreamer.java
 
b/hudi-utilities/src/main/java/org/apache/hudi/utilities/deltastreamer/HoodieMultiTableDeltaStreamer.java
index a39b973..8e557f1 100644
--- 
a/hudi-utilities/src/main/java/org/apache/hudi/utilities/deltastreamer/HoodieMultiTableDeltaStreamer.java
+++ 
b/hudi-utilities/src/main/java/org/apache/hudi/utilities/deltastreamer/HoodieMultiTableDeltaStreamer.java
@@ -118,7 +118,9 @@ public class HoodieMultiTableDeltaStreamer {
   checkIfTableConfigFileExists(configFolder, fs, configFilePath);
   TypedProperties tableProperties = UtilHelpers.readConfig(fs, new 
Path(configFilePath), new ArrayList<>()).getConfig();
   properties.forEach((k, v) -> {
-tableProperties.setProperty(k.toString(), v.toString());
+if (tableProperties.get(k) == null) {
+  tableProperties.setProperty(k.toString(), v.toString());
+}
   });
   final HoodieDeltaStreamer.Config cfg = new HoodieDeltaStreamer.Config();
   //copy all the values from config to cfg
diff --git 
a/hudi-utilities/src/test/java/org/apache/hudi/utilities/functional/TestHoodieDeltaStreamer.java
 
b/hudi-utilities/src/test/java/org/apache/hudi/utilities/functional/TestHoodieDeltaStreamer.java
index 7d4db2c..362a294 100644
--- 
a/hudi-utilities/src/test/java/org/apache/hudi/utilities/functional/TestHoodieDeltaStreamer.java
+++ 
b/hudi-utilities/src/test/java/org/apache/hudi/utilities/functional/TestHoodieDeltaStreamer.java
@@ -1631,6 +1631,13 @@ public class TestHoodieDeltaStreamer extends 
UtilitiesTestBase {
 }
   }
 
+  public static class TestTableLevelGenerator extends SimpleKeyGenerator {
+
+public TestTableLevelGenerator(TypedProperties props) {
+  super(props);
+}
+  }
+
   public static class DummyAvroPayload extends OverwriteWithLatestAvroPayload {
 
 public DummyAvroPayload(GenericRecord gr, Comparable orderingVal) {
diff --git 
a/hudi-utilities/src/test/java/org/apache/hudi/utilities/functional/TestHoodieMultiTableDeltaStreamer.java
 
b/hudi-utilities/src/test/java/org/apache/hudi/utilities/functional/TestHoodieMultiTableDeltaStreamer.java
index 7b5ce9d..17450a0 100644
--- 
a/hudi-utilities/src/test/java/org/apache/hudi/utilities/functional/TestHoodieMultiTableDeltaStreamer.java
+++ 
b/hudi-utilities/src/test/java/org/apache/hudi/utilities/functional/TestHoodieMultiTableDeltaStreamer.java
@@ -213,6 +213,24 @@ public class TestHoodieMultiTableDeltaStreamer extends 
TestHoodieDeltaStreamer {
 }
   }
 
+  @Test
+  public void testTableLevelProperties() throws IOException {
+HoodieMultiTableDeltaStreamer.Config cfg = 
TestHelpers.getConfig(PROPS_FILENAME_TEST_SOURCE1, dfsBasePath + "/config", 
TestDataSource.class.getName(), false);
+HoodieMultiTableDeltaStreamer streamer = new 
HoodieMultiTableDeltaStreamer(cfg, jsc);
+List tableExecutionContexts = 
streamer.getTableExecutionContexts();
+tableExecutionContexts.forEach(tableExecutionContext -> {
+  switch (tableExecutionContext.getTableName()) {
+case "dummy_table_short_trip":
+  String tableLevelKeyGeneratorClass = 
tableExecutionContext.getProperties().getString(DataSourceWriteOptions.KEYGENERATOR_CLASS_OPT_KEY());
+  
assertEquals(TestHoodieDeltaStreamer.TestTableLevelGenerator.class.getName(), 
tableLevelKeyGeneratorClass);
+  break;
+default:
+  String defaultKeyGeneratorClass = 
tableExecutionContext.getProperties().getString(DataSourceWriteOptions.KEYGENERATOR_CLASS_OPT_KEY());
+  assertEquals(TestHoodieDeltaStreamer.TestGenerator.class.getName(), 
defaultKeyGeneratorClass);
+  }
+});
+  }
+
   private String populateCommonPropsAndWriteToFile() throws IOException {
 TypedProperties commonProps = new TypedProperties();
 populateCommonProps(commonProps);
diff --git 
a/hudi-utilities/src/test/resources/delta-streamer-config/short_trip_uber_config.properties
 

[GitHub] [hudi] leesf merged pull request #2744: [HUDI-1742] improve table level config priority for HoodieMultiTableDeltaStreamer

2021-04-26 Thread GitBox


leesf merged pull request #2744:
URL: https://github.com/apache/hudi/pull/2744


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] liujinhui1994 commented on pull request #2710: [HUDI-648][RFC-20] Implement error log/table for Datasource/DeltaStreamer/WriteClient/Compaction writes

2021-04-26 Thread GitBox


liujinhui1994 commented on pull request #2710:
URL: https://github.com/apache/hudi/pull/2710#issuecomment-826771631


   Thanks for the suggestion,I will take the time to deal with


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] codecov-commenter edited a comment on pull request #2875: [HUDI-1841] Tweak the min max commits to keep when setting up cleanin…

2021-04-26 Thread GitBox


codecov-commenter edited a comment on pull request #2875:
URL: https://github.com/apache/hudi/pull/2875#issuecomment-826288360


   # 
[Codecov](https://codecov.io/gh/apache/hudi/pull/2875?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 Report
   > Merging 
[#2875](https://codecov.io/gh/apache/hudi/pull/2875?src=pr=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (3cb3128) into 
[master](https://codecov.io/gh/apache/hudi/commit/3e4fa170cfd2c198599c3bed3982f2f643c7fbe8?el=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (3e4fa17) will **decrease** coverage by `0.08%`.
   > The diff coverage is `90.90%`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/hudi/pull/2875/graphs/tree.svg?width=650=150=pr=VTTXabwbs2_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)](https://codecov.io/gh/apache/hudi/pull/2875?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
   
   ```diff
   @@ Coverage Diff  @@
   ## master#2875  +/-   ##
   
   - Coverage 53.06%   52.98%   -0.09% 
   + Complexity 3746 3743   -3 
   
 Files   488  488  
 Lines 2346623538  +72 
 Branches   2503 2503  
   
   + Hits  1245312471  +18 
   - Misses 9914 9966  +52 
   - Partials   1099 1101   +2 
   ```
   
   | Flag | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | hudicli | `39.53% <ø> (ø)` | `220.00 <ø> (ø)` | |
   | hudiclient | `∅ <ø> (∅)` | `0.00 <ø> (ø)` | |
   | hudicommon | `50.38% <ø> (-0.27%)` | `1975.00 <ø> (-1.00)` | |
   | hudiflink | `59.65% <90.90%> (+0.22%)` | `536.00 <3.00> (-1.00)` | 
:arrow_up: |
   | hudihadoopmr | `33.33% <ø> (ø)` | `198.00 <ø> (ø)` | |
   | hudisparkdatasource | `73.33% <ø> (ø)` | `237.00 <ø> (ø)` | |
   | hudisync | `46.10% <ø> (ø)` | `142.00 <ø> (ø)` | |
   | huditimelineservice | `64.36% <ø> (ø)` | `62.00 <ø> (ø)` | |
   | hudiutilities | `69.68% <ø> (-0.06%)` | `373.00 <ø> (-1.00)` | |
   
   Flags with carried forward coverage won't be shown. [Click 
here](https://docs.codecov.io/docs/carryforward-flags?utm_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#carryforward-flags-in-the-pull-request-comment)
 to find out more.
   
   | [Impacted 
Files](https://codecov.io/gh/apache/hudi/pull/2875?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...java/org/apache/hudi/sink/StreamWriteFunction.java](https://codecov.io/gh/apache/hudi/pull/2875/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1mbGluay9zcmMvbWFpbi9qYXZhL29yZy9hcGFjaGUvaHVkaS9zaW5rL1N0cmVhbVdyaXRlRnVuY3Rpb24uamF2YQ==)
 | `79.24% <76.92%> (-1.68%)` | `24.00 <1.00> (ø)` | |
   | 
[...va/org/apache/hudi/configuration/FlinkOptions.java](https://codecov.io/gh/apache/hudi/pull/2875/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1mbGluay9zcmMvbWFpbi9qYXZhL29yZy9hcGFjaGUvaHVkaS9jb25maWd1cmF0aW9uL0ZsaW5rT3B0aW9ucy5qYXZh)
 | `90.48% <100.00%> (+0.42%)` | `11.00 <0.00> (ø)` | |
   | 
[...java/org/apache/hudi/table/HoodieTableFactory.java](https://codecov.io/gh/apache/hudi/pull/2875/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1mbGluay9zcmMvbWFpbi9qYXZhL29yZy9hcGFjaGUvaHVkaS90YWJsZS9Ib29kaWVUYWJsZUZhY3RvcnkuamF2YQ==)
 | `89.83% <100.00%> (+2.07%)` | `17.00 <2.00> (+2.00)` | |
   | 
[...c/main/java/org/apache/hudi/util/StreamerUtil.java](https://codecov.io/gh/apache/hudi/pull/2875/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1mbGluay9zcmMvbWFpbi9qYXZhL29yZy9hcGFjaGUvaHVkaS91dGlsL1N0cmVhbWVyVXRpbC5qYXZh)
 | `58.33% <100.00%> (+0.70%)` | `18.00 <0.00> (ø)` | |
   | 
[...common/table/log/HoodieMergedLogRecordScanner.java](https://codecov.io/gh/apache/hudi/pull/2875/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1jb21tb24vc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvY29tbW9uL3RhYmxlL2xvZy9Ib29kaWVNZXJnZWRMb2dSZWNvcmRTY2FubmVyLmphdmE=)
 | `82.35% <0.00%> (-5.15%)` | `9.00% <0.00%> (-1.00%)` | |
   | 

[GitHub] [hudi] PavelPetukhov edited a comment on issue #2856: [SUPPORT] Metrics Prometheus pushgateway

2021-04-26 Thread GitBox


PavelPetukhov edited a comment on issue #2856:
URL: https://github.com/apache/hudi/issues/2856#issuecomment-826758533


   Thanks a lot for the reply
   
   I my case I am getting the following error:
   
   1/04/26 13:29:52 WARN prometheus.PushGatewayReporter: Can't push monitoring 
information to pushGateway
   java.net.UnknownHostException: https
at 
java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:184)
at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392)
at java.net.Socket.connect(Socket.java:607)
at sun.net.NetworkClient.doConnect(NetworkClient.java:175)
at sun.net.www.http.HttpClient.openServer(HttpClient.java:463)
at sun.net.www.http.HttpClient.openServer(HttpClient.java:558)
at sun.net.www.http.HttpClient.(HttpClient.java:242)
at sun.net.www.http.HttpClient.New(HttpClient.java:339)
at sun.net.www.http.HttpClient.New(HttpClient.java:357)
at 
sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(HttpURLConnection.java:1226)
at 
sun.net.www.protocol.http.HttpURLConnection.plainConnect0(HttpURLConnection.java:1162)
at 
sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:1056)
at 
sun.net.www.protocol.http.HttpURLConnection.connect(HttpURLConnection.java:990)
at 
io.prometheus.client.exporter.PushGateway.doRequest(PushGateway.java:305)
at 
io.prometheus.client.exporter.PushGateway.pushAdd(PushGateway.java:160)
at 
org.apache.hudi.metrics.prometheus.PushGatewayReporter.report(PushGatewayReporter.java:72)
at 
org.apache.hudi.metrics.prometheus.PushGatewayMetricsReporter.report(PushGatewayMetricsReporter.java:64)
at 
org.apache.hudi.metrics.Metrics.reportAndCloseReporter(Metrics.java:64)
at org.apache.hudi.metrics.Metrics.lambda$new$0(Metrics.java:57)
at java.lang.Thread.run(Thread.java:748)
   
   
   my parameters are
   --hoodie-conf hoodie.metrics.on=true 
   --hoodie-conf hoodie.metrics.reporter.type=PROMETHEUS_PUSHGATEWAY 
   --hoodie-conf hoodie.metrics.pushgateway.host=https://pushgateway_url.com/
   --hoodie-conf hoodie.metrics.pushgateway.port=8080 
   --hoodie-conf hoodie.metrics.pushgateway.job.name=test_job_1 
   --hoodie-conf hoodie.metrics.pushgateway.random.job.name.suffix=false 
   --hoodie-conf hoodie.metrics.pushgateway.delete.on.shutdown=false 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] PavelPetukhov commented on issue #2856: [SUPPORT] Metrics Prometheus pushgateway

2021-04-26 Thread GitBox


PavelPetukhov commented on issue #2856:
URL: https://github.com/apache/hudi/issues/2856#issuecomment-826758533


   Thanks a lot for the reply
   
   I my case I am getting the following error:
   
   1/04/26 13:29:52 WARN prometheus.PushGatewayReporter: Can't push monitoring 
information to pushGateway
   java.net.UnknownHostException: https
at 
java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:184)
at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392)
at java.net.Socket.connect(Socket.java:607)
at sun.net.NetworkClient.doConnect(NetworkClient.java:175)
at sun.net.www.http.HttpClient.openServer(HttpClient.java:463)
at sun.net.www.http.HttpClient.openServer(HttpClient.java:558)
at sun.net.www.http.HttpClient.(HttpClient.java:242)
at sun.net.www.http.HttpClient.New(HttpClient.java:339)
at sun.net.www.http.HttpClient.New(HttpClient.java:357)
at 
sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(HttpURLConnection.java:1226)
at 
sun.net.www.protocol.http.HttpURLConnection.plainConnect0(HttpURLConnection.java:1162)
at 
sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:1056)
at 
sun.net.www.protocol.http.HttpURLConnection.connect(HttpURLConnection.java:990)
at 
io.prometheus.client.exporter.PushGateway.doRequest(PushGateway.java:305)
at 
io.prometheus.client.exporter.PushGateway.pushAdd(PushGateway.java:160)
at 
org.apache.hudi.metrics.prometheus.PushGatewayReporter.report(PushGatewayReporter.java:72)
at 
org.apache.hudi.metrics.prometheus.PushGatewayMetricsReporter.report(PushGatewayMetricsReporter.java:64)
at 
org.apache.hudi.metrics.Metrics.reportAndCloseReporter(Metrics.java:64)
at org.apache.hudi.metrics.Metrics.lambda$new$0(Metrics.java:57)
at java.lang.Thread.run(Thread.java:748)
   
   
   my parameters are
   --hoodie-conf hoodie.metrics.on=true 
   --hoodie-conf hoodie.metrics.reporter.type=PROMETHEUS_PUSHGATEWAY 
   --hoodie-conf hoodie.metrics.pushgateway.host=https://pushgateway_url.com/
   --hoodie-conf hoodie.metrics.pushgateway.port=8081 \
   --hoodie-conf hoodie.metrics.pushgateway.job.name=test_job_1 \
   --hoodie-conf hoodie.metrics.pushgateway.random.job.name.suffix=false \
   --hoodie-conf hoodie.metrics.pushgateway.delete.on.shutdown=false \


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] codecov-commenter edited a comment on pull request #2875: [HUDI-1841] Tweak the min max commits to keep when setting up cleanin…

2021-04-26 Thread GitBox


codecov-commenter edited a comment on pull request #2875:
URL: https://github.com/apache/hudi/pull/2875#issuecomment-826288360


   # 
[Codecov](https://codecov.io/gh/apache/hudi/pull/2875?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 Report
   > Merging 
[#2875](https://codecov.io/gh/apache/hudi/pull/2875?src=pr=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (3cb3128) into 
[master](https://codecov.io/gh/apache/hudi/commit/3e4fa170cfd2c198599c3bed3982f2f643c7fbe8?el=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (3e4fa17) will **increase** coverage by `16.62%`.
   > The diff coverage is `n/a`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/hudi/pull/2875/graphs/tree.svg?width=650=150=pr=VTTXabwbs2_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)](https://codecov.io/gh/apache/hudi/pull/2875?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
   
   ```diff
   @@  Coverage Diff  @@
   ## master#2875   +/-   ##
   =
   + Coverage 53.06%   69.68%   +16.62% 
   + Complexity 3746  373 -3373 
   =
 Files   488   54  -434 
 Lines 23466 1996-21470 
 Branches   2503  236 -2267 
   =
   - Hits  12453 1391-11062 
   + Misses 9914  473 -9441 
   + Partials   1099  132  -967 
   ```
   
   | Flag | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | hudicli | `?` | `?` | |
   | hudiclient | `?` | `?` | |
   | hudicommon | `?` | `?` | |
   | hudiflink | `?` | `?` | |
   | hudihadoopmr | `?` | `?` | |
   | hudisparkdatasource | `?` | `?` | |
   | hudisync | `?` | `?` | |
   | huditimelineservice | `?` | `?` | |
   | hudiutilities | `69.68% <ø> (-0.06%)` | `373.00 <ø> (-1.00)` | |
   
   Flags with carried forward coverage won't be shown. [Click 
here](https://docs.codecov.io/docs/carryforward-flags?utm_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#carryforward-flags-in-the-pull-request-comment)
 to find out more.
   
   | [Impacted 
Files](https://codecov.io/gh/apache/hudi/pull/2875?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...apache/hudi/utilities/deltastreamer/DeltaSync.java](https://codecov.io/gh/apache/hudi/pull/2875/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL2RlbHRhc3RyZWFtZXIvRGVsdGFTeW5jLmphdmE=)
 | `71.08% <0.00%> (-0.35%)` | `55.00% <0.00%> (-1.00%)` | |
   | 
[...odie/hadoop/hive/HoodieCombineHiveInputFormat.java](https://codecov.io/gh/apache/hudi/pull/2875/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1oYWRvb3AtbXIvc3JjL21haW4vamF2YS9jb20vdWJlci9ob29kaWUvaGFkb29wL2hpdmUvSG9vZGllQ29tYmluZUhpdmVJbnB1dEZvcm1hdC5qYXZh)
 | | | |
   | 
[.../hudi/table/format/mor/MergeOnReadInputFormat.java](https://codecov.io/gh/apache/hudi/pull/2875/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1mbGluay9zcmMvbWFpbi9qYXZhL29yZy9hcGFjaGUvaHVkaS90YWJsZS9mb3JtYXQvbW9yL01lcmdlT25SZWFkSW5wdXRGb3JtYXQuamF2YQ==)
 | | | |
   | 
[...til/jvm/HotSpotMemoryLayoutSpecification32bit.java](https://codecov.io/gh/apache/hudi/pull/2875/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1jb21tb24vc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvY29tbW9uL3V0aWwvanZtL0hvdFNwb3RNZW1vcnlMYXlvdXRTcGVjaWZpY2F0aW9uMzJiaXQuamF2YQ==)
 | | | |
   | 
[...in/java/org/apache/hudi/common/model/BaseFile.java](https://codecov.io/gh/apache/hudi/pull/2875/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1jb21tb24vc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvY29tbW9uL21vZGVsL0Jhc2VGaWxlLmphdmE=)
 | | | |
   | 
[...rg/apache/hudi/sink/utils/CoordinatorExecutor.java](https://codecov.io/gh/apache/hudi/pull/2875/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS1mbGluay9zcmMvbWFpbi9qYXZhL29yZy9hcGFjaGUvaHVkaS9zaW5rL3V0aWxzL0Nvb3JkaW5hdG9yRXhlY3V0b3IuamF2YQ==)
 | | | |
   | 

[jira] [Assigned] (HUDI-1849) Remove the unit from the Flink sql options

2021-04-26 Thread Jira


 [ 
https://issues.apache.org/jira/browse/HUDI-1849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

谢波 reassigned HUDI-1849:


Assignee: 谢波

> Remove the unit from the Flink sql options
> --
>
> Key: HUDI-1849
> URL: https://issues.apache.org/jira/browse/HUDI-1849
> Project: Apache Hudi
>  Issue Type: Improvement
>  Components: Flink Integration
>Reporter: Danny Chen
>Assignee: 谢波
>Priority: Major
> Fix For: 0.9.0
>
>
> Remove the unit suffix because it is verbose.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [hudi] aditiwari01 commented on issue #2802: Hive read issues when different partition have different schemas.

2021-04-26 Thread GitBox


aditiwari01 commented on issue #2802:
URL: https://github.com/apache/hudi/issues/2802#issuecomment-826671636


   Have quite a busy sprint. I'll try to get back. If not, I'll update by 
weekend.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [hudi] codecov-commenter edited a comment on pull request #2875: [HUDI-1841] Tweak the min max commits to keep when setting up cleanin…

2021-04-26 Thread GitBox


codecov-commenter edited a comment on pull request #2875:
URL: https://github.com/apache/hudi/pull/2875#issuecomment-826288360


   # 
[Codecov](https://codecov.io/gh/apache/hudi/pull/2875?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 Report
   > Merging 
[#2875](https://codecov.io/gh/apache/hudi/pull/2875?src=pr=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (371c2e9) into 
[master](https://codecov.io/gh/apache/hudi/commit/3e4fa170cfd2c198599c3bed3982f2f643c7fbe8?el=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (3e4fa17) will **decrease** coverage by `43.69%`.
   > The diff coverage is `n/a`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/hudi/pull/2875/graphs/tree.svg?width=650=150=pr=VTTXabwbs2_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)](https://codecov.io/gh/apache/hudi/pull/2875?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
   
   ```diff
   @@ Coverage Diff  @@
   ## master   #2875   +/-   ##
   
   - Coverage 53.06%   9.36%   -43.70% 
   + Complexity 3746  48 -3698 
   
 Files   488  54  -434 
 Lines 234661996-21470 
 Branches   2503 236 -2267 
   
   - Hits  12453 187-12266 
   + Misses 99141796 -8118 
   + Partials   1099  13 -1086 
   ```
   
   | Flag | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | hudicli | `?` | `?` | |
   | hudiclient | `?` | `?` | |
   | hudicommon | `?` | `?` | |
   | hudiflink | `?` | `?` | |
   | hudihadoopmr | `?` | `?` | |
   | hudisparkdatasource | `?` | `?` | |
   | hudisync | `?` | `?` | |
   | huditimelineservice | `?` | `?` | |
   | hudiutilities | `9.36% <ø> (-60.38%)` | `48.00 <ø> (-326.00)` | |
   
   Flags with carried forward coverage won't be shown. [Click 
here](https://docs.codecov.io/docs/carryforward-flags?utm_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#carryforward-flags-in-the-pull-request-comment)
 to find out more.
   
   | [Impacted 
Files](https://codecov.io/gh/apache/hudi/pull/2875?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...va/org/apache/hudi/utilities/IdentitySplitter.java](https://codecov.io/gh/apache/hudi/pull/2875/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL0lkZW50aXR5U3BsaXR0ZXIuamF2YQ==)
 | `0.00% <0.00%> (-100.00%)` | `0.00% <0.00%> (-2.00%)` | |
   | 
[...va/org/apache/hudi/utilities/schema/SchemaSet.java](https://codecov.io/gh/apache/hudi/pull/2875/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL3NjaGVtYS9TY2hlbWFTZXQuamF2YQ==)
 | `0.00% <0.00%> (-100.00%)` | `0.00% <0.00%> (-3.00%)` | |
   | 
[...a/org/apache/hudi/utilities/sources/RowSource.java](https://codecov.io/gh/apache/hudi/pull/2875/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL3NvdXJjZXMvUm93U291cmNlLmphdmE=)
 | `0.00% <0.00%> (-100.00%)` | `0.00% <0.00%> (-4.00%)` | |
   | 
[.../org/apache/hudi/utilities/sources/AvroSource.java](https://codecov.io/gh/apache/hudi/pull/2875/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL3NvdXJjZXMvQXZyb1NvdXJjZS5qYXZh)
 | `0.00% <0.00%> (-100.00%)` | `0.00% <0.00%> (-1.00%)` | |
   | 
[.../org/apache/hudi/utilities/sources/JsonSource.java](https://codecov.io/gh/apache/hudi/pull/2875/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL3NvdXJjZXMvSnNvblNvdXJjZS5qYXZh)
 | `0.00% <0.00%> (-100.00%)` | `0.00% <0.00%> (-1.00%)` | |
   | 

[GitHub] [hudi] codecov-commenter commented on pull request #2879: [HUDI-1848] Adding support for HMS for running DDL queries in hive-sy…

2021-04-26 Thread GitBox


codecov-commenter commented on pull request #2879:
URL: https://github.com/apache/hudi/pull/2879#issuecomment-82454


   # 
[Codecov](https://codecov.io/gh/apache/hudi/pull/2879?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 Report
   > Merging 
[#2879](https://codecov.io/gh/apache/hudi/pull/2879?src=pr=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (56a2a21) into 
[master](https://codecov.io/gh/apache/hudi/commit/a5789c40673b36d40adab696706acc1446a286f8?el=desc_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 (a5789c4) will **decrease** coverage by `43.63%`.
   > The diff coverage is `n/a`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/hudi/pull/2879/graphs/tree.svg?width=650=150=pr=VTTXabwbs2_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)](https://codecov.io/gh/apache/hudi/pull/2879?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
   
   ```diff
   @@ Coverage Diff  @@
   ## master   #2879   +/-   ##
   
   - Coverage 53.00%   9.36%   -43.64% 
   + Complexity 3743  48 -3695 
   
 Files   488  54  -434 
 Lines 234351996-21439 
 Branches   2500 236 -2264 
   
   - Hits  12422 187-12235 
   + Misses 99131796 -8117 
   + Partials   1100  13 -1087 
   ```
   
   | Flag | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | hudicli | `?` | `?` | |
   | hudiclient | `?` | `?` | |
   | hudicommon | `?` | `?` | |
   | hudiflink | `?` | `?` | |
   | hudihadoopmr | `?` | `?` | |
   | hudisparkdatasource | `?` | `?` | |
   | hudisync | `?` | `?` | |
   | huditimelineservice | `?` | `?` | |
   | hudiutilities | `9.36% <ø> (-60.33%)` | `48.00 <ø> (-325.00)` | |
   
   Flags with carried forward coverage won't be shown. [Click 
here](https://docs.codecov.io/docs/carryforward-flags?utm_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#carryforward-flags-in-the-pull-request-comment)
 to find out more.
   
   | [Impacted 
Files](https://codecov.io/gh/apache/hudi/pull/2879?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)
 | Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...va/org/apache/hudi/utilities/IdentitySplitter.java](https://codecov.io/gh/apache/hudi/pull/2879/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL0lkZW50aXR5U3BsaXR0ZXIuamF2YQ==)
 | `0.00% <0.00%> (-100.00%)` | `0.00% <0.00%> (-2.00%)` | |
   | 
[...va/org/apache/hudi/utilities/schema/SchemaSet.java](https://codecov.io/gh/apache/hudi/pull/2879/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL3NjaGVtYS9TY2hlbWFTZXQuamF2YQ==)
 | `0.00% <0.00%> (-100.00%)` | `0.00% <0.00%> (-3.00%)` | |
   | 
[...a/org/apache/hudi/utilities/sources/RowSource.java](https://codecov.io/gh/apache/hudi/pull/2879/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL3NvdXJjZXMvUm93U291cmNlLmphdmE=)
 | `0.00% <0.00%> (-100.00%)` | `0.00% <0.00%> (-4.00%)` | |
   | 
[.../org/apache/hudi/utilities/sources/AvroSource.java](https://codecov.io/gh/apache/hudi/pull/2879/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL3NvdXJjZXMvQXZyb1NvdXJjZS5qYXZh)
 | `0.00% <0.00%> (-100.00%)` | `0.00% <0.00%> (-1.00%)` | |
   | 
[.../org/apache/hudi/utilities/sources/JsonSource.java](https://codecov.io/gh/apache/hudi/pull/2879/diff?src=pr=tree_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation#diff-aHVkaS11dGlsaXRpZXMvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2h1ZGkvdXRpbGl0aWVzL3NvdXJjZXMvSnNvblNvdXJjZS5qYXZh)
 | `0.00% <0.00%> (-100.00%)` | `0.00% <0.00%> (-1.00%)` | |
   | 

[jira] [Closed] (HUDI-1837) Add optional instant range to log record scanner for log block filtering

2021-04-26 Thread vinoyang (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-1837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

vinoyang closed HUDI-1837.
--
Resolution: Implemented

d047e91d86d6905c0f66dbdb24f2e9ecd3684430

> Add optional instant range to log record scanner for log  block filtering
> -
>
> Key: HUDI-1837
> URL: https://issues.apache.org/jira/browse/HUDI-1837
> Project: Apache Hudi
>  Issue Type: Improvement
>  Components: Flink Integration
>Reporter: Danny Chen
>Assignee: Danny Chen
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.9.0
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HUDI-1837) Add optional instant range to log record scanner for log block filtering

2021-04-26 Thread vinoyang (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-1837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

vinoyang updated HUDI-1837:
---
Fix Version/s: 0.9.0

> Add optional instant range to log record scanner for log  block filtering
> -
>
> Key: HUDI-1837
> URL: https://issues.apache.org/jira/browse/HUDI-1837
> Project: Apache Hudi
>  Issue Type: Improvement
>  Components: Flink Integration
>Reporter: Danny Chen
>Assignee: Danny Chen
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.9.0
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[hudi] branch master updated (3e4fa17 -> d047e91)

2021-04-26 Thread vinoyang
This is an automated email from the ASF dual-hosted git repository.

vinoyang pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/hudi.git.


from 3e4fa17  [HUDI-1835] Fixing kafka native config param for auto offset 
reset (#2864)
 add d047e91  [HUDI-1837] Add optional instant range to log record scanner 
for log (#2870)

No new revisions were added by this update.

Summary of changes:
 ...SparkExecuteClusteringCommitActionExecutor.java | 17 +++--
 .../table/log/AbstractHoodieLogRecordScanner.java  | 18 -
 .../table/log/HoodieMergedLogRecordScanner.java| 52 --
 .../table/log/HoodieUnMergedLogRecordScanner.java  |  8 +--
 .../hudi/common/table/log}/InstantRange.java   |  4 +-
 .../hudi/metadata/HoodieBackedTableMetadata.java   | 12 +++-
 .../HoodieMetadataMergedLogRecordScanner.java  | 79 +-
 .../apache/hudi/configuration/FlinkOptions.java|  6 ++
 .../hudi/source/StreamReadMonitoringFunction.java  |  2 +-
 .../org/apache/hudi/table/format/FormatUtils.java  |  1 +
 .../table/format/mor/MergeOnReadInputFormat.java   | 12 
 .../table/format/mor/MergeOnReadInputSplit.java|  1 +
 .../java/org/apache/hudi/util/StreamerUtil.java|  1 +
 13 files changed, 159 insertions(+), 54 deletions(-)
 rename {hudi-flink/src/main/java/org/apache/hudi/table/format/mor => 
hudi-common/src/main/java/org/apache/hudi/common/table/log}/InstantRange.java 
(97%)


[GitHub] [hudi] yanghua merged pull request #2870: [HUDI-1837] Add optional instant range to log record scanner for log block filtering

2021-04-26 Thread GitBox


yanghua merged pull request #2870:
URL: https://github.com/apache/hudi/pull/2870


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[jira] [Created] (HUDI-1849) Remove the unit from the Flink sql options

2021-04-26 Thread Danny Chen (Jira)
Danny Chen created HUDI-1849:


 Summary: Remove the unit from the Flink sql options
 Key: HUDI-1849
 URL: https://issues.apache.org/jira/browse/HUDI-1849
 Project: Apache Hudi
  Issue Type: Improvement
  Components: Flink Integration
Reporter: Danny Chen
 Fix For: 0.9.0


Remove the unit suffix because it is verbose.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


  1   2   >