lw309637554 commented on a change in pull request #2275:
URL: https://github.com/apache/hudi/pull/2275#discussion_r541677201
##
File path:
hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/table/action/commit/BaseSparkCommitActionExecutor.java
##
@@ -103,6 +104,9 @@
lw309637554 commented on a change in pull request #2275:
URL: https://github.com/apache/hudi/pull/2275#discussion_r541675903
##
File path:
hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/table/action/commit/BaseSparkCommitActionExecutor.java
##
@@ -103,6 +104,9 @@
sumihehe opened a new issue #2346:
URL: https://github.com/apache/hudi/issues/2346
Hi all,
The rt view query returns a wrong result with predicate push down.
This is my query on a rt view of MOR table:
select count(1) from ***mor_rt where platform = "HYLOOP" and
[
https://issues.apache.org/jira/browse/HUDI-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17251538#comment-17251538
]
Mani Jindal commented on HUDI-1463:
---
Great Idea [~vinoth] ya i would be interested to do that let me
kingkongpoon opened a new issue #2345:
URL: https://github.com/apache/hudi/issues/2345
When I use hudi-0.6.0, I find that the option PRECOMBINE_FIELD_OPT_KEY is
useless ?
I want to use a rt table to update my data by it's timestamp (ts)
### Test Data filename a.csv
codecov-io edited a comment on pull request #2263:
URL: https://github.com/apache/hudi/pull/2263#issuecomment-730664726
This is an automated message from the Apache Git Service.
To respond to the message, please log on to
codecov-io edited a comment on pull request #2263:
URL: https://github.com/apache/hudi/pull/2263#issuecomment-730664726
# [Codecov](https://codecov.io/gh/apache/hudi/pull/2263?src=pr=h1) Report
> Merging
[#2263](https://codecov.io/gh/apache/hudi/pull/2263?src=pr=desc) (039e6a5)
into
codecov-io edited a comment on pull request #2263:
URL: https://github.com/apache/hudi/pull/2263#issuecomment-730664726
# [Codecov](https://codecov.io/gh/apache/hudi/pull/2263?src=pr=h1) Report
> Merging
[#2263](https://codecov.io/gh/apache/hudi/pull/2263?src=pr=desc) (039e6a5)
into
satishkotha commented on pull request #2263:
URL: https://github.com/apache/hudi/pull/2263#issuecomment-747852533
> This is great, thanks @satishkotha !
>
> I have completed a first pass. Don't have major concerns. May be we can
work through these initial comments, as I complete the
satishkotha commented on pull request #2263:
URL: https://github.com/apache/hudi/pull/2263#issuecomment-747850886
> @satishkotha
>
> > After clustering all new records have a new commit time. I'm trying to
see if it's possible to preserve commit_time from original file to support
satishkotha commented on a change in pull request #2263:
URL: https://github.com/apache/hudi/pull/2263#discussion_r54517
##
File path:
hudi-common/src/main/java/org/apache/hudi/common/util/FileSliceUtils.java
##
@@ -0,0 +1,67 @@
+/*
+ * Licensed to the Apache Software
satishkotha commented on a change in pull request #2263:
URL: https://github.com/apache/hudi/pull/2263#discussion_r54370
##
File path:
hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/action/compact/strategy/LogFileSizeBasedCompactionStrategy.java
##
@@
satishkotha commented on a change in pull request #2263:
URL: https://github.com/apache/hudi/pull/2263#discussion_r545554989
##
File path:
hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/action/commit/AbstractBulkInsertHelper.java
##
@@ -27,8 +27,21 @@
satishkotha commented on a change in pull request #2263:
URL: https://github.com/apache/hudi/pull/2263#discussion_r545554841
##
File path:
hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/action/cluster/strategy/ScheduleClusteringStrategy.java
##
@@ -0,0
satishkotha commented on a change in pull request #2263:
URL: https://github.com/apache/hudi/pull/2263#discussion_r545554494
##
File path:
hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/action/cluster/strategy/ScheduleClusteringStrategy.java
##
@@ -0,0
satishkotha commented on a change in pull request #2263:
URL: https://github.com/apache/hudi/pull/2263#discussion_r545554378
##
File path:
umehrot2 commented on a change in pull request #2343:
URL: https://github.com/apache/hudi/pull/2343#discussion_r545546646
##
File path:
hudi-client/src/main/java/org/apache/hudi/metadata/HoodieBackedTableMetadataWriter.java
##
@@ -369,10 +343,56 @@ private void
satishkotha commented on a change in pull request #2263:
URL: https://github.com/apache/hudi/pull/2263#discussion_r545554043
##
File path:
satishkotha commented on a change in pull request #2263:
URL: https://github.com/apache/hudi/pull/2263#discussion_r545553782
##
File path:
hudi-client/hudi-client-common/src/main/java/org/apache/hudi/metrics/HoodieMetrics.java
##
@@ -48,6 +49,7 @@
private Timer
codecov-io edited a comment on pull request #2311:
URL: https://github.com/apache/hudi/pull/2311#issuecomment-741799295
This is an automated message from the Apache Git Service.
To respond to the message, please log on to
codecov-io edited a comment on pull request #2311:
URL: https://github.com/apache/hudi/pull/2311#issuecomment-741799295
# [Codecov](https://codecov.io/gh/apache/hudi/pull/2311?src=pr=h1) Report
> Merging
[#2311](https://codecov.io/gh/apache/hudi/pull/2311?src=pr=desc) (d932723)
into
codecov-io commented on pull request #2344:
URL: https://github.com/apache/hudi/pull/2344#issuecomment-747810064
# [Codecov](https://codecov.io/gh/apache/hudi/pull/2344?src=pr=h1) Report
> Merging
[#2344](https://codecov.io/gh/apache/hudi/pull/2344?src=pr=desc) (3649a24)
into
rmpifer commented on a change in pull request #2342:
URL: https://github.com/apache/hudi/pull/2342#discussion_r545511401
##
File path: hudi-client/src/main/java/org/apache/hudi/table/HoodieTable.java
##
@@ -635,9 +635,9 @@ public boolean requireSortedRecords() {
return
umehrot2 merged pull request #2326:
URL: https://github.com/apache/hudi/pull/2326
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to
This is an automated email from the ASF dual-hosted git repository.
uditme pushed a commit to branch rfc-15
in repository https://gitbox.apache.org/repos/asf/hudi.git
The following commit(s) were added to refs/heads/rfc-15 by this push:
new 3cab54e Use metadata table for listing in
umehrot2 commented on a change in pull request #2326:
URL: https://github.com/apache/hudi/pull/2326#discussion_r545510206
##
File path:
hudi-spark/src/test/scala/org/apache/hudi/functional/TestCOWDataSource.scala
##
@@ -194,4 +195,42 @@ class TestCOWDataSource extends
umehrot2 commented on a change in pull request #2326:
URL: https://github.com/apache/hudi/pull/2326#discussion_r545508178
##
File path:
hudi-spark/src/test/scala/org/apache/hudi/functional/TestCOWDataSource.scala
##
@@ -194,4 +195,42 @@ class TestCOWDataSource extends
umehrot2 commented on a change in pull request #2326:
URL: https://github.com/apache/hudi/pull/2326#discussion_r545508037
##
File path:
hudi-common/src/main/java/org/apache/hudi/common/config/HoodieMetadataConfig.java
##
@@ -128,24 +128,23 @@ public Builder retainCommits(int
codecov-io edited a comment on pull request #2168:
URL: https://github.com/apache/hudi/pull/2168#issuecomment-706649760
# [Codecov](https://codecov.io/gh/apache/hudi/pull/2168?src=pr=h1) Report
> Merging
[#2168](https://codecov.io/gh/apache/hudi/pull/2168?src=pr=desc) (e6f76b0)
into
[
https://issues.apache.org/jira/browse/HUDI-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated HUDI-1470:
-
Labels: pull-request-available (was: )
> Hudi-test-suite - DFSHoodieDatasetInputReader.java -
nbalajee opened a new pull request #2344:
URL: https://github.com/apache/hudi/pull/2344
## What is the purpose of the pull request
When hudi-test-suite is reading records from the existing parquet files, it
is using the reader schema (original schema used to write the parquet file).
Balajee Nagasubramaniam created HUDI-1470:
-
Summary: Hudi-test-suite - DFSHoodieDatasetInputReader.java - Use
the latest writer schema, when reading the parquet files.
Key: HUDI-1470
URL:
bvaradar commented on issue #2076:
URL: https://github.com/apache/hudi/issues/2076#issuecomment-747781069
@AnweshaSen : Can you create a new github issue with full exception stack
trace and all the configurations that you are passing.
[
https://issues.apache.org/jira/browse/HUDI-1399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17251393#comment-17251393
]
Vinoth Chandar commented on HUDI-1399:
--
yes starting with 1 and 2 sounds good to me.
Followed by 4
[
https://issues.apache.org/jira/browse/HUDI-1399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17251392#comment-17251392
]
Vinoth Chandar commented on HUDI-1399:
--
yes love to get a release out by end of year. There is a lot
[
https://issues.apache.org/jira/browse/HUDI-1455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17251390#comment-17251390
]
Vinoth Chandar commented on HUDI-1455:
--
Thanks for the information Ryan. Let me process this and come
This is an automated email from the ASF dual-hosted git repository.
vinoth pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/hudi.git.
from 14d5d11 [HUDI-1406] Add date partition based source input selector
for Delta streamer (#2264)
add 8b5d6f9
vinothchandar merged pull request #2322:
URL: https://github.com/apache/hudi/pull/2322
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go
vinothchandar commented on a change in pull request #2332:
URL: https://github.com/apache/hudi/pull/2332#discussion_r545467367
##
File path:
hudi-client/src/main/java/org/apache/hudi/client/HoodieWriteClient.java
##
@@ -701,8 +704,6 @@ protected void
vinothchandar commented on a change in pull request #2332:
URL: https://github.com/apache/hudi/pull/2332#discussion_r545466691
##
File path:
hudi-client/src/main/java/org/apache/hudi/metadata/HoodieBackedTableMetadataWriter.java
##
@@ -725,6 +698,13 @@ private synchronized
umehrot2 commented on pull request #2343:
URL: https://github.com/apache/hudi/pull/2343#issuecomment-747755598
Its much needed. We should have done this for Hudi in general long time back.
This is an automated message from
codecov-io edited a comment on pull request #2343:
URL: https://github.com/apache/hudi/pull/2343#issuecomment-747754631
# [Codecov](https://codecov.io/gh/apache/hudi/pull/2343?src=pr=h1) Report
> Merging
[#2343](https://codecov.io/gh/apache/hudi/pull/2343?src=pr=desc) (374bbb6)
into
codecov-io edited a comment on pull request #2343:
URL: https://github.com/apache/hudi/pull/2343#issuecomment-747754631
# [Codecov](https://codecov.io/gh/apache/hudi/pull/2343?src=pr=h1) Report
> Merging
[#2343](https://codecov.io/gh/apache/hudi/pull/2343?src=pr=desc) (374bbb6)
into
codecov-io commented on pull request #2343:
URL: https://github.com/apache/hudi/pull/2343#issuecomment-747754631
# [Codecov](https://codecov.io/gh/apache/hudi/pull/2343?src=pr=h1) Report
> Merging
[#2343](https://codecov.io/gh/apache/hudi/pull/2343?src=pr=desc) (374bbb6)
into
[
https://issues.apache.org/jira/browse/HUDI-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17251378#comment-17251378
]
Vinoth Chandar commented on HUDI-1463:
--
We can start by outlining the summary of releases this year
[
https://issues.apache.org/jira/browse/HUDI-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17251379#comment-17251379
]
Vinoth Chandar commented on HUDI-1463:
--
wdyt?
> Accomplishments (2019-2020) and Roadmap (2021-2022)
vinothchandar commented on pull request #2343:
URL: https://github.com/apache/hudi/pull/2343#issuecomment-747747412
Good optimization. Will review
This is an automated message from the Apache Git Service.
To respond to the
[
https://issues.apache.org/jira/browse/HUDI-1469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated HUDI-1469:
-
Labels: pull-request-available (was: )
> Faster initialization for larger datasets
>
prashantwason commented on pull request #2343:
URL: https://github.com/apache/hudi/pull/2343#issuecomment-747741409
@vinothchandar Please take a look.
This is an automated message from the Apache Git Service.
To respond to
prashantwason opened a new pull request #2343:
URL: https://github.com/apache/hudi/pull/2343
This finds partitions and files in a single scan rather than listing
partitions first and then listing each partition.
## Brief change log
*(for example:)*
- *Modify
Prashant Wason created HUDI-1469:
Summary: Faster initialization for larger datasets
Key: HUDI-1469
URL: https://issues.apache.org/jira/browse/HUDI-1469
Project: Apache Hudi
Issue Type:
[
https://issues.apache.org/jira/browse/HUDI-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Prashant Wason closed HUDI-1305.
Resolution: Fixed
> Prevent log pollution from console metrics logger
>
[
https://issues.apache.org/jira/browse/HUDI-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Prashant Wason updated HUDI-1305:
-
Status: Open (was: New)
> Prevent log pollution from console metrics logger
>
[
https://issues.apache.org/jira/browse/HUDI-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Prashant Wason updated HUDI-1303:
-
Status: Open (was: New)
> Some improvements for the HUDI Test Suite
>
[
https://issues.apache.org/jira/browse/HUDI-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Prashant Wason closed HUDI-1303.
Resolution: Fixed
> Some improvements for the HUDI Test Suite
>
vinothchandar commented on pull request #2311:
URL: https://github.com/apache/hudi/pull/2311#issuecomment-747703536
@nsivabalan could you check the build?
This is an automated message from the Apache Git Service.
To respond
vinothchandar commented on a change in pull request #2342:
URL: https://github.com/apache/hudi/pull/2342#discussion_r545394203
##
File path:
hudi-common/src/main/java/org/apache/hudi/metadata/HoodieTableMetadataTimelineUtil.java
##
@@ -0,0 +1,334 @@
+/*
+ * Licensed to the
AnweshaSen commented on issue #2076:
URL: https://github.com/apache/hudi/issues/2076#issuecomment-747676296
Hi, I am very new to Hudi and I faced similar kind error while dealing with
csv.
I tried with a simple csv, having a structure like:
+---+-+
|age| Name|
satish created HUDI-1468:
Summary: incremental read support with clustering
Key: HUDI-1468
URL: https://issues.apache.org/jira/browse/HUDI-1468
Project: Apache Hudi
Issue Type: Sub-task
satishkotha commented on a change in pull request #2263:
URL: https://github.com/apache/hudi/pull/2263#discussion_r545367096
##
File path:
hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieClusteringConfig.java
##
@@ -0,0 +1,155 @@
+/*
+ * Licensed to
satishkotha commented on pull request #2254:
URL: https://github.com/apache/hudi/pull/2254#issuecomment-747667904
@n3nash @bvaradar Can one of you review as well?
This is an automated message from the Apache Git Service.
To
satishkotha commented on a change in pull request #2254:
URL: https://github.com/apache/hudi/pull/2254#discussion_r545364740
##
File path:
hudi-client/hudi-spark-client/src/test/java/org/apache/hudi/client/TestHoodieClientOnCopyOnWriteStorage.java
##
@@ -999,6 +999,103 @@
pengzhiwei2018 edited a comment on pull request #2283:
URL: https://github.com/apache/hudi/pull/2283#issuecomment-739497762
> @pengzhiwei2018 would you please describe in more details about the issue?
Hi @leesf ,Sorry for the late response. I find that when reading a hudi
table
[
https://issues.apache.org/jira/browse/HUDI-1415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
pengzhiwei updated HUDI-1415:
-
Description:
If we update a hudi table twice more, we will get an incorrect query count by
spark sql.
brandon-stanley commented on issue #2331:
URL: https://github.com/apache/hudi/issues/2331#issuecomment-747551023
Thanks for your response @prashantwason.
Does this mean that the implementation of maintaining schemas within Hudi is
more of a _wrapper_ around Avro which has an
sbernauer commented on pull request #2316:
URL: https://github.com/apache/hudi/pull/2316#issuecomment-747528223
Im using >= 0.6.0 from master branch and Spark 3.0.1
I'm sorry I can't downgrade to spark 2.4
But I will try removing the relocation
vinothchandar commented on pull request #2342:
URL: https://github.com/apache/hudi/pull/2342#issuecomment-747486716
@rmpifer could you please rebase this against latest `rfc-15` branch. I ll
get started with the review in the meantime
vinothchandar commented on a change in pull request #2326:
URL: https://github.com/apache/hudi/pull/2326#discussion_r545145067
##
File path:
hudi-common/src/main/java/org/apache/hudi/common/config/HoodieMetadataConfig.java
##
@@ -128,24 +128,23 @@ public Builder
vinothchandar commented on a change in pull request #2263:
URL: https://github.com/apache/hudi/pull/2263#discussion_r544859334
##
File path:
hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieClusteringConfig.java
##
@@ -0,0 +1,165 @@
+/*
+ * Licensed
This is an automated email from the ASF dual-hosted git repository.
vinoth pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/hudi.git
The following commit(s) were added to refs/heads/master by this push:
new 14d5d11 [HUDI-1406] Add date partition based
vinothchandar merged pull request #2264:
URL: https://github.com/apache/hudi/pull/2264
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go
wangxianghu created HUDI-1467:
-
Summary: Promote Powered by chapter to top level menu
Key: HUDI-1467
URL: https://issues.apache.org/jira/browse/HUDI-1467
Project: Apache Hudi
Issue Type:
garyli1019 commented on a change in pull request #2296:
URL: https://github.com/apache/hudi/pull/2296#discussion_r544993950
##
File path:
hudi-spark-datasource/hudi-spark/src/test/scala/org/apache/hudi/functional/TestCOWDataSource.scala
##
@@ -320,4 +320,21 @@ class
Karl-WangSK commented on pull request #2260:
URL: https://github.com/apache/hudi/pull/2260#issuecomment-747348473
@vinothchandar
This is an automated message from the Apache Git Service.
To respond to the message, please
Karl-WangSK removed a comment on pull request #2260:
URL: https://github.com/apache/hudi/pull/2260#issuecomment-730111282
cc @bvaradar @yanghua
This is an automated message from the Apache Git Service.
To respond to the
codecov-io edited a comment on pull request #2260:
URL: https://github.com/apache/hudi/pull/2260#issuecomment-729530724
# [Codecov](https://codecov.io/gh/apache/hudi/pull/2260?src=pr=h1) Report
> Merging
[#2260](https://codecov.io/gh/apache/hudi/pull/2260?src=pr=desc) (0750f24)
into
wangxianghu created HUDI-1466:
-
Summary: Migrate CI/CD from travis to Azure pipeline
Key: HUDI-1466
URL: https://issues.apache.org/jira/browse/HUDI-1466
Project: Apache Hudi
Issue Type: New
This is an automated email from the ASF dual-hosted git repository.
vinoyang pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/hudi.git
The following commit(s) were added to refs/heads/master by this push:
new 4ddfc61 [MINOR] Make QuickstartUtil generate
yanghua merged pull request #2340:
URL: https://github.com/apache/hudi/pull/2340
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to
garyli1019 commented on a change in pull request #2281:
URL: https://github.com/apache/hudi/pull/2281#discussion_r544952178
##
File path:
hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieDataSourceConfig.java
##
@@ -0,0 +1,102 @@
+/*
+ * Licensed to
liujinhui1994 commented on a change in pull request #2281:
URL: https://github.com/apache/hudi/pull/2281#discussion_r544948270
##
File path:
hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieDataSourceConfig.java
##
@@ -0,0 +1,102 @@
+/*
+ * Licensed
wangxianghu commented on a change in pull request #2281:
URL: https://github.com/apache/hudi/pull/2281#discussion_r544888525
##
File path:
hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieDataSourceConfig.java
##
@@ -0,0 +1,102 @@
+/*
+ * Licensed to
garyli1019 commented on a change in pull request #2281:
URL: https://github.com/apache/hudi/pull/2281#discussion_r544885347
##
File path:
hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieDataSourceConfig.java
##
@@ -0,0 +1,102 @@
+/*
+ * Licensed to
83 matches
Mail list logo