Re: [DISCUSS] Promote Hudi Chinese Documentation into the official website

2019-08-13 Thread Y. Ethan Guo
+1 I can also help on the Chinese version of the docs. On Tue, Aug 13, 2019 at 11:08 AM Vinoth Chandar wrote: > +1 Thanks for starting this initiative, Vino. > > I also suggest we add a new component in JIRA with a few volunteers to help > review PRs that come in this area? > > On Tue, Aug 13,

Re: Dropping support for Spark 2.2 and lower

2019-09-10 Thread Y. Ethan Guo
+1 we’re on Spark 2.4. On Tue, Sep 10, 2019 at 11:22 AM Minh Pham wrote: > +1 we are also on 2.3 and want to move to 2.4 > > Sent from my iPhone > > > On Sep 10, 2019, at 4:22 AM, taher koitawala wrote: > > > > +1 we can drop that > > > >> On Tue, Sep 10, 2019 at 4:45 PM Kabeer Ahmed > wrote:

Re: new committer: leesf/Shaofeng Li

2019-11-02 Thread Y. Ethan Guo
Congrats, @leesf, great to see this happening! On Sat, Nov 2, 2019 at 1:32 PM Vinoth Chandar wrote: > Hello all, > > The Podling Project Management Committee (PPMC) for Apache Hudi > (Incubating) has invited Shaofeng Li to become a committer and we are > pleased to announce that he has

Re: new committer: vinoyang/Hua Yang

2019-11-02 Thread Y. Ethan Guo
Congrats, @vinoyang, great work! On Sat, Nov 2, 2019 at 1:51 PM Kabeer Ahmed wrote: > Hua Yang and Shaofeng Li - Congratulations on your achievement! > > On Nov 2 2019, at 8:36 pm, Vinoth Chandar wrote: > > Hello all, > > > > The Podling Project Management Committee (PPMC) for Apache Hudi > >

[DISCUSS] Intent to RFC: Restructuring and auto-generation of docs

2019-11-13 Thread Y Ethan Guo
Hey Folks, I plan to start an RFC in the Docs Overhaul track. The scope of this RFC will be the restructuring and auto-generation of docs, with the following goals: - Make it easier for users to understand Hudi's main features and access docs of each release - Separate the actual

Re: [DISCUSS] Intent to RFC: Restructuring and auto-generation of docs

2019-11-13 Thread Y Ethan Guo
> > > > > Best, > > > Raymond > > > > > > On Wed, Nov 13, 2019 at 4:37 AM leesf wrote: > > > > > > > +1. It is very practical and thanks for driving the discussion. > > > > > > > > vinoth and balaji would give

[DISCUSS] RFC-10: Restructuring and auto-generation of docs

2019-11-13 Thread Y Ethan Guo
Hey folks, I put my thoughts around the topic in this RFC: RFC-10: Restructuring and auto-generation of docs https://cwiki.apache.org/confluence/display/HUDI/RFC-10%3A+Restructuring+and+auto-generation+of+docs Feel free to provide feedback there or here. Thanks, - Ethan

Re: [DISCUSS] RFC-10: Restructuring and auto-generation of docs

2019-11-14 Thread Y Ethan Guo
may want to use this comparison to choose among Hudi and others. Maybe we can update the comparison every Hudi release (a month or two) or quarter? > Regards, > Gurudatt > > > On Thu, Nov 14, 2019 at 6:21 AM Y Ethan Guo > wrote: > > > Hey folks, > > >

Re: [DISCUSS] Simplification of terminologies

2019-11-12 Thread Y. Ethan Guo
+1 on [1] and [2]. For [3], I have similar doubts as Shiyan. For the naming, I can understand the original intent of the analogy for COW which is to make another "copy" of columnar/parquet file upon the modification/update to the records in the file. From the system design point of view, it's

Re: [DISCUSS] RFC-10: Restructuring and auto-generation of docs

2019-11-15 Thread Y Ethan Guo
ranular > feature level, is a larger task to keep up with, and that too at every > release. > But I am all for more blogs explaining the tradeoffs we made in Hudi more > clearly for sure. > > > On Thu, Nov 14, 2019 at 11:33 AM Y Ethan Guo > wrote: > > > Hey Gurudatt, >

Re: EMR + HUDI

2019-11-15 Thread Y Ethan Guo
Great achievement! Thanks to all who contributed to this. On Fri, Nov 15, 2019 at 10:50 AM vbal...@apache.org wrote: > This is massive news !! Many thanks to Udit, Rahul and AWS team for > working with us patiently and making HUDI part of EMR. This is indeed a > marathon effort !! > Looking

Re: [DISCUSS] Scaling community support

2019-12-10 Thread Y Ethan Guo
Here are my two cents in addition to the great suggestions in the thread: I agree with @Sivabalan that folks in Hudi community have different levels of expertise and amount of effort to put in the community. So in general, it may be good to have PoCs or contributors for each area in Hudi, e.g.,

Re: [DISCUSS] Introduce stricter comment and code style validation rules

2019-11-19 Thread Y Ethan Guo
+1 on all of the proposed rules. These will also make the javadoc more readable. On Mon, Nov 18, 2019 at 5:55 PM Vinoth Chandar wrote: > +1 on all three. > > Would there be an overhaul of existing code to add comments to all classes? > We are pretty reasonable already, but good to get this in

Re: [QUESTION] Encountering exceptions while upserting with Deltastreamer

2019-12-19 Thread Y Ethan Guo
> > without issues and the second set of data that caused an issue. Please > > check that you are allowed to share the data. > > Thanks > > Kabeer. > > > > On Dec 19 2019, at 7:33 pm, Y Ethan Guo > wrote: > > > Hi folks, > > >

[QUESTION] Encountering exceptions while upserting with Deltastreamer

2019-12-19 Thread Y Ethan Guo
Hi folks, I'm testing a new Deltastreamer job in cluster which incrementally pulls data from an upstream Hudi table and upserts the dataset into another table. The first run of Deltastreamer job which involves only inserts succeeded. The second run of the job which involves updates throws the

Re: [QUESTION] Encountering exceptions while upserting with Deltastreamer

2019-12-19 Thread Y Ethan Guo
Got it. Thanks for the clarification. On Thu, Dec 19, 2019 at 2:54 PM nishith agarwal wrote: > Ethan, > > There isn't one available in the open-source, it's an internal build we > have. > > -Nishith > > On Thu, Dec 19, 2019 at 2:50 PM Y Ethan Guo > wrote: >

Re: [DISCUSS] RFC-10 Restructuring and auto-generation of docs

2019-12-20 Thread Y Ethan Guo
, manual update of content through PMC should not be hard). Once I have the automated scripts ready, I'll try to hook things up with buildbot. - Ethan On Fri, Dec 20, 2019 at 8:55 PM lamberken wrote: > > > Hi @Y Ethan Guo @Vinoth > > > I have some ideas for RFC-10 which aims to im

Re: [QUESTION] Encountering exceptions while upserting with Deltastreamer

2019-12-21 Thread Y Ethan Guo
> > { "name": "version", "type": "long" } > > ], > > "name": "master_cluster", > > "type": "record" > > } > > ] > > }, > > > > ___ > >

Re: IDE setup for code formatting

2020-02-28 Thread Y Ethan Guo
Reviving this thread... I'm hitting checkstyle issues locally again and thinking that it might be worth trying GJF. It interoperates well with IntelliJ and the automated tool reformats the code in 1 shot as Minh suggested. If sweeping checkstyle fixes in the codebase disrupts development, we

Re: IDE setup for code formatting

2020-02-28 Thread Y Ethan Guo
ut we have turned it off. > > Thanks > Vinoth > > On Fri, Feb 28, 2020 at 12:47 PM Y Ethan Guo > wrote: > > > Reviving this thread... > > > > I'm hitting checkstyle issues locally again and thinking that it might be > > worth trying GJF. It interoperates

Re: IDE setup for code formatting

2020-02-29 Thread Y Ethan Guo
l focus on technical PRs for now before we have a strategy. Best, - Ethan > > On Fri, Feb 28, 2020 at 4:30 PM Y Ethan Guo > wrote: > > > Yes. Mostly when I run `mvn clean package -DskipTests -DskipITs` > locally, > > I saw some style issues. Two main things not we

Re: Re: Please welcome our new PPMCs and Committer

2020-02-17 Thread Y Ethan Guo
Congrats!! Great work! On Sun, Feb 16, 2020 at 8:27 AM Pratyaksh Sharma wrote: > Congratulations Leesf, Vino and Siva. Well deserved all of you. :) > > On Sat, Feb 15, 2020 at 6:05 PM leesf wrote: > > > Thanks you guys, it is really a great honor for me and I am very excited. > > Really happy

Re: [DISCUSS] Code freeze date for next release(0.5.1)

2020-01-08 Thread Y Ethan Guo
+1 on the timeline On Wed, Jan 8, 2020 at 6:01 PM vino yang wrote: > +1 > > Bhavani Sudha 于2020年1月9日周四 上午5:24写道: > > > +1 great idea. > > > > On Wed, Jan 8, 2020 at 1:20 PM Shiyan Xu > > wrote: > > > > > +1. Good idea for testing phase. > > > > > > On Wed, 8 Jan 2020, 08:26 Vinoth Chandar,

Re: Re: Re: Re: Re: Re: Re: Re: Re:Re: Re: Re:Re: Re: Re: [DISCUSS] Rework of new web site

2020-01-08 Thread Y Ethan Guo
>vacation. > >@lamber-ken The new site looks cool. Thanks for the time and effort you > >have put into this. > > > >Thanks, > >Sudha > > > > > > > >On Tue, Jan 7, 2020 at 11:45 PM lamberken wrote: > > > >> > >> > >> H

Re: [DISCUSS] Delay code freeze date for next release until Jan 19th (Sunday)

2020-01-15 Thread Y Ethan Guo
+1, this allows more time to land critical PRs. On Wed, Jan 15, 2020 at 10:54 PM leesf wrote: > > I assume you meant UTC-8  > Sure :) >

Re: Re: IDE setup for code formatting

2019-12-23 Thread Y Ethan Guo
+1 on auto-formatting the code in IDE based on the checkstyle rules. Based on my experience with Java and Scala in IntelliJ, there is indeed a discrepancy in auto-formatting code for some custom checkstyle rules. For such cases, I tried to avoid using them if they do not sacrifice too much on the

Re: Re: Re: Re: Re: Re: Re:Re: Re: Re:Re: Re: Re: [DISCUSS] Rework of new web site

2020-01-07 Thread Y Ethan Guo
companion Chinese version on the old website (e.g., http://hudi.apache.org/cn/writing_data.html). So if it's not hard to port them to the new website, they are still useful for the users. Best, - Ethan On Tue, Jan 7, 2020 at 11:05 PM lamberken wrote: > > > Hi @Y Ethan Guo, > > >

Re: Re: Re: Re: Re: Re:Re: Re: Re:Re: Re: Re: [DISCUSS] Rework of new web site

2020-01-07 Thread Y Ethan Guo
@lamber-ken, Thanks for the great effort! The new website looks slick, with a much better browsing experience. One thing I noticed is that there seems to be no link to the Chinese version of the docs on the new website. Wondering where I can find them. Another minor thing is that the font

Re: [DISCUSS] Adding common errors and solutions to FAQs

2020-03-12 Thread Y Ethan Guo
I can help check the history of issues mentioned in Slack/Github and classify them to the troubleshooting guide Pratyaksh put up. In terms of the troubleshooting issues in Slack, shall we also copy useful ones to a separate doc for context? It's hard to track older messages and we're unable to

Re: [DISCUSS] Adding common errors and solutions to FAQs

2020-03-12 Thread Y Ethan Guo
I'll go ahead on this. If anyone else would like to help, feel free to ping me. Thanks, - Ethan On Thu, Mar 12, 2020 at 11:26 AM Y Ethan Guo wrote: > I can help check the history of issues mentioned in Slack/Github and > classify them to the troubleshooting guide Pratyaksh

Re: New Committer: lamber-ken

2020-04-07 Thread Y Ethan Guo
Congrats!!! On Tue, Apr 7, 2020 at 2:22 PM Gary Li wrote: > Congrats lamber! Well deserved! > > On Tue, Apr 7, 2020 at 2:18 PM Vinoth Chandar wrote: > > > Hello Apache Hudi Community, > > > > The Podling Project Management Committee (PPMC) for Apache > > Hudi (Incubating) has invited

Re: New PPMC Member : Bhavani Sudha

2020-04-07 Thread Y Ethan Guo
Congrats!!! On Tue, Apr 7, 2020 at 2:55 PM Vinoth Chandar wrote: > Hello all, > > I am very excited to share that we have new PPMC member - Sudha. She has > been a great champion for the project for almost couple years now, driving > a lot of presto/query engine facing changes and most of all

Re: [DISCUSS] should we do a 0.5.3 patch set release ?

2020-05-06 Thread Y Ethan Guo
+1 On Wed, May 6, 2020 at 6:29 PM vino yang wrote: > +1 for 0.5.3 as well > > Nishith 于2020年5月7日周四 上午8:16写道: > > > +1 on the idea > > > > Sent from my iPhone > > > > > On May 6, 2020, at 3:09 PM, Shiyan Xu > > wrote: > > > > > >

Re: [VOTE] Apache Hudi graduation to top level project

2020-05-06 Thread Y Ethan Guo
+1 On Wed, May 6, 2020 at 9:59 PM Prasanna Rajaperumal wrote: > +1 > > On 2020/05/06 20:55:48, Vinoth Chandar wrote: > > Hello all, > > > > Per our discussion on the dev mailing list ( > > >

Re: Bug Bash 0.6.0

2020-05-15 Thread Y Ethan Guo
Thanks for putting this together, Siva and folks! I'm currently blocked on my other PR, so if there are bug bash tickets unassigned or not worked on, I can pick up one at this time. On Fri, May 15, 2020 at 6:31 AM Sivabalan wrote: > As we discussed earlier in our mailing list, here we are

Re: [VOTE] Release 0.5.2-incubating, release candidate #2

2020-03-22 Thread Y Ethan Guo
+1 (non-binding) did the same checks as rc1 - [OK] Checksums and signatures - [OK] NOTICE, DISCLAIM, LICENSE files exist - [OK] Compilation & tests (`mvn clean package`, `mvn clean package -Dscala-2.12`, `mvn clean package -DskipTests -DskipITs -Pspark-shade-unbundle-avro`) - [OK] Javadoc (`mvn
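The "Checksums and signatures" item in the vote above can be illustrated with a minimal sketch of the checksum half. This is not the project's validation script: the function names are hypothetical, and it assumes the checksum file starts with the hex SHA-512 digest (the `sha512sum` layout), which may not hold for every published checksum file. Signature verification is a separate step and is not covered here.

```python
import hashlib
from pathlib import Path


def sha512_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-512 digest of a file, reading it in chunks."""
    digest = hashlib.sha512()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def checksum_matches(artifact: Path, checksum_file: Path) -> bool:
    """Compare the artifact's digest with the first token of the checksum file.

    Assumption: the checksum file begins with the hex digest, optionally
    followed by the file name.
    """
    tokens = checksum_file.read_text().split()
    expected = tokens[0].lower() if tokens else ""
    return sha512_of(artifact) == expected
```

Calling `checksum_matches(Path("artifact.tgz"), Path("artifact.tgz.sha512"))` with real paths returns `True` only when the computed and published digests agree; the file names here are placeholders.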

Re: Unit tests in hudi-client module fail due to SparkContext

2020-07-28 Thread Y Ethan Guo
>> <https://github.com/apache/hudi/blob/b2763f433b3efb92fdcc0e760a88a43eaa2e5be3/pom.xml#L124>, >> which should be plentiful for all tests so far. >> The OOM looks weird to me.. maybe try checking the maven log see if >> -Xmx2g is indeed applied >> >> On Mon, Jul 27, 2020 at 11:11 PM

Unit tests in hudi-client module fail due to SparkContext

2020-07-26 Thread Y Ethan Guo
Hi, I'm working on hudi-client module and I notice that if I run all unit tests under hudi-client locally in IntelliJ, some tests (54 out of 256) are failing due to the following SparkException: "Only one SparkContext may be running in this JVM". Is there any way I can get around this?

Re: Unit tests in hudi-client module fail due to SparkContext

2020-07-28 Thread Y Ethan Guo
e test suite to speed things up. So > that's probably what you were hitting first, when running via IDE > Try following the .travis.yml profiles directly? > > > Not sure about the OOM. Have not seen this before. > > On Mon, Jul 27, 2020 at 11:00 PM Y Ethan Guo > wrote: >

[DISCUSS] Refactor hudi-client module for better support of multiple engines

2021-09-15 Thread Y Ethan Guo
Hi all, hudi-client module has core Hudi abstractions and client logic for different engines like Spark, Flink, and Java. While previous effort (HUDI-538 [1]) has decoupled the integration with Spark, there is quite some code duplication across different engines for almost the same logic due to

Re: Monthly or Bi-Monthly Dev meeting?

2021-09-23 Thread Y Ethan Guo
+1 on monthly community sync. On Thu, Sep 23, 2021 at 12:32 PM Udit Mehrotra wrote: > +1 for the monthly meeting. It would be great to start syncing up > again. Thanks Vinoth for bringing it up ! > > On Thu, Sep 23, 2021 at 12:14 PM Sivabalan wrote: > > > > +1 on monthly meet up. > > > > On

Re: [DISCUSS] Hudi 0.10.0 Release

2021-11-19 Thread Y Ethan Guo
Hi Danny, Thanks for summarizing the current progress towards the 0.10.0 release. I'm good with Nov 26th cutoff. Regarding my blockers: - [HUDI-2332] Implement scheduling of compaction/ clustering for Kafka Connect (Owner: Ethan Guo) PR is up. I'm addressing comments. - [HUDI-2737] Use

Re: [VOTE] Release 0.10.0, release candidate #3

2021-12-05 Thread Y Ethan Guo
+1 (non-binding) - [OK] Ran release validation script [1] - [OK] Built the source (Spark 2/3) - [OK] Ran Spark Guide in Quick Start using Spark 3.1.2 [1] https://gist.github.com/yihua/39ef5b07a08ed5780fa9c43819b326cb Best, - Ethan On Sat, Dec 4, 2021 at 1:27 PM Bhavani Sudha wrote: > +1

Re: Regular minor/patch releases

2021-12-14 Thread Y Ethan Guo
+1 on packing bug fixes (at best effort) to minor releases. On Mon, Dec 13, 2021 at 12:06 PM Sivabalan wrote: > +1 in general. but yeah, not sure if we have resources to do this for > every major release. > > On Mon, Dec 13, 2021 at 10:01 AM Vinoth Chandar wrote: > >> Hi all, >> >> In the past

Re: Unbundling "spark-avro" dependency

2022-03-08 Thread Y Ethan Guo
Thanks for raising the discussion. I agree that, from the user's usability standpoint, we should keep the same expectations regarding "--packages" for Spark and the reliance on the bundled spark-avro for the utilities bundle in this release. Given that there are Spark API changes between 3.2.0 and

Re: [PSA] CI failures, PR merges halted

2022-03-24 Thread Y Ethan Guo
Hi all, The CI issues have been resolved. CI is green on master. Please rebase your PRs on the latest master to avoid noise in CI runs. Best, - Ethan On Wed, Mar 23, 2022 at 8:26 AM sagar sumit wrote: > Hi all, > > We have noticed consistent failure in the CI. These failures are mainly >

Calling for 0.12.4 release

2023-08-31 Thread Y Ethan Guo
Hi folks, It's been 4+ months since Hudi 0.12.3 was released. As we want to maintain 0.12.x LTS releases, shall we, as a community, follow up with 0.12.4 release to pick up recent bug fixes and improvements? Any volunteer for 0.12.4 Release Manager is welcome. Thanks, - Ethan

Re: Calling for 0.12.4 release

2023-09-01 Thread Y Ethan Guo
Thanks, Yue Zhang, for volunteering to be the RM! On Thu, Aug 31, 2023 at 4:38 PM Yue Zhang wrote: > Hi Hudiers, > I volunteer to be the RM for the next 0.12.4 if u don’t mind > YueZhang > Replied Message > | From | Y Ethan Guo | > | Date | 09/01/2023 07

Re: [ANNOUNCE] Apache HUDI 0.14.0 released

2023-10-16 Thread Y Ethan Guo
Thank you, Prashant, for driving the release! Thank you to everyone who contributed to the major release! - Ethan On Wed, Oct 4, 2023 at 10:57 PM Kyle Weller wrote: > Super exciting news, congratulations on the hard work to everyone who > contributed! > > On Wed, Oct 4, 2023 at 10:38 PM

Re: [VOTE] Release 0.14.0, release candidate #3

2023-09-22 Thread Y Ethan Guo
+1 (binding) - Ran validate_staged_release.sh [OK] - Hudi (Delta)streamer with error injection [OK] - Bundle validation https://github.com/apache/hudi/actions/runs/6277569953 [OK] - Ethan On Fri, Sep 22, 2023 at 10:29 AM Jonathan Vexler wrote: > +1 (non-binding) > - Tested Spark Datasource

Re: [DISSCUSS][NEW FEATURE] Hudi Lake Manager

2022-04-18 Thread Y Ethan Guo
ty. > > What is the final state of the Hudi service here ? Should we drop the > advantage of the server-less/light-weight architecture and moves > forward to a service mode ? > I mean will Hudi be more and more like a database on the cloud ? > > Best, > Danny > > Y Et

Re: [VOTE] Release 0.11.0, release candidate #2

2022-04-18 Thread Y Ethan Guo
-1 The Kafka Connect Sink for Hudi cannot ingest data using hudi-kafka-connect-bundle from 0.11.0-rc2 due to NoClassDefFoundError. The following fix is put up. https://github.com/apache/hudi/pull/5353 Best, - Ethan On Fri, Apr 15, 2022 at 5:20 AM Shiyan Xu wrote: > Hi everyone, > > Please

Re: [VOTE] Release 0.11.0, release candidate #1

2022-04-10 Thread Y Ethan Guo
-1 During my testing of 0.11.0 RC1 with Deltastreamer, errors have come up due to the issues Siva mentioned. On Sun, Apr 10, 2022 at 11:03 AM Shiyan Xu wrote: > -1 > > Rat plugin in CI was not working for some time and resulted in some files > missing Apache license header. This was fixed in

Re: [VOTE] Monthly Community Sync Time

2022-05-18 Thread Y Ethan Guo
+1 for new time. I prefer 9AM PT. On Tue, May 17, 2022 at 11:40 PM Pratyaksh Sharma wrote: > I would go with 8 AM PT. > > If that is not feasible, then 8.30 AM. > > On Wed, May 18, 2022 at 7:14 AM Vinoth Govindarajan < > vinoth.govindara...@gmail.com> wrote: > > > +1 > > > > I vote for 9 am as

0.11.1 release timeline

2022-05-23 Thread Y Ethan Guo
Hi folks, As the RM for the 0.11.1 release, I'd like to propose the code freeze on Jun 1st (Wed) for any bug fixes that are going to be included in the minor release, about a month after the 0.11.0 release. Let me know if you need more time for fixing any issues. Please tag any fix that you

[ANNOUNCE] Apache Hudi 0.11.1 released

2022-06-21 Thread Y Ethan Guo
The Apache Hudi team is pleased to announce the release of Apache Hudi 0.11.1. Apache Hudi (pronounced Hoodie) stands for Hadoop Upserts Deletes and Incrementals. Apache Hudi manages storage of large analytical datasets on DFS (Cloud stores, HDFS or any Hadoop FileSystem compatible storage) and

[VOTE] Release 0.11.1, release candidate #1

2022-06-09 Thread Y Ethan Guo
Hi everyone, Please review and vote on the release candidate #1 for the version 0.11.1, as follows: [ ] +1, Approve the release [ ] -1, Do not approve the release (please provide specific comments) The complete staging area is available for your review, which includes: * JIRA release notes

Re: Updates on 0.11.1 release

2022-06-10 Thread Y Ethan Guo
> > > > Please tell me if my email does not respect the release process > > > > On Wed Jun 8, 2022 at 1:39 AM CEST, Y Ethan Guo wrote: > > > Hi folks, > > > > > > All the 0.11.1 release blockers are landed. I'm going to cut RC1 and > > > start >

Re: [VOTE] Release 0.11.1, release candidate #1

2022-06-11 Thread Y Ethan Guo
Based on the feedback, we will cancel RC1 and I'll start preparing RC2 within a day. Thank you all. On Thu, Jun 9, 2022 at 11:42 AM Y Ethan Guo wrote: > Hi everyone, > > Please review and vote on the release candidate #1 for the version 0.11.1, > as follows: > > [ ] +1, A

Re: [VOTE] Release 0.11.1, release candidate #2

2022-06-16 Thread Y Ethan Guo
: > Thanks Ethan, would appreciate it if > > https://issues.apache.org/jira/browse/HUDI-4255 > > can be involved, the bug may cause the flink bucket index throws > FileNotFoundException in some cases. > > Best, > Danny > > Y Ethan Guo 于2022年6月13日周一 07:17写道: > >

Re: [VOTE] Release 0.11.1, release candidate #2

2022-06-17 Thread Y Ethan Guo
6月16日(星期四) 晚上7:07 > > 收件人:"dev" > > > 主题:Re: [VOTE] Release 0.11.1, release candidate #2 > > > > > > > > +1 binding. > > > > Verified by running deltastreamer end-to-end on AWS EMR 6.5, Glue Studio > > 3.0, GCP Dataproc 2, completed

[RESULT] [VOTE] Release 0.11.1, release candidate #2

2022-06-17 Thread Y Ethan Guo
Hi everyone, I'm happy to announce that we have unanimously approved this release. There are 5 approving votes, 3 of which are binding. Here is the breakdown: +1 (binding) : 3 * Bhavani Sudha Saktheeswaran * Sivabalan Narayanan * Raymond Xu -1 (binding) : 0 +1 (non-binding) : 2 * Meng Tao *
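The breakdown in the result message above can be reproduced with a small tally sketch. The `Vote` class and `tally` function are hypothetical names for illustration, and the pass rule encoded here (at least three binding +1 votes, and more binding +1 than binding -1) is an assumption based on the usual Apache majority-approval convention, not something stated in the message itself.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Vote:
    value: int     # +1 (approve) or -1 (do not approve)
    binding: bool  # True when the vote is binding


def tally(votes: Iterable[Vote]) -> dict:
    """Count votes by kind and apply the assumed pass rule."""
    votes = list(votes)
    plus_binding = sum(1 for v in votes if v.value > 0 and v.binding)
    minus_binding = sum(1 for v in votes if v.value < 0 and v.binding)
    plus_non_binding = sum(1 for v in votes if v.value > 0 and not v.binding)
    # Assumed rule: at least three binding +1 and more binding +1 than binding -1.
    passed = plus_binding >= 3 and plus_binding > minus_binding
    return {
        "plus_binding": plus_binding,
        "minus_binding": minus_binding,
        "plus_non_binding": plus_non_binding,
        "passed": passed,
    }


# Counts taken from the message: 3 binding +1, 2 non-binding +1, 0 binding -1.
result = tally([Vote(1, True)] * 3 + [Vote(1, False)] * 2)
print(result)
```

Non-binding votes are counted for reporting but do not affect the outcome under this rule, which matches how the message lists them separately.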

Re: [VOTE] Release 0.11.1, release candidate #2

2022-06-17 Thread Y Ethan Guo
Thanks for voting, everyone. The voting is now closed. I will send a separate email to provide the final result. Best, - Ethan On Thu, Jun 16, 2022 at 11:49 PM Y Ethan Guo wrote: > Thank you folks for voting. I'll wait for another 12 hours for folks who > still want to vote before c

[VOTE] Release 0.11.1, release candidate #2

2022-06-12 Thread Y Ethan Guo
Hi everyone, Please review and vote on the release candidate #2 for the version 0.11.1, as follows: [ ] +1, Approve the release [ ] -1, Do not approve the release (please provide specific comments) The complete staging area is available for your review, which includes: * JIRA release notes

Re: Updates on 0.11.1 release

2022-06-07 Thread Y Ethan Guo
Hi folks, All the 0.11.1 release blockers are landed. I'm going to cut RC1 and start the release candidate process. Thanks, - Ethan On Thu, Jun 2, 2022 at 9:48 PM Y Ethan Guo wrote: > Hi folks, > > There are still a few critical PRs on bridging the performance gaps > be

Updates on 0.11.1 release

2022-06-02 Thread Y Ethan Guo
Hi folks, There are still a few critical PRs on bridging the performance gaps between 0.11.0 and 0.10.1 that are pending, e.g., HUDI-4176, HUDI-4178, etc. We should get those landed for 0.11.1. In this case,

Re: 0.12.0 Release Timeline

2022-07-21 Thread Y Ethan Guo
+1 from my side. On Thu, Jul 21, 2022 at 1:10 AM Danny Chan wrote: > Have a quick review for the remaining release blockers and +1 from my side. > > Best, > Danny > > Vinoth Chandar 于2022年7月15日周五 13:29写道: > > > > +1 from me. > > > > On Thu, Jul 14, 2022 at 9:43 AM sagar sumit wrote: > > > > >

Re: [DISCUSS] hudi index improve

2022-04-18 Thread Y Ethan Guo
+1 it would be great to make Hudi's index support all query engines. Given that we already have multi-modal index (column stats index, bloom filter index) in metadata table and there is a proposal to have a metastore server, is the ultimate goal to serve the index from metastore leveraging

Re: [DISSCUSS][NEW FEATURE] Hudi Lake Manager

2022-04-18 Thread Y Ethan Guo
+1 This is a great idea! The proposed lake manager and centralized management layer are essential to ease the burden of carrying out data governance and optimizing the storage layout, making them independent of ingestion and streaming. I see that this provides a better abstraction for any

Re: [ANNOUNCE] Apache Hudi 0.11.0 released

2022-05-04 Thread Y Ethan Guo
+1 Thank you, Raymond, for driving the 0.11.0 release and coordinating the community on getting things done! On Wed, May 4, 2022 at 12:52 PM Sivabalan wrote: > Kudos Raymond on the humongous effort. > > On Tue, 3 May 2022 at 00:16, Vinoth Chandar wrote: > > > +1 this was a very well coordinated

Re: Release manager for 0.11.1 and 0.12.0

2022-05-04 Thread Y Ethan Guo
I'd like to be the release manager of the next minor release. Best, - Ethan On Wed, May 4, 2022 at 6:47 PM Shiyan Xu wrote: > Hi everyone, > > I'd like to call for volunteers for release managers for the next minor and > major releases. It'll be beneficial to have RM appointed from the

Re: [ANNOUNCE] Apache Hudi 0.12.0 released

2022-08-18 Thread Y Ethan Guo
Thank you, Sagar, for the coordination to make this release happen! Congrats to all the contributors! On Thu, Aug 18, 2022 at 11:22 AM Vinoth Chandar wrote: > Great job, Sagar! Huge congratulations to the entire community in getting > this out! > > On Thu, Aug 18, 2022 at 10:45 PM sagar sumit

Re: Release managers for 0.12.1 and 0.13.0

2022-08-25 Thread Y Ethan Guo
I’d love to be the RM of 0.13.0. Best, - Ethan On Thu, Aug 25, 2022 at 4:06 PM zhaojing yu wrote: > I'd like to be the release manager of the next minor release. > > Best, > - Zhaojing > > Shiyan Xu 于2022年8月26日周五 07:02写道: > > > Hi everyone, > > > > As we finished 0.12.0, we're planning for

Re: 0.12.1 release timeline

2022-09-27 Thread Y Ethan Guo
ry our > best > > to > > >> land them before starting of next week, but 28th would be more > > practical. > > >> > > >> On Tue, 20 Sept 2022 at 18:21, Vinoth Chandar > > wrote: > > >> > > >> > Thanks for sharing. Do we have an ETA for these? >

Re: [ANNOUNCE] Apache Hudi 0.12.1 released

2022-10-25 Thread Y Ethan Guo
Congrats! Thank you, Zhaojing, for driving the release to the finish line! On Mon, Oct 24, 2022 at 9:22 PM Shiyan Xu wrote: > Congrats! > > On Sun, Oct 23, 2022 at 3:57 PM Zhuoluo Yang > wrote: > > > Congrats! > > > > Thanks, > > Zhuoluo > > > > > > leesf 于2022年10月20日周四 09:03写道: > > > > >

Re: 0.12.1 release timeline

2022-09-20 Thread Y Ethan Guo
Hi Zhaojing, It would be good if we can land the following bootstrap fixes for 0.12.1 release. I'm working on getting them merged. HUDI-4855: https://github.com/apache/hudi/pull/6694 HUDI-4453: https://github.com/apache/hudi/pull/6676 Thanks, - Ethan On Tue, Sep 20, 2022 at 12:03 PM Alexey

Re: [DISCUSS] New RFC to support 'Snapshot view management'

2022-09-13 Thread Y Ethan Guo
Hi Feng Jian, Looking forward to the RFC! Is the snapshot view management more like managing commits / savepoints in the Hudi timeline and hiding Hudi internals from the users? Do you plan to merge the implementation of snapshot view and lifecycle management for the next major release (0.13.0)?

Re: [VOTE] Release 0.12.1, release candidate #1

2022-10-04 Thread Y Ethan Guo
+1 (non-binding) - [OK] checksums and signatures - [OK] ran release validation script - [OK] built successfully - [OK] error injection tests - [OK] table upgrade and downgrade tests On Tue, Oct 4, 2022 at 11:06 PM zhaojing yu wrote: > This commit has been reverted in version 0.12.1. > > Alexey

Re: [VOTE] Release 0.12.1, release candidate #2

2022-10-13 Thread Y Ethan Guo
+1 (non-binding) - [OK] checksums and signatures - [OK] ran release validation script - [OK] built successfully - [OK] table upgrade and downgrade tests On Thu, Oct 13, 2022 at 11:45 AM Rahil C wrote: > +1 (non-binding) > > Ran hudi-spark bundle against EMR integration tests > > > > On Thu,

Re: [VOTE] Release 0.12.0, release candidate #2

2022-08-13 Thread Y Ethan Guo
+1 (non-binding) - [OK] checksums and signatures - [OK] ran release validation script - [OK] built successfully (Spark 2.4, 3.2, 3.3) - [OK] ran Spark quickstart with Spark 3.3.0 - [OK] ran a few tests on schema evolution - [OK] Presto connector performance Best, - Ethan On Thu, Aug 11, 2022 at

Re: 0.13.0 release timeline

2023-01-09 Thread Y Ethan Guo
d also aim for the testing of the new features to be done by then. Thanks, - Ethan On Mon, Dec 12, 2022 at 9:53 AM Y Ethan Guo wrote: > Thank you folks for chiming in. > > The new code freeze date of the 0.13.0 release is *Jan 10, Tuesday, 11:59 > PM PST*. > > Thanks, > - Et

Re: [VOTE] Release 0.12.2, release candidate #1

2022-12-23 Thread Y Ethan Guo
+1 non-binding [OK] checksums and signatures [OK] ran release validation script [OK] built successfully (Spark 2.4, 3.3) [OK] Spark 3.3.1 quickstart guide On Fri, Dec 23, 2022 at 1:30 AM Bhavani Sudha wrote: > +1 binding > > [OK] Build successfully multiple supported spark versions > > [OK]

Re: 0.13.0 release timeline

2022-12-12 Thread Y Ethan Guo
sounds good to me! > > > > > > On Fri, 9 Dec 2022 at 09:01, Y Ethan Guo wrote: > > > > > Hi folks, > > > > > > As mentioned in another thread, RFC-46 (Optimize record payload > handling) > > > feature branch is planned for merging by the end of this we

Re: 0.13.0 release timeline

2022-12-09 Thread Y Ethan Guo
Hi folks, As mentioned in another thread, RFC-46 (Optimize record payload handling) feature branch is planned for merging by the end of this week. Given that RFC-46 is a major enhancement for 0.13.0 release, which requires more time to solidify on master, it is better to give at least 4-week

Re: RFC-46 Status Update

2022-12-09 Thread Y Ethan Guo
I agree that we can merge RFC-46 feature branch to master at the end of this week to give leeway to 0.12.2. On Tue, Dec 6, 2022 at 4:03 PM Alexey Kudinkin wrote: > Thanks for bubbling this up, Siva! > > Merge has not happened yet. > > What you're saying makes sense to me -- after merging

Re: 0.12.2 release code freeze date

2022-12-09 Thread Y Ethan Guo
+1 on Dec 12 as the code freeze date On Fri, Dec 9, 2022 at 3:16 AM Shiyan Xu wrote: > +1 > > On Fri, Dec 9, 2022 at 2:34 PM Sivabalan wrote: > > > sure, sounds good to me. We have been dragging this for 2 weeks ish. So, > > let's go ahead w/ Dec 12. > > > > On Thu, 8 Dec 2022 at 22:33, Satish

Re: RFC-46 Status Update

2022-12-09 Thread Y Ethan Guo
Hi Shawy and RFC-46 group, Thanks for bringing this up. Based on Alexey's latest update in another thread, the RFC-46 feature branch is going to be merged soon. It makes sense to me to let the new code bake in fully for 4 weeks and push back the code freeze date of 0.13.0 to early Jan. I'll

Re: [DISCUSS] Merging Nov and Dec community sync calls

2022-11-16 Thread Y Ethan Guo
+1 on having a single community sync call on Dec 14 during the holiday season. On Wed, Nov 16, 2022 at 5:12 PM Bhavani Sudha wrote: > Hello Hudi community, > > We have monthly community sync calls on the last Wednesday of every month. > For November and December months these collide with public

Re: [RFC] RFC-64: New APIs to facilitate faster Query Engine integrations

2022-11-15 Thread Y Ethan Guo
+1 this is a great effort to foster easier query engine integration with Hudi. Looking forward to the implementation. On Thu, Nov 10, 2022 at 6:15 PM 冯健 wrote: > Great feature, eagerly looking forward to it > > On Fri, 11 Nov 2022 at 07:16, Sivabalan wrote: > > > +1 Definitely will ease

Re: [VOTE] Release 0.13.0, release candidate #1

2023-01-30 Thread Y Ethan Guo
over also tickets that are not yet > implemented. > > BR, > Daniel > > sob., 28 sty 2023 o 23:34 Y Ethan Guo napisał(a): > > > Hi everyone, > > > > Please review and vote on the release candidate #1 for the version > 0.13.0, > > as follows: >

Re: [VOTE] Release 0.13.0, release candidate #1

2023-01-30 Thread Y Ethan Guo
ull/7783 > > > > On Mon, 30 Jan 2023 at 09:28, Y Ethan Guo wrote: > > > Hey Daniel, > > > > Thanks for pointing this out. I'm cleaning up the JIRA tickets and > moving > > ones not addressed to the next release, so that the JIRA release notes > are >

Re: [VOTE] Release 0.13.0, release candidate #1

2023-01-30 Thread Y Ethan Guo
Given these signals, let's abandon RC1 and move on to RC2. On Mon, Jan 30, 2023 at 5:50 PM Y Ethan Guo wrote: > Another -1 on RC1, from my side. > > There are a couple of blocking issues in "hudi-cli-bundle": > - https://github.com/apache/hudi/pull/7790 > - https://gi

Re: 0.13.0 release timeline

2023-01-27 Thread Y Ethan Guo
Hi folks, I've already cut the branch <https://github.com/apache/hudi/tree/release-0.13.0> for 0.13.0 RC1. I'm preparing the artifacts and deploying them to the staging area. Once done, I'll put up a voting thread. Thanks, - Ethan On Mon, Jan 9, 2023 at 10:24 PM Y Ethan Guo wrote:

[VOTE] Release 0.13.0, release candidate #1

2023-01-28 Thread Y Ethan Guo
Hi everyone, Please review and vote on the release candidate #1 for the version 0.13.0, as follows: [ ] +1, Approve the release [ ] -1, Do not approve the release (please provide specific comments) The complete staging area is available for your review, which includes: * JIRA release notes

0.13.0 release timeline

2022-11-09 Thread Y Ethan Guo
Hi folks, As the RM for the upcoming 0.13.0 release, I'd like to propose the following timeline based on the current progress of major features on our community roadmap: - *Dec 12, Monday, 11:59 PM PST*: 0.13.0 code freeze - *Dec 13/14, PST*: Cut release branch and start RC1 voting All the

[ANNOUNCE] Apache Hudi 0.13.0 released

2023-02-25 Thread Y Ethan Guo
The Apache Hudi team is pleased to announce the release of Apache Hudi 0.13.0. This release has been a huge community effort with 737 commits from 71 contributors across the globe. Apache Hudi (pronounced Hoodie) stands for Hadoop Upserts Deletes and Incrementals. Apache Hudi manages storage of

Re: Apply to be a Hudi contributor

2023-02-27 Thread Y Ethan Guo
Hi Shilun, Thanks for your interest! I've added you as a Hudi contributor in JIRA. Best, - Ethan On Sun, Feb 26, 2023 at 5:15 PM Shilun Fan wrote: > Hi, I'm Shilun, > > > I'm using the Hudi project, I hope I can make some > contributions during the use process, apply to join hudi

Re: Request to Hudi Contributor and Jira Access

2023-03-02 Thread Y Ethan Guo
re says > that > > > the user should receive an email with account details. How you help me > > with > > > the steps to access the jira account > > > > > > Thanks > > > Bala Mahesh. > > > > > > On Wed, Mar 1, 2023 at 1:29 AM

Calling for 0.13.1 Release

2023-03-03 Thread Y Ethan Guo
Hi folks, Given that we have already found a few critical issues affecting 0.13.0 release, such as the following, I suggest that we, as a community, follow up with 0.13.1 release in a month to address reliability issues in 0.13.0. Any volunteer for 0.13.1 Release Manager is welcome.

Re: Proposal for 0.12.3 hudi release

2023-03-03 Thread Y Ethan Guo
+1 On Thu, Mar 2, 2023 at 7:19 PM Sivabalan wrote: > Hey folks, > Since we wanted to maintain LTS with 0.12.x, can we start discussing > about 0.12.3. We can pick very critical fixes that went into master after > 0.12.2, we can get it into 0.12.3. > > -- > Regards, > -Sivabalan >
