Re: Welcome our PMC Member, Raymond Xu
Congratulations, Raymond! -Navinder On Saturday, July 17, 2021, 9:37 AM, Nishith wrote: Congratulations Raymond! Huge shout out for your valuable contributions! > On Jul 16, 2021, at 5:28 PM, Vinoth Chandar wrote: > > Folks, > > I am incredibly happy to share the addition of Raymond Xu to the Hudi PMC. > Raymond has been a valuable member of our community, over the past few > years now. Always hustlin and taking on the most underappreciated, but > extremely valuable aspects of the project, mostly recently with getting our > tests working smoothly on Azure CI! > > Please join me in congratulating Raymond! > > Onwards, > Vinoth
Re: Welcome New Committers: Pengzhiwei and DannyChan
Congratulations guys!! On Saturday, July 17, 2021, 7:25 AM, Udit Mehrotra wrote: Congratulations guys ! Thanks for your amazon contributions and well deserved :) On Fri, Jul 16, 2021 at 5:40 PM Nishith wrote: > Congratulations to the both of you! Very well deserved! Onwards! > > -Nishith > > > On Jul 16, 2021, at 5:30 PM, Vinoth Chandar wrote: > > > > Congrats both! Your impact is amazing! > > More miles to travel. Looking forward > > > >> On Fri, Jul 16, 2021 at 4:43 PM 18717838093 <18717838...@126.com> > wrote: > >> > >> Congratulations! Well deserved! > >> > >> > >> > >> | | > >> 18717838093 > >> | > >> | > >> 18717838...@126.com > >> | > >> 签名由网易邮箱大师定制 > >> > >> > >> On 07/16/2021 19:50,wangxianghu wrote: > >> Congratulations! Well deserved! > >> > >> 2021年7月16日 下午6:52,vino yang 写道: > >> > >> Congratulation to both of you! Well deserved! > >> > >> Best, > >> Vino > >> > >> leesf mailto:leesf0...@gmail.com>> 于2021年7月16日周五 > >> 下午6:38写道: > >> Hi all, > >> > >> Please join me in congratulating our newest committers Pengzhiwei and > >> DannyChan. > >> > >> Pengzhiwei has been a consistent contributor to Hudi, he has contributed > >> numerous features to Hudi, such as Spark SQL integration with Hudi, > Spark > >> Structured Streaming Source for Hudi and Spark FileIndex for Hudi and > also > >> lots of other good contributions around Spark, and also very active to > >> answer users's questions. He is a solid team player and an asset to the > >> project. > >> > >> DannyChan has contributed many good features, such as new streaming > write > >> pipeline for Flink with automatic compaction and cleaning (COW and MOR), > >> batch and streaming reader for Flink (COW and MOR) and support Flink SQL > >> connectors (reader and writer), he is actively join the ML and answer > >> users' questions as well as wrote a Hudi Flink integration guide and > >> launched a live show to promote Hudi Flink integration for Chinese > users. > >> > >> Thanks so much for your continued contributions to make Hudi better and > >> better! > >> > >> Also I would like to introduce the current state of Hudi in China. Hudi > >> becomes more and more popular in China with the help of all community > >> members and has been adopted by almost all top companies in China, > >> including Alibaba, Baidu, ByteDance, Huawei, Tencent and other > companies, > >> from startups to large companies, data scale from TB to PB. You would > find > >> the logo wall below(PS: unofficial statistics, just listed some of them > and > >> you can contact me to add your company logo if wanted). > >> > >> We would not achieve this without such a good community and the > >> contribution of all community members. Cheers and Go! > >> > >> > >> > >> Thanks, > >> Leesf > >> > >> >
Re: Welcome our PMC Member, Raymond Xu
Congratulations Raymond! Huge shout out for your valuable contributions! > On Jul 16, 2021, at 5:28 PM, Vinoth Chandar wrote: > > Folks, > > I am incredibly happy to share the addition of Raymond Xu to the Hudi PMC. > Raymond has been a valuable member of our community, over the past few > years now. Always hustlin and taking on the most underappreciated, but > extremely valuable aspects of the project, mostly recently with getting our > tests working smoothly on Azure CI! > > Please join me in congratulating Raymond! > > Onwards, > Vinoth
Re: [DISCUSS] Consolidate all dev collaboration to Github
+1 for option B. On Sat, Jul 17, 2021 at 9:51 AM Udit Mehrotra wrote: > +1 for option B. For A, I will need more data points to convince myself if > GitHub issues will provide all the issue tracking functionality that Jira > provides today. > > Thanks, > Udit > > On Fri, Jul 16, 2021 at 2:33 PM Vinoth Chandar wrote: > > > Looks like we can start with B has a lot of support. > > I will start a VOTE on B alone and we can proceed if the VOTE passes. > > > > On Fri, Jul 16, 2021 at 8:05 AM Nishith wrote: > > > > > +1 for option B. > > > > > > > On Jul 15, 2021, at 10:50 PM, Bhavani Sudha > > > > wrote: > > > > > > > > Completely agree on B. On A I feel the necessity to centralize > > > everything > > > > in one place but also without losing the capabilities of Jira. I > think > > we > > > > will have to explore tools in eitherways. > > > > > > > > Thanks, > > > > Sudha > > > > > > > >> On Thu, Jul 15, 2021 at 10:42 PM vino yang > > > wrote: > > > >> > > > >> +1 for option B. > > > >> > > > >> Best, > > > >> Vino > > > >> > > > >> Sivabalan 于2021年7月16日周五 上午10:35写道: > > > >> > > > >>> +1 on B. Not sure on A though. I understand the intent to have all > in > > > >>> one place. but not very sure if we can get all functionality > > (version, > > > >>> type, component, status, parent- child relation), etc ported over > to > > > >>> github. I assume labels are the only option we have to achieve > these. > > > >>> Probably, we should also document the labels in detail so that > anyone > > > >>> looking to take a look at untriaged issues should know how/where to > > > look > > > >>> at. If we plan to use GH issues for all, I am sure there will be a > > lot > > > of > > > >>> proliferation of issues. > > > >>> > > > >>> On Fri, Jul 9, 2021 at 12:29 PM Vinoth Chandar > > > >> wrote: > > > >>> > > > Based on this, I will start consolidating more of the cWiki > content > > to > > > github wiki and master branch? > > > > > > JIRA vs GH Issue still probably needs more feedback. I do see the > > > >>> tradeoffs > > > there. > > > > > > On Fri, Jul 9, 2021 at 2:39 AM wei li > > wrote: > > > > > > > +1 > > > > > > > > On 2021/07/02 03:40:51, Vinoth Chandar > wrote: > > > >> Hi all, > > > >> > > > >> When we incubated Hudi, we made some initial choices around > > > collaboration > > > >> tools of choice. I am wondering if there are still optimal, > given > > > >> the > > > > scale > > > >> of the community at this point. > > > >> > > > >> Specifically, two points. > > > >> > > > >> A) Our issue tracker is JIRA, while we just use Github Issues > for > > > support > > > >> triage. While JIRA is pretty advanced and gives us the ability > to > > > >>> track > > > >> releases, versions and kanban boards, there are few practical > > > operational > > > >> problems. > > > >> > > > >> - Developers often open bug fixes/PR which all need to be > > > >>> continuously > > > >> tagged against a release version (fix version) > > > >> - Referencing JIRAs from Pull Requests is great (we cannot do > > > >> things > > > like > > > >> `fixes #1234` to close issues when PR lands, not an easy way to > > > >> click > > > and > > > >> get to the JIRA) > > > >> - Many more developers have a github account, to contribute to > > Hudi > > > > though, > > > >> they need an additional sign-up on jira. > > > >> > > > >> So wondering if we should just use one thing - Github Issues, > and > > > >>> build > > > >> scripts/hubot or something to get the missing project management > > > >> from > > > >> boards. > > > >> > > > >> B) Our design docs are on cWiki. Even though we link it off the > > > >> site, > > > > from > > > >> my experience, many do not discover them. > > > >> For large PRs, we need to manually enforce that design and code > > are > > > >>> in > > > > sync > > > >> before we land. If we can, I would love to make RFC being in > good > > > shape a > > > >> pre-requisite for landing the PR. > > > >> Once again, separate signup is needed to write design docs or > > > >> comment > > > on > > > >> them. > > > >> > > > >> So, wondering if we can move our process docs etc into Github > Wiki > > > >>> and > > > > RFCs > > > >> to the master branch in a rfc folder, and we just use github PRs > > to > > > raise > > > >> RFCs and discuss them. > > > >> > > > >> This all also makes it easy for us to measure community activity > > > >> and > > > keep > > > >> streamlining our processes. > > > >> > > > >> personally, these different channels are overwhelming to me > > > >> at-least > > > >>> :) > > > >> > > > >> Love to hear thoughts. Please specify if you are for,against > each > > > >> of > > > >>> A > > > > and > > > >> B. > > > >> > > > >> > > >
Re: Amazon Athena expands Apache Hudi Support
Thanks for sharing the great news Udit. Exciting to see the expansion of the Hudi ecosystem. We are seeing a large number of cooperations and public cloud providers adopt Hudi this year. The future of the project is definitely promising. Best, Gary On Sat, Jul 17, 2021 at 10:14 AM Rubens Rodrigues wrote: > Great News for Hudi community. > > Em sex., 16 de jul. de 2021 23:07, Udit Mehrotra > escreveu: > > > Hi Folks, > > > > Happy to announce that Amazon Athena has now upgraded to the latest Hudi > > 0.8.0 release. In addition, Athena now supports two additional features: > > > >- Snapshot/Real time query support for Merge on Read tables > >- Query support for tables created with *BOOTSTRAP* operation > > > > Following are the public documentation for the new supports: > > > >- What’s new: > > > > > https://aws.amazon.com/about-aws/whats-new/2021/07/amazon-athena-expands-apache-hudi-support/ > >- Updated Athena Hudi usage AWS doc: > >https://docs.aws.amazon.com/athena/latest/ug/querying-hudi.html > > > > Thanks, > > Udit Mehrotra > > SDE | AWS EMR > > >
Re: Welcome our PMC Member, Raymond Xu
Congrats Raymond! Best, Gary On Sat, Jul 17, 2021 at 9:30 AM vino yang wrote: > Congrats! Well deserved! > > Best, > Vino > > Vinoth Chandar 于2021年7月17日周六 上午8:28写道: > >> Folks, >> >> I am incredibly happy to share the addition of Raymond Xu to the Hudi >> PMC. Raymond has been a valuable member of our community, over the past few >> years now. Always hustlin and taking on the most underappreciated, but >> extremely valuable aspects of the project, mostly recently with getting our >> tests working smoothly on Azure CI! >> >> Please join me in congratulating Raymond! >> >> Onwards, >> Vinoth >> >
Re: Amazon Athena expands Apache Hudi Support
Great News for Hudi community. Em sex., 16 de jul. de 2021 23:07, Udit Mehrotra escreveu: > Hi Folks, > > Happy to announce that Amazon Athena has now upgraded to the latest Hudi > 0.8.0 release. In addition, Athena now supports two additional features: > >- Snapshot/Real time query support for Merge on Read tables >- Query support for tables created with *BOOTSTRAP* operation > > Following are the public documentation for the new supports: > >- What’s new: > > https://aws.amazon.com/about-aws/whats-new/2021/07/amazon-athena-expands-apache-hudi-support/ >- Updated Athena Hudi usage AWS doc: >https://docs.aws.amazon.com/athena/latest/ug/querying-hudi.html > > Thanks, > Udit Mehrotra > SDE | AWS EMR >
Re: Amazon Athena expands Apache Hudi Support
Thats awesome news. Thanks for sharing Udit. - Sudha On Fri, Jul 16, 2021 at 7:07 PM Udit Mehrotra wrote: > Hi Folks, > > Happy to announce that Amazon Athena has now upgraded to the latest Hudi > 0.8.0 release. In addition, Athena now supports two additional features: > >- Snapshot/Real time query support for Merge on Read tables >- Query support for tables created with *BOOTSTRAP* operation > > Following are the public documentation for the new supports: > >- What’s new: > > https://aws.amazon.com/about-aws/whats-new/2021/07/amazon-athena-expands-apache-hudi-support/ >- Updated Athena Hudi usage AWS doc: >https://docs.aws.amazon.com/athena/latest/ug/querying-hudi.html > > Thanks, > Udit Mehrotra > SDE | AWS EMR >
Amazon Athena expands Apache Hudi Support
Hi Folks, Happy to announce that Amazon Athena has now upgraded to the latest Hudi 0.8.0 release. In addition, Athena now supports two additional features: - Snapshot/Real time query support for Merge on Read tables - Query support for tables created with *BOOTSTRAP* operation Following are the public documentation for the new supports: - What’s new: https://aws.amazon.com/about-aws/whats-new/2021/07/amazon-athena-expands-apache-hudi-support/ - Updated Athena Hudi usage AWS doc: https://docs.aws.amazon.com/athena/latest/ug/querying-hudi.html Thanks, Udit Mehrotra SDE | AWS EMR
Re: Welcome New Committers: Pengzhiwei and DannyChan
Congratulations guys ! Thanks for your amazon contributions and well deserved :) On Fri, Jul 16, 2021 at 5:40 PM Nishith wrote: > Congratulations to the both of you! Very well deserved! Onwards! > > -Nishith > > > On Jul 16, 2021, at 5:30 PM, Vinoth Chandar wrote: > > > > Congrats both! Your impact is amazing! > > More miles to travel. Looking forward > > > >> On Fri, Jul 16, 2021 at 4:43 PM 18717838093 <18717838...@126.com> > wrote: > >> > >> Congratulations! Well deserved! > >> > >> > >> > >> | | > >> 18717838093 > >> | > >> | > >> 18717838...@126.com > >> | > >> 签名由网易邮箱大师定制 > >> > >> > >> On 07/16/2021 19:50,wangxianghu wrote: > >> Congratulations! Well deserved! > >> > >> 2021年7月16日 下午6:52,vino yang 写道: > >> > >> Congratulation to both of you! Well deserved! > >> > >> Best, > >> Vino > >> > >> leesf mailto:leesf0...@gmail.com>> 于2021年7月16日周五 > >> 下午6:38写道: > >> Hi all, > >> > >> Please join me in congratulating our newest committers Pengzhiwei and > >> DannyChan. > >> > >> Pengzhiwei has been a consistent contributor to Hudi, he has contributed > >> numerous features to Hudi, such as Spark SQL integration with Hudi, > Spark > >> Structured Streaming Source for Hudi and Spark FileIndex for Hudi and > also > >> lots of other good contributions around Spark, and also very active to > >> answer users's questions. He is a solid team player and an asset to the > >> project. > >> > >> DannyChan has contributed many good features, such as new streaming > write > >> pipeline for Flink with automatic compaction and cleaning (COW and MOR), > >> batch and streaming reader for Flink (COW and MOR) and support Flink SQL > >> connectors (reader and writer), he is actively join the ML and answer > >> users' questions as well as wrote a Hudi Flink integration guide and > >> launched a live show to promote Hudi Flink integration for Chinese > users. > >> > >> Thanks so much for your continued contributions to make Hudi better and > >> better! > >> > >> Also I would like to introduce the current state of Hudi in China. Hudi > >> becomes more and more popular in China with the help of all community > >> members and has been adopted by almost all top companies in China, > >> including Alibaba, Baidu, ByteDance, Huawei, Tencent and other > companies, > >> from startups to large companies, data scale from TB to PB. You would > find > >> the logo wall below(PS: unofficial statistics, just listed some of them > and > >> you can contact me to add your company logo if wanted). > >> > >> We would not achieve this without such a good community and the > >> contribution of all community members. Cheers and Go! > >> > >> > >> > >> Thanks, > >> Leesf > >> > >> >
Re: [DISCUSS] Consolidate all dev collaboration to Github
+1 for option B. For A, I will need more data points to convince myself if GitHub issues will provide all the issue tracking functionality that Jira provides today. Thanks, Udit On Fri, Jul 16, 2021 at 2:33 PM Vinoth Chandar wrote: > Looks like we can start with B has a lot of support. > I will start a VOTE on B alone and we can proceed if the VOTE passes. > > On Fri, Jul 16, 2021 at 8:05 AM Nishith wrote: > > > +1 for option B. > > > > > On Jul 15, 2021, at 10:50 PM, Bhavani Sudha > > wrote: > > > > > > Completely agree on B. On A I feel the necessity to centralize > > everything > > > in one place but also without losing the capabilities of Jira. I think > we > > > will have to explore tools in eitherways. > > > > > > Thanks, > > > Sudha > > > > > >> On Thu, Jul 15, 2021 at 10:42 PM vino yang > > wrote: > > >> > > >> +1 for option B. > > >> > > >> Best, > > >> Vino > > >> > > >> Sivabalan 于2021年7月16日周五 上午10:35写道: > > >> > > >>> +1 on B. Not sure on A though. I understand the intent to have all in > > >>> one place. but not very sure if we can get all functionality > (version, > > >>> type, component, status, parent- child relation), etc ported over to > > >>> github. I assume labels are the only option we have to achieve these. > > >>> Probably, we should also document the labels in detail so that anyone > > >>> looking to take a look at untriaged issues should know how/where to > > look > > >>> at. If we plan to use GH issues for all, I am sure there will be a > lot > > of > > >>> proliferation of issues. > > >>> > > >>> On Fri, Jul 9, 2021 at 12:29 PM Vinoth Chandar > > >> wrote: > > >>> > > Based on this, I will start consolidating more of the cWiki content > to > > github wiki and master branch? > > > > JIRA vs GH Issue still probably needs more feedback. I do see the > > >>> tradeoffs > > there. > > > > On Fri, Jul 9, 2021 at 2:39 AM wei li > wrote: > > > > > +1 > > > > > > On 2021/07/02 03:40:51, Vinoth Chandar wrote: > > >> Hi all, > > >> > > >> When we incubated Hudi, we made some initial choices around > > collaboration > > >> tools of choice. I am wondering if there are still optimal, given > > >> the > > > scale > > >> of the community at this point. > > >> > > >> Specifically, two points. > > >> > > >> A) Our issue tracker is JIRA, while we just use Github Issues for > > support > > >> triage. While JIRA is pretty advanced and gives us the ability to > > >>> track > > >> releases, versions and kanban boards, there are few practical > > operational > > >> problems. > > >> > > >> - Developers often open bug fixes/PR which all need to be > > >>> continuously > > >> tagged against a release version (fix version) > > >> - Referencing JIRAs from Pull Requests is great (we cannot do > > >> things > > like > > >> `fixes #1234` to close issues when PR lands, not an easy way to > > >> click > > and > > >> get to the JIRA) > > >> - Many more developers have a github account, to contribute to > Hudi > > > though, > > >> they need an additional sign-up on jira. > > >> > > >> So wondering if we should just use one thing - Github Issues, and > > >>> build > > >> scripts/hubot or something to get the missing project management > > >> from > > >> boards. > > >> > > >> B) Our design docs are on cWiki. Even though we link it off the > > >> site, > > > from > > >> my experience, many do not discover them. > > >> For large PRs, we need to manually enforce that design and code > are > > >>> in > > > sync > > >> before we land. If we can, I would love to make RFC being in good > > shape a > > >> pre-requisite for landing the PR. > > >> Once again, separate signup is needed to write design docs or > > >> comment > > on > > >> them. > > >> > > >> So, wondering if we can move our process docs etc into Github Wiki > > >>> and > > > RFCs > > >> to the master branch in a rfc folder, and we just use github PRs > to > > raise > > >> RFCs and discuss them. > > >> > > >> This all also makes it easy for us to measure community activity > > >> and > > keep > > >> streamlining our processes. > > >> > > >> personally, these different channels are overwhelming to me > > >> at-least > > >>> :) > > >> > > >> Love to hear thoughts. Please specify if you are for,against each > > >> of > > >>> A > > > and > > >> B. > > >> > > >> > > >> Thanks > > >> Vinoth > > >> > > > > > > > >>> > > >>> > > >>> -- > > >>> Regards, > > >>> -Sivabalan > > >>> > > >> > > >
Re: Welcome our PMC Member, Raymond Xu
Congrats! Well deserved! Best, Vino Vinoth Chandar 于2021年7月17日周六 上午8:28写道: > Folks, > > I am incredibly happy to share the addition of Raymond Xu to the Hudi PMC. > Raymond has been a valuable member of our community, over the past few > years now. Always hustlin and taking on the most underappreciated, but > extremely valuable aspects of the project, mostly recently with getting our > tests working smoothly on Azure CI! > > Please join me in congratulating Raymond! > > Onwards, > Vinoth >
Re: Welcome New Committers: Pengzhiwei and DannyChan
Congratulations to the both of you! Very well deserved! Onwards! -Nishith > On Jul 16, 2021, at 5:30 PM, Vinoth Chandar wrote: > > Congrats both! Your impact is amazing! > More miles to travel. Looking forward > >> On Fri, Jul 16, 2021 at 4:43 PM 18717838093 <18717838...@126.com> wrote: >> >> Congratulations! Well deserved! >> >> >> >> | | >> 18717838093 >> | >> | >> 18717838...@126.com >> | >> 签名由网易邮箱大师定制 >> >> >> On 07/16/2021 19:50,wangxianghu wrote: >> Congratulations! Well deserved! >> >> 2021年7月16日 下午6:52,vino yang 写道: >> >> Congratulation to both of you! Well deserved! >> >> Best, >> Vino >> >> leesf mailto:leesf0...@gmail.com>> 于2021年7月16日周五 >> 下午6:38写道: >> Hi all, >> >> Please join me in congratulating our newest committers Pengzhiwei and >> DannyChan. >> >> Pengzhiwei has been a consistent contributor to Hudi, he has contributed >> numerous features to Hudi, such as Spark SQL integration with Hudi, Spark >> Structured Streaming Source for Hudi and Spark FileIndex for Hudi and also >> lots of other good contributions around Spark, and also very active to >> answer users's questions. He is a solid team player and an asset to the >> project. >> >> DannyChan has contributed many good features, such as new streaming write >> pipeline for Flink with automatic compaction and cleaning (COW and MOR), >> batch and streaming reader for Flink (COW and MOR) and support Flink SQL >> connectors (reader and writer), he is actively join the ML and answer >> users' questions as well as wrote a Hudi Flink integration guide and >> launched a live show to promote Hudi Flink integration for Chinese users. >> >> Thanks so much for your continued contributions to make Hudi better and >> better! >> >> Also I would like to introduce the current state of Hudi in China. Hudi >> becomes more and more popular in China with the help of all community >> members and has been adopted by almost all top companies in China, >> including Alibaba, Baidu, ByteDance, Huawei, Tencent and other companies, >> from startups to large companies, data scale from TB to PB. You would find >> the logo wall below(PS: unofficial statistics, just listed some of them and >> you can contact me to add your company logo if wanted). >> >> We would not achieve this without such a good community and the >> contribution of all community members. Cheers and Go! >> >> >> >> Thanks, >> Leesf >> >>
Re: Welcome New Committers: Pengzhiwei and DannyChan
Congrats both! Your impact is amazing! More miles to travel. Looking forward On Fri, Jul 16, 2021 at 4:43 PM 18717838093 <18717838...@126.com> wrote: > Congratulations! Well deserved! > > > > | | > 18717838093 > | > | > 18717838...@126.com > | > 签名由网易邮箱大师定制 > > > On 07/16/2021 19:50,wangxianghu wrote: > Congratulations! Well deserved! > > 2021年7月16日 下午6:52,vino yang 写道: > > Congratulation to both of you! Well deserved! > > Best, > Vino > > leesf mailto:leesf0...@gmail.com>> 于2021年7月16日周五 > 下午6:38写道: > Hi all, > > Please join me in congratulating our newest committers Pengzhiwei and > DannyChan. > > Pengzhiwei has been a consistent contributor to Hudi, he has contributed > numerous features to Hudi, such as Spark SQL integration with Hudi, Spark > Structured Streaming Source for Hudi and Spark FileIndex for Hudi and also > lots of other good contributions around Spark, and also very active to > answer users's questions. He is a solid team player and an asset to the > project. > > DannyChan has contributed many good features, such as new streaming write > pipeline for Flink with automatic compaction and cleaning (COW and MOR), > batch and streaming reader for Flink (COW and MOR) and support Flink SQL > connectors (reader and writer), he is actively join the ML and answer > users' questions as well as wrote a Hudi Flink integration guide and > launched a live show to promote Hudi Flink integration for Chinese users. > > Thanks so much for your continued contributions to make Hudi better and > better! > > Also I would like to introduce the current state of Hudi in China. Hudi > becomes more and more popular in China with the help of all community > members and has been adopted by almost all top companies in China, > including Alibaba, Baidu, ByteDance, Huawei, Tencent and other companies, > from startups to large companies, data scale from TB to PB. You would find > the logo wall below(PS: unofficial statistics, just listed some of them and > you can contact me to add your company logo if wanted). > > We would not achieve this without such a good community and the > contribution of all community members. Cheers and Go! > > > > Thanks, > Leesf > >
Welcome our PMC Member, Raymond Xu
Folks, I am incredibly happy to share the addition of Raymond Xu to the Hudi PMC. Raymond has been a valuable member of our community, over the past few years now. Always hustlin and taking on the most underappreciated, but extremely valuable aspects of the project, mostly recently with getting our tests working smoothly on Azure CI! Please join me in congratulating Raymond! Onwards, Vinoth
Re: Welcome New Committers: Pengzhiwei and DannyChan
Congratulations! Well deserved! | | 18717838093 | | 18717838...@126.com | 签名由网易邮箱大师定制 On 07/16/2021 19:50,wangxianghu wrote: Congratulations! Well deserved! 2021年7月16日 下午6:52,vino yang 写道: Congratulation to both of you! Well deserved! Best, Vino leesf mailto:leesf0...@gmail.com>> 于2021年7月16日周五 下午6:38写道: Hi all, Please join me in congratulating our newest committers Pengzhiwei and DannyChan. Pengzhiwei has been a consistent contributor to Hudi, he has contributed numerous features to Hudi, such as Spark SQL integration with Hudi, Spark Structured Streaming Source for Hudi and Spark FileIndex for Hudi and also lots of other good contributions around Spark, and also very active to answer users's questions. He is a solid team player and an asset to the project. DannyChan has contributed many good features, such as new streaming write pipeline for Flink with automatic compaction and cleaning (COW and MOR), batch and streaming reader for Flink (COW and MOR) and support Flink SQL connectors (reader and writer), he is actively join the ML and answer users' questions as well as wrote a Hudi Flink integration guide and launched a live show to promote Hudi Flink integration for Chinese users. Thanks so much for your continued contributions to make Hudi better and better! Also I would like to introduce the current state of Hudi in China. Hudi becomes more and more popular in China with the help of all community members and has been adopted by almost all top companies in China, including Alibaba, Baidu, ByteDance, Huawei, Tencent and other companies, from startups to large companies, data scale from TB to PB. You would find the logo wall below(PS: unofficial statistics, just listed some of them and you can contact me to add your company logo if wanted). We would not achieve this without such a good community and the contribution of all community members. Cheers and Go! Thanks, Leesf
Re: Welcome New Committers: Pengzhiwei and DannyChan
Congrats Zhiwei and Danny! Best, Raymond On Fri, Jul 16, 2021 at 3:26 PM vbal...@apache.org wrote: > Many Congratulations to both of you !! Great contributions. Well deserved > !! > Balaji.V > On Friday, July 16, 2021, 03:08:44 PM PDT, Bhavani Sudha < > bhavanisud...@gmail.com> wrote: > > Big congratulations to both of you. Very well deserved! > > Cheers, > Sudha > > On Fri, Jul 16, 2021 at 8:56 AM Sivabalan wrote: > > > Not to hijack the limelight from *Pengzhiwei *and* DannyChan.* btw, Big > > Kudos to the Chinese community at large. Great adoption and good going :) > > Really excited for the future of Hudi across the globe ! :) btw, fyi, I > > don't get to see the image you attached leesf. > > > > > > On Fri, Jul 16, 2021 at 11:29 AM Sivabalan wrote: > > > > > Congrats guys! Well deserved. > > > > > > On Fri, Jul 16, 2021 at 9:12 AM Gary Li wrote: > > > > > >> Congrats Zhiwei and Danny! It's awesome to work with you guys. > > >> > > >> Best, > > >> Gary > > >> > > >> > > >> On Fri, Jul 16, 2021 at 7:55 PM wangxianghu wrote: > > >> > > >> > Congratulations!well deserved ! > > >> > > > >> > > 在 2021年7月16日,18:52,vino yang 写道: > > >> > > > > >> > > Congratulation > > >> > > > >> > > > >> > > > -- > > > Regards, > > > -Sivabalan > > > > > > > > > -- > > Regards, > > -Sivabalan > > >
Re: Welcome New Committers: Pengzhiwei and DannyChan
Many Congratulations to both of you !! Great contributions. Well deserved !! Balaji.V On Friday, July 16, 2021, 03:08:44 PM PDT, Bhavani Sudha wrote: Big congratulations to both of you. Very well deserved! Cheers, Sudha On Fri, Jul 16, 2021 at 8:56 AM Sivabalan wrote: > Not to hijack the limelight from *Pengzhiwei *and* DannyChan.* btw, Big > Kudos to the Chinese community at large. Great adoption and good going :) > Really excited for the future of Hudi across the globe ! :) btw, fyi, I > don't get to see the image you attached leesf. > > > On Fri, Jul 16, 2021 at 11:29 AM Sivabalan wrote: > > > Congrats guys! Well deserved. > > > > On Fri, Jul 16, 2021 at 9:12 AM Gary Li wrote: > > > >> Congrats Zhiwei and Danny! It's awesome to work with you guys. > >> > >> Best, > >> Gary > >> > >> > >> On Fri, Jul 16, 2021 at 7:55 PM wangxianghu wrote: > >> > >> > Congratulations!well deserved ! > >> > > >> > > 在 2021年7月16日,18:52,vino yang 写道: > >> > > > >> > > Congratulation > >> > > >> > > >> > > -- > > Regards, > > -Sivabalan > > > > > -- > Regards, > -Sivabalan >
Re: Welcome New Committers: Pengzhiwei and DannyChan
Big congratulations to both of you. Very well deserved! Cheers, Sudha On Fri, Jul 16, 2021 at 8:56 AM Sivabalan wrote: > Not to hijack the limelight from *Pengzhiwei *and* DannyChan.* btw, Big > Kudos to the Chinese community at large. Great adoption and good going :) > Really excited for the future of Hudi across the globe ! :) btw, fyi, I > don't get to see the image you attached leesf. > > > On Fri, Jul 16, 2021 at 11:29 AM Sivabalan wrote: > > > Congrats guys! Well deserved. > > > > On Fri, Jul 16, 2021 at 9:12 AM Gary Li wrote: > > > >> Congrats Zhiwei and Danny! It's awesome to work with you guys. > >> > >> Best, > >> Gary > >> > >> > >> On Fri, Jul 16, 2021 at 7:55 PM wangxianghu wrote: > >> > >> > Congratulations!well deserved ! > >> > > >> > > 在 2021年7月16日,18:52,vino yang 写道: > >> > > > >> > > Congratulation > >> > > >> > > >> > > -- > > Regards, > > -Sivabalan > > > > > -- > Regards, > -Sivabalan >
Re: [DISCUSS] Consolidate all dev collaboration to Github
Looks like we can start with B has a lot of support. I will start a VOTE on B alone and we can proceed if the VOTE passes. On Fri, Jul 16, 2021 at 8:05 AM Nishith wrote: > +1 for option B. > > > On Jul 15, 2021, at 10:50 PM, Bhavani Sudha > wrote: > > > > Completely agree on B. On A I feel the necessity to centralize > everything > > in one place but also without losing the capabilities of Jira. I think we > > will have to explore tools in eitherways. > > > > Thanks, > > Sudha > > > >> On Thu, Jul 15, 2021 at 10:42 PM vino yang > wrote: > >> > >> +1 for option B. > >> > >> Best, > >> Vino > >> > >> Sivabalan 于2021年7月16日周五 上午10:35写道: > >> > >>> +1 on B. Not sure on A though. I understand the intent to have all in > >>> one place. but not very sure if we can get all functionality (version, > >>> type, component, status, parent- child relation), etc ported over to > >>> github. I assume labels are the only option we have to achieve these. > >>> Probably, we should also document the labels in detail so that anyone > >>> looking to take a look at untriaged issues should know how/where to > look > >>> at. If we plan to use GH issues for all, I am sure there will be a lot > of > >>> proliferation of issues. > >>> > >>> On Fri, Jul 9, 2021 at 12:29 PM Vinoth Chandar > >> wrote: > >>> > Based on this, I will start consolidating more of the cWiki content to > github wiki and master branch? > > JIRA vs GH Issue still probably needs more feedback. I do see the > >>> tradeoffs > there. > > On Fri, Jul 9, 2021 at 2:39 AM wei li wrote: > > > +1 > > > > On 2021/07/02 03:40:51, Vinoth Chandar wrote: > >> Hi all, > >> > >> When we incubated Hudi, we made some initial choices around > collaboration > >> tools of choice. I am wondering if there are still optimal, given > >> the > > scale > >> of the community at this point. > >> > >> Specifically, two points. > >> > >> A) Our issue tracker is JIRA, while we just use Github Issues for > support > >> triage. While JIRA is pretty advanced and gives us the ability to > >>> track > >> releases, versions and kanban boards, there are few practical > operational > >> problems. > >> > >> - Developers often open bug fixes/PR which all need to be > >>> continuously > >> tagged against a release version (fix version) > >> - Referencing JIRAs from Pull Requests is great (we cannot do > >> things > like > >> `fixes #1234` to close issues when PR lands, not an easy way to > >> click > and > >> get to the JIRA) > >> - Many more developers have a github account, to contribute to Hudi > > though, > >> they need an additional sign-up on jira. > >> > >> So wondering if we should just use one thing - Github Issues, and > >>> build > >> scripts/hubot or something to get the missing project management > >> from > >> boards. > >> > >> B) Our design docs are on cWiki. Even though we link it off the > >> site, > > from > >> my experience, many do not discover them. > >> For large PRs, we need to manually enforce that design and code are > >>> in > > sync > >> before we land. If we can, I would love to make RFC being in good > shape a > >> pre-requisite for landing the PR. > >> Once again, separate signup is needed to write design docs or > >> comment > on > >> them. > >> > >> So, wondering if we can move our process docs etc into Github Wiki > >>> and > > RFCs > >> to the master branch in a rfc folder, and we just use github PRs to > raise > >> RFCs and discuss them. > >> > >> This all also makes it easy for us to measure community activity > >> and > keep > >> streamlining our processes. > >> > >> personally, these different channels are overwhelming to me > >> at-least > >>> :) > >> > >> Love to hear thoughts. Please specify if you are for,against each > >> of > >>> A > > and > >> B. > >> > >> > >> Thanks > >> Vinoth > >> > > > > >>> > >>> > >>> -- > >>> Regards, > >>> -Sivabalan > >>> > >> >
Re: Welcome New Committers: Pengzhiwei and DannyChan
Not to hijack the limelight from *Pengzhiwei *and* DannyChan.* btw, Big Kudos to the Chinese community at large. Great adoption and good going :) Really excited for the future of Hudi across the globe ! :) btw, fyi, I don't get to see the image you attached leesf. On Fri, Jul 16, 2021 at 11:29 AM Sivabalan wrote: > Congrats guys! Well deserved. > > On Fri, Jul 16, 2021 at 9:12 AM Gary Li wrote: > >> Congrats Zhiwei and Danny! It's awesome to work with you guys. >> >> Best, >> Gary >> >> >> On Fri, Jul 16, 2021 at 7:55 PM wangxianghu wrote: >> >> > Congratulations!well deserved ! >> > >> > > 在 2021年7月16日,18:52,vino yang 写道: >> > > >> > > Congratulation >> > >> > >> > -- > Regards, > -Sivabalan > -- Regards, -Sivabalan
Re: Welcome New Committers: Pengzhiwei and DannyChan
Congrats guys! Well deserved. On Fri, Jul 16, 2021 at 9:12 AM Gary Li wrote: > Congrats Zhiwei and Danny! It's awesome to work with you guys. > > Best, > Gary > > > On Fri, Jul 16, 2021 at 7:55 PM wangxianghu wrote: > > > Congratulations!well deserved ! > > > > > 在 2021年7月16日,18:52,vino yang 写道: > > > > > > Congratulation > > > > > -- Regards, -Sivabalan
Re: [DISCUSS] Consolidate all dev collaboration to Github
+1 for option B. > On Jul 15, 2021, at 10:50 PM, Bhavani Sudha wrote: > > Completely agree on B. On A I feel the necessity to centralize everything > in one place but also without losing the capabilities of Jira. I think we > will have to explore tools in eitherways. > > Thanks, > Sudha > >> On Thu, Jul 15, 2021 at 10:42 PM vino yang wrote: >> >> +1 for option B. >> >> Best, >> Vino >> >> Sivabalan 于2021年7月16日周五 上午10:35写道: >> >>> +1 on B. Not sure on A though. I understand the intent to have all in >>> one place. but not very sure if we can get all functionality (version, >>> type, component, status, parent- child relation), etc ported over to >>> github. I assume labels are the only option we have to achieve these. >>> Probably, we should also document the labels in detail so that anyone >>> looking to take a look at untriaged issues should know how/where to look >>> at. If we plan to use GH issues for all, I am sure there will be a lot of >>> proliferation of issues. >>> >>> On Fri, Jul 9, 2021 at 12:29 PM Vinoth Chandar >> wrote: >>> Based on this, I will start consolidating more of the cWiki content to github wiki and master branch? JIRA vs GH Issue still probably needs more feedback. I do see the >>> tradeoffs there. On Fri, Jul 9, 2021 at 2:39 AM wei li wrote: > +1 > > On 2021/07/02 03:40:51, Vinoth Chandar wrote: >> Hi all, >> >> When we incubated Hudi, we made some initial choices around collaboration >> tools of choice. I am wondering if there are still optimal, given >> the > scale >> of the community at this point. >> >> Specifically, two points. >> >> A) Our issue tracker is JIRA, while we just use Github Issues for support >> triage. While JIRA is pretty advanced and gives us the ability to >>> track >> releases, versions and kanban boards, there are few practical operational >> problems. >> >> - Developers often open bug fixes/PR which all need to be >>> continuously >> tagged against a release version (fix version) >> - Referencing JIRAs from Pull Requests is great (we cannot do >> things like >> `fixes #1234` to close issues when PR lands, not an easy way to >> click and >> get to the JIRA) >> - Many more developers have a github account, to contribute to Hudi > though, >> they need an additional sign-up on jira. >> >> So wondering if we should just use one thing - Github Issues, and >>> build >> scripts/hubot or something to get the missing project management >> from >> boards. >> >> B) Our design docs are on cWiki. Even though we link it off the >> site, > from >> my experience, many do not discover them. >> For large PRs, we need to manually enforce that design and code are >>> in > sync >> before we land. If we can, I would love to make RFC being in good shape a >> pre-requisite for landing the PR. >> Once again, separate signup is needed to write design docs or >> comment on >> them. >> >> So, wondering if we can move our process docs etc into Github Wiki >>> and > RFCs >> to the master branch in a rfc folder, and we just use github PRs to raise >> RFCs and discuss them. >> >> This all also makes it easy for us to measure community activity >> and keep >> streamlining our processes. >> >> personally, these different channels are overwhelming to me >> at-least >>> :) >> >> Love to hear thoughts. Please specify if you are for,against each >> of >>> A > and >> B. >> >> >> Thanks >> Vinoth >> > >>> >>> >>> -- >>> Regards, >>> -Sivabalan >>> >>
Re: Welcome New Committers: Pengzhiwei and DannyChan
Congrats Zhiwei and Danny! It's awesome to work with you guys. Best, Gary On Fri, Jul 16, 2021 at 7:55 PM wangxianghu wrote: > Congratulations!well deserved ! > > > 在 2021年7月16日,18:52,vino yang 写道: > > > > Congratulation > >
Re: Welcome New Committers: Pengzhiwei and DannyChan
Congratulations!well deserved ! > 在 2021年7月16日,18:52,vino yang 写道: > > Congratulation
Re: Welcome New Committers: Pengzhiwei and DannyChan
Congratulations! Well deserved! > 2021年7月16日 下午6:52,vino yang 写道: > > Congratulation to both of you! Well deserved! > > Best, > Vino > > leesf mailto:leesf0...@gmail.com>> 于2021年7月16日周五 > 下午6:38写道: > Hi all, > > Please join me in congratulating our newest committers Pengzhiwei and > DannyChan. > > Pengzhiwei has been a consistent contributor to Hudi, he has contributed > numerous features to Hudi, such as Spark SQL integration with Hudi, Spark > Structured Streaming Source for Hudi and Spark FileIndex for Hudi and also > lots of other good contributions around Spark, and also very active to answer > users's questions. He is a solid team player and an asset to the project. > > DannyChan has contributed many good features, such as new streaming write > pipeline for Flink with automatic compaction and cleaning (COW and MOR), > batch and streaming reader for Flink (COW and MOR) and support Flink SQL > connectors (reader and writer), he is actively join the ML and answer users' > questions as well as wrote a Hudi Flink integration guide and launched a live > show to promote Hudi Flink integration for Chinese users. > > Thanks so much for your continued contributions to make Hudi better and > better! > > Also I would like to introduce the current state of Hudi in China. Hudi > becomes more and more popular in China with the help of all community members > and has been adopted by almost all top companies in China, including Alibaba, > Baidu, ByteDance, Huawei, Tencent and other companies, from startups to large > companies, data scale from TB to PB. You would find the logo wall below(PS: > unofficial statistics, just listed some of them and you can contact me to add > your company logo if wanted). > > We would not achieve this without such a good community and the contribution > of all community members. Cheers and Go! > > > > Thanks, > Leesf
Re: Welcome New Committers: Pengzhiwei and DannyChan
Congratulation to both of you! Well deserved! Best, Vino leesf 于2021年7月16日周五 下午6:38写道: > Hi all, > > Please join me in congratulating our newest committers *Pengzhiwei *and > * DannyChan.* > > *Pengzhiwei *has been a consistent contributor to Hudi, he has > contributed numerous features to Hudi, such as Spark SQL integration with > Hudi, Spark Structured Streaming Source for Hudi and Spark FileIndex for > Hudi and also lots of other good contributions around Spark, and also very > active to answer users's questions. He is a solid team player and an asset > to the project. > > *DannyChan* has contributed many good features, such as new streaming > write pipeline for Flink with automatic compaction and cleaning (COW and > MOR), batch and streaming reader for Flink (COW and MOR) and support Flink > SQL connectors (reader and writer), he is actively join the ML and > answer users' questions as well as wrote a Hudi Flink integration guide and > launched a live show to promote Hudi Flink integration for Chinese users. > > Thanks so much for your continued contributions to make Hudi better and > better! > > Also I would like to introduce the current state of Hudi in China. Hudi > becomes more and more popular in China with the help of all community > members and has been adopted by almost all top companies in China, > including Alibaba, Baidu, ByteDance, Huawei, Tencent and other companies, > from startups to large companies, data scale from TB to PB. You would find > the logo wall below(PS: *unofficial statistics*, just listed some of them > and you can contact me to add your company logo if wanted). > > We would not achieve this without such a good community and the > contribution of all community members. Cheers and Go! > > [image: poweredby-0706.png] > > Thanks, > Leesf >
Welcome New Committers: Pengzhiwei and DannyChan
Hi all, Please join me in congratulating our newest committers *Pengzhiwei *and * DannyChan.* *Pengzhiwei *has been a consistent contributor to Hudi, he has contributed numerous features to Hudi, such as Spark SQL integration with Hudi, Spark Structured Streaming Source for Hudi and Spark FileIndex for Hudi and also lots of other good contributions around Spark, and also very active to answer users's questions. He is a solid team player and an asset to the project. *DannyChan* has contributed many good features, such as new streaming write pipeline for Flink with automatic compaction and cleaning (COW and MOR), batch and streaming reader for Flink (COW and MOR) and support Flink SQL connectors (reader and writer), he is actively join the ML and answer users' questions as well as wrote a Hudi Flink integration guide and launched a live show to promote Hudi Flink integration for Chinese users. Thanks so much for your continued contributions to make Hudi better and better! Also I would like to introduce the current state of Hudi in China. Hudi becomes more and more popular in China with the help of all community members and has been adopted by almost all top companies in China, including Alibaba, Baidu, ByteDance, Huawei, Tencent and other companies, from startups to large companies, data scale from TB to PB. You would find the logo wall below(PS: *unofficial statistics*, just listed some of them and you can contact me to add your company logo if wanted). We would not achieve this without such a good community and the contribution of all community members. Cheers and Go! [image: poweredby-0706.png] Thanks, Leesf