Re: Draft report to board - Jan 2019

2019-01-10 Thread Jake Maes
LGTM as well.

Thanks, Yi!

-Jake

On Wed, Jan 9, 2019 at 12:41 PM Yi Pan  wrote:

> Thanks! Updated inline accordingly.
>
> -Yi
>
> On Wed, Jan 9, 2019 at 12:32 PM Prateek Maheshwari 
> wrote:
>
> > Thanks for the summary Yi. I'd change: "HDFS based backup/restore of
> > state stores" to "Evaluation for HDFS based backup/restore of state
> > stores" since this was an intern project and is not checked in to
> > master. Otherwise LGTM.
> >
> > Thanks,
> > Prateek
> >
> > On Wed, Jan 9, 2019 at 12:28 PM Yi Pan  wrote:
> > >
> > > Hi, all,
> > >
> > > Our quarterly report is due this Wed (1/9). The following is the draft
> > > report. Please let me know by the end of the day if I missed anything.
> > > Thanks!
> > >
> > > ## Description:
> > >
> > >  - Apache Samza is a distributed stream processing engine that are
> highly
> > >
> > >configurable to process events from various data sources, including
> > >
> > >real-time messaging system (e.g. Kafka) and distributed file systems
> > > (e.g.
> > >
> > >HDFS).
> > >
> > >
> > >
> > > ## Issues:
> > >
> > >  - No issues requires board attention
> > >
> > >
> > >
> > > ## Activity:
> > >
> > >  - Samza 1.0 is released:
> > >
> > > - News coverage:
> > >
> >
> https://www.zdnet.com/article/real-time-data-processing-just-got-more-options-linkedin-releases-apache-samza-1-0-streaming/
> > >
> > > - Engineering blogs:
> > >
> >
> https://engineering.linkedin.com/blog/2018/11/samza-1-0--stream-processing-at-massive-scale
> > >
> > > - Major online website refresh: http://samza.apache.org/
> > >
> > >  - Critical improvement projects completed:
> > >
> > > - Changelog restore parallelization
> > >
> > > - Evaluation for HDFS based backup/restore of state stores
> > >
> > >  - Multiple SEP projects initiated or in-progress:
> > >
> > > - SEP-18: allows manipulating starting offsets and time-based
> rewind
> > >
> > > - SEP-19: Fast failover for stateful jobs on container failure
> (i.e.
> > > standby container)
> > >
> > > - SEP to come soon: async high-level API
> > >
> > >  - Beam Samza runner upgrade to use Samza 1.0
> > >
> > >  - Go and Python support via Beam Samza runner
> > >
> > >
> > >
> > > ## Health report:
> > >
> > >  - Project is in healthy status with 1.0 released in Nov 2018
> > >
> > >
> > >
> > > ## PMC changes:
> > >
> > >
> > >
> > >  - Currently 15 PMC members.
> > >
> > >  - Prateek Maheshwari was added to the PMC on Thu Nov 01 2018
> > >
> > >
> > >
> > > ## Committer base changes:
> > >
> > >
> > >
> > >  - Currently 22 committers.
> > >
> > >  - New commmitters:
> > >
> > > - Aditya Toomula was added as a committer on Mon Nov 05 2018
> > >
> > > - Hai Lu was added as a committer on Mon Nov 05 2018
> > >
> > >
> > >
> > > ## Releases:
> > >
> > >
> > >
> > >  - Last release was 1.0 on Nov 28, 2018
> > >
> > >
> > >
> > > ## /dist/ errors: 9
> > >
> > >  - Project is in healthy status with 1.0 released in Nov 2018
> > >
> > >
> > >
> > > ## Mailing list activity:
> > >
> > >
> > >
> > >  - dev@samza.apache.org:
> > >
> > > - 271 subscribers (down -13 in the last 3 months):
> > >
> > > - 445 emails sent to list (288 in previous quarter)
> > >
> > >
> > >
> > >
> > >
> > > ## JIRA activity:
> > >
> > >
> > >
> > >  - 111 JIRA tickets created in the last 3 months
> > >
> > >  - 57 JIRA tickets closed/resolved in the last 3 months
> >
>


Re: Draft report to board - Jan 2019

2019-01-09 Thread Yi Pan
Thanks! Updated inline accordingly.

-Yi

On Wed, Jan 9, 2019 at 12:32 PM Prateek Maheshwari 
wrote:

> Thanks for the summary Yi. I'd change: "HDFS based backup/restore of
> state stores" to "Evaluation for HDFS based backup/restore of state
> stores" since this was an intern project and is not checked in to
> master. Otherwise LGTM.
>
> Thanks,
> Prateek
>
> On Wed, Jan 9, 2019 at 12:28 PM Yi Pan  wrote:
> >
> > Hi, all,
> >
> > Our quarterly report is due this Wed (1/9). The following is the draft
> > report. Please let me know by the end of the day if I missed anything.
> > Thanks!
> >
> > ## Description:
> >
> >  - Apache Samza is a distributed stream processing engine that are highly
> >
> >configurable to process events from various data sources, including
> >
> >real-time messaging system (e.g. Kafka) and distributed file systems
> > (e.g.
> >
> >HDFS).
> >
> >
> >
> > ## Issues:
> >
> >  - No issues requires board attention
> >
> >
> >
> > ## Activity:
> >
> >  - Samza 1.0 is released:
> >
> > - News coverage:
> >
> https://www.zdnet.com/article/real-time-data-processing-just-got-more-options-linkedin-releases-apache-samza-1-0-streaming/
> >
> > - Engineering blogs:
> >
> https://engineering.linkedin.com/blog/2018/11/samza-1-0--stream-processing-at-massive-scale
> >
> > - Major online website refresh: http://samza.apache.org/
> >
> >  - Critical improvement projects completed:
> >
> > - Changelog restore parallelization
> >
> > - Evaluation for HDFS based backup/restore of state stores
> >
> >  - Multiple SEP projects initiated or in-progress:
> >
> > - SEP-18: allows manipulating starting offsets and time-based rewind
> >
> > - SEP-19: Fast failover for stateful jobs on container failure (i.e.
> > standby container)
> >
> > - SEP to come soon: async high-level API
> >
> >  - Beam Samza runner upgrade to use Samza 1.0
> >
> >  - Go and Python support via Beam Samza runner
> >
> >
> >
> > ## Health report:
> >
> >  - Project is in healthy status with 1.0 released in Nov 2018
> >
> >
> >
> > ## PMC changes:
> >
> >
> >
> >  - Currently 15 PMC members.
> >
> >  - Prateek Maheshwari was added to the PMC on Thu Nov 01 2018
> >
> >
> >
> > ## Committer base changes:
> >
> >
> >
> >  - Currently 22 committers.
> >
> >  - New commmitters:
> >
> > - Aditya Toomula was added as a committer on Mon Nov 05 2018
> >
> > - Hai Lu was added as a committer on Mon Nov 05 2018
> >
> >
> >
> > ## Releases:
> >
> >
> >
> >  - Last release was 1.0 on Nov 28, 2018
> >
> >
> >
> > ## /dist/ errors: 9
> >
> >  - Project is in healthy status with 1.0 released in Nov 2018
> >
> >
> >
> > ## Mailing list activity:
> >
> >
> >
> >  - dev@samza.apache.org:
> >
> > - 271 subscribers (down -13 in the last 3 months):
> >
> > - 445 emails sent to list (288 in previous quarter)
> >
> >
> >
> >
> >
> > ## JIRA activity:
> >
> >
> >
> >  - 111 JIRA tickets created in the last 3 months
> >
> >  - 57 JIRA tickets closed/resolved in the last 3 months
>


Re: Draft report to board - Jan 2019

2019-01-09 Thread Prateek Maheshwari
Thanks for the summary Yi. I'd change: "HDFS based backup/restore of
state stores" to "Evaluation for HDFS based backup/restore of state
stores" since this was an intern project and is not checked in to
master. Otherwise LGTM.

Thanks,
Prateek

On Wed, Jan 9, 2019 at 12:28 PM Yi Pan  wrote:
>
> Hi, all,
>
> Our quarterly report is due this Wed (1/9). The following is the draft
> report. Please let me know by the end of the day if I missed anything.
> Thanks!
>
> ## Description:
>
>  - Apache Samza is a distributed stream processing engine that are highly
>
>configurable to process events from various data sources, including
>
>real-time messaging system (e.g. Kafka) and distributed file systems
> (e.g.
>
>HDFS).
>
>
>
> ## Issues:
>
>  - No issues requires board attention
>
>
>
> ## Activity:
>
>  - Samza 1.0 is released:
>
> - News coverage:
> https://www.zdnet.com/article/real-time-data-processing-just-got-more-options-linkedin-releases-apache-samza-1-0-streaming/
>
> - Engineering blogs:
> https://engineering.linkedin.com/blog/2018/11/samza-1-0--stream-processing-at-massive-scale
>
> - Major online website refresh: http://samza.apache.org/
>
>  - Critical improvement projects completed:
>
> - Changelog restore parallelization
>
> - HDFS based backup/restore of state stores
>
>  - Multiple SEP projects initiated or in-progress:
>
> - SEP-18: allows manipulating starting offsets and time-based rewind
>
> - SEP-19: Fast failover for stateful jobs on container failure (i.e.
> standby container)
>
> - SEP to come soon: async high-level API
>
>  - Beam Samza runner upgrade to use Samza 1.0
>
>  - Go and Python support via Beam Samza runner
>
>
>
> ## Health report:
>
>  - Project is in healthy status with 1.0 released in Nov 2018
>
>
>
> ## PMC changes:
>
>
>
>  - Currently 15 PMC members.
>
>  - Prateek Maheshwari was added to the PMC on Thu Nov 01 2018
>
>
>
> ## Committer base changes:
>
>
>
>  - Currently 22 committers.
>
>  - New commmitters:
>
> - Aditya Toomula was added as a committer on Mon Nov 05 2018
>
> - Hai Lu was added as a committer on Mon Nov 05 2018
>
>
>
> ## Releases:
>
>
>
>  - Last release was 1.0 on Nov 28, 2018
>
>
>
> ## /dist/ errors: 9
>
>  - Project is in healthy status with 1.0 released in Nov 2018
>
>
>
> ## Mailing list activity:
>
>
>
>  - dev@samza.apache.org:
>
> - 271 subscribers (down -13 in the last 3 months):
>
> - 445 emails sent to list (288 in previous quarter)
>
>
>
>
>
> ## JIRA activity:
>
>
>
>  - 111 JIRA tickets created in the last 3 months
>
>  - 57 JIRA tickets closed/resolved in the last 3 months


Draft report to board - Jan 2019

2019-01-09 Thread Yi Pan
Hi, all,

Our quarterly report is due this Wed (1/9). The following is the draft
report. Please let me know by the end of the day if I missed anything.
Thanks!

## Description:

 - Apache Samza is a distributed stream processing engine that are highly

   configurable to process events from various data sources, including

   real-time messaging system (e.g. Kafka) and distributed file systems
(e.g.

   HDFS).



## Issues:

 - No issues requires board attention



## Activity:

 - Samza 1.0 is released:

- News coverage:
https://www.zdnet.com/article/real-time-data-processing-just-got-more-options-linkedin-releases-apache-samza-1-0-streaming/

- Engineering blogs:
https://engineering.linkedin.com/blog/2018/11/samza-1-0--stream-processing-at-massive-scale

- Major online website refresh: http://samza.apache.org/

 - Critical improvement projects completed:

- Changelog restore parallelization

- HDFS based backup/restore of state stores

 - Multiple SEP projects initiated or in-progress:

- SEP-18: allows manipulating starting offsets and time-based rewind

- SEP-19: Fast failover for stateful jobs on container failure (i.e.
standby container)

- SEP to come soon: async high-level API

 - Beam Samza runner upgrade to use Samza 1.0

 - Go and Python support via Beam Samza runner



## Health report:

 - Project is in healthy status with 1.0 released in Nov 2018



## PMC changes:



 - Currently 15 PMC members.

 - Prateek Maheshwari was added to the PMC on Thu Nov 01 2018



## Committer base changes:



 - Currently 22 committers.

 - New commmitters:

- Aditya Toomula was added as a committer on Mon Nov 05 2018

- Hai Lu was added as a committer on Mon Nov 05 2018



## Releases:



 - Last release was 1.0 on Nov 28, 2018



## /dist/ errors: 9

 - Project is in healthy status with 1.0 released in Nov 2018



## Mailing list activity:



 - dev@samza.apache.org:

- 271 subscribers (down -13 in the last 3 months):

- 445 emails sent to list (288 in previous quarter)





## JIRA activity:



 - 111 JIRA tickets created in the last 3 months

 - 57 JIRA tickets closed/resolved in the last 3 months