## Description:

- Apache MADlib is a scalable, big data, SQL-driven machine learning framework
  for data scientists.


## Issues:

- There are no issues requiring board attention at this time.


## Activity:

- Community is at work on the 1.17 release, which will be the 7th release as
  an Apache TLP. Main JIRAs include:
* feature improvements for deep learning including training multiple models
  in parallel for parameter selection (hyper-parameter tuning and model
  architecture search), inference on models trained outside of MADlib, and
  performance improvements to mini-batch preprocessor
* performance improvements to correlation/covariance, association rules, and
  weakly connected components graph algorithm
* stopping criteria on LDA using perplexity
* auto selection of number of centroids for k-means clustering
* Postgres 12 support

- The 2.0 release will follow, with JIRAs related to versioning models.

- Frank McQuillan (MADlib committer and PMC member) will present the latest
  deep learning work at FOSDEM'20
  https://fosdem.org/2020/schedule/event/mppdb/ in a talk called: "Efficient
  Model Selection for Deep Neural Networks on Massively Parallel Processing
  Databases"

## Health report:

The community is relatively small but very engaged, with robust mailing list
traffic, interest in frequent releases, and new functionality being developed
by contributors.

The number of developers actively contributing to the code/documentation is
approximately 7 in the 4th quarter of calendar year 2019.

We continue to look for new community members to invite as committers or PMC
members.


## PMC changes:

- No changes in the last quarter.  Currently stands at 14 PMC members.


## Committer base changes:

- Currently 17 committers, no new committers since last report.

- The most recent committers added were: Ekta Khanna (2019-07-27), Himanshu
  Pandey (2019-07-27), and Domino Valdano (2019-07-27)


## Releases:

- Next release: v1.17 planned for Jan 2020

- v1.16.0 released on 2019-07-08

- v1.15.1 released on 2018-10-15

- v1.15.0 released on 2018-08-10


## Mailing list activity:

Average monthly mailing list activity was 138 posts to dev@ and 11 posts to
user@ over the last 3 months (Oct-Dec 2019).


## JIRA Statistics:

- 2 JIRA tickets created in the last month

- 10 JIRA tickets resolved in the last month
