The Apache MADlib team is pleased to announce the immediate availability of the 1.17.0 release.
The main goals of this release are: New features - DL: Add optional params to madlib_keras_fit_multiple_model (MADLIB-1397) - DL: Fit and evaluate changes for asymmetric cluster config (MADLIB-1393) - DL: Make param search fit() function work with existing evaluate and predict (MADLIB-1387) - DL: ParamSearch: Add utility function for generating model selection table (MADLIB-1375) - DL: Predict changes for asymmetric cluster config (MADLIB-1394) - DL: Preprocessor should evenly distribute data on an arbitrary number of segments (MADLIB-1378) - DL: Preprocessor support for asymmetric segment distribution (MADLIB-1392) - DL: Remove model_arch_table column from the output of load_model_selection_table (MADLIB-1381) - DL: Support DL predict without training on MADlib (MADLIB-1359) - DL: Transfer learning for multi-model (MADLIB-1389) - Kmeans: Add simple silhouette score for every point (MADLIB-1382) - Kmeans: Select number of centroids in k-means (MADLIB-1380) - PostgreSQL 12 support (MADLIB-1391) Improvements: - Assoc rules: Add option to set number of posterior in association rules (MADLIB-1327) - Correlation: Improve correlation and covariance memory usage with large number of groups (MADLIB-1301) - DL: helper function for asymmetric cluster config (MADLIB-1390) - DL: Mini-batch preprocessor for images - performance issue (MADLIB-1342) - DL: Modify warm start logic for DL to handle case of missing weight (MADLIB-1400) - DL: Param search for multiple models on MPP architecture (MADLIB-1386) - DL: performance improvements to fit transition function (MADLIB-1418) - Docs: Enhance Installation Guides (MADLIB-1399) - Graph: SSSP should not show vertices in output table that are unreachable (MADLIB-1415) - Knn - add zero check and output distance array (MADLIB-1370) - LDA: Add stopping criteria on perplexity to LDA (MADLIB-1351) - Summary: Last optional param in summary errors when NULL (MADLIB-1413) - Summary: Summary function has dups for MFV for approximate results (MADLIB-1412) - SVM: Change default num_components for SVM to max(100, 2*num_features) (MADLIB-1384) All release changes can be found here: https://cwiki.apache.org/confluence/display/MADLIB/MADlib+1.17.0 You can download the source release and convenience binary packages from Apache MADlib's download page here: http://madlib.apache.org/download.html Alternatively, you can download through an ASF mirror near you: https://www.apache.org/dyn/closer.lua/madlib/1.17.0 ---- Apache MADlib is an open-source library for scalable in-database analytics. It provides data-parallel implementations of mathematical, statistical and machine learning methods for structured and unstructured data. The MADlib mission: to foster widespread development of scalable analytic skills, by harnessing efforts from commercial practice, academic research, and open-source development. We welcome your help and feedback. For more information on how to report problems, and to get involved, visit the project website at https://madlib.apache.org ---- Thank you, everyone, who contributed to the 1.17.0 release. We look forward to continued community participation for the next release! Regards, Orhan Kislal