[GitHub] madlib issue #193: Add support for GPDB 6.X versions

2017-11-02 Thread kaknikhil
Github user kaknikhil commented on the issue: https://github.com/apache/madlib/pull/193 I pulled your branch and I was able to install madlib with gpdb 6. 👍 ---

[GitHub] madlib pull request #195: Feature: Add grouping support to HITS

2017-11-06 Thread kaknikhil
GitHub user kaknikhil opened a pull request: https://github.com/apache/madlib/pull/195 Feature: Add grouping support to HITS JIRA: MADLIB-1151 Changes to support grouping column in HITS. Update queries to use group by and other necessary sql constructs. You can merge

[GitHub] madlib pull request #194: Logregr: Add input validation for dep/indep variab...

2017-11-10 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/194#discussion_r150355564 --- Diff: src/ports/postgres/modules/regress/logistic.py_in --- @@ -158,12 +159,14 @@ def __logregr_validate_args(schema_madlib, tbl_source, tbl_output

[GitHub] madlib issue #195: Feature: Add grouping support to HITS

2017-11-14 Thread kaknikhil
Github user kaknikhil commented on the issue: https://github.com/apache/madlib/pull/195 @iyerr3 @orhankislal All the `*_for_centrality_measures` functions are currently only used in `pagerank` and `hits`, which are centrality measures. They can potentially be used in future

[GitHub] madlib issue #195: Feature: Add grouping support to HITS

2017-11-14 Thread kaknikhil
Github user kaknikhil commented on the issue: https://github.com/apache/madlib/pull/195 @orhankislal @iyerr3 we have refactored the functions grp_frm_table and get_ignore_groups. Let us know if it looks good ? ---

[GitHub] madlib pull request #195: Feature: Add grouping support to HITS

2017-11-15 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/195#discussion_r151225853 --- Diff: src/ports/postgres/modules/graph/hits.sql_in --- @@ -164,10 +168,20 @@ INSERT INTO edge VALUES (3, 0, 1), (4, 0, 1), (5, 6, 1

[GitHub] madlib pull request #195: Feature: Add grouping support to HITS

2017-11-15 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/195#discussion_r151226056 --- Diff: src/ports/postgres/modules/utilities/utilities.py_in --- @@ -709,16 +709,35 @@ def _check_groups(tbl1, tbl2, grp_list): return '

[GitHub] madlib pull request #195: Feature: Add grouping support to HITS

2017-11-15 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/195#discussion_r151226712 --- Diff: src/ports/postgres/modules/graph/hits.py_in --- @@ -95,234 +109,391 @@ def hits(schema_madlib, vertex_table, vertex_id, edge_table, edge_args

[GitHub] madlib pull request #218: Balanced Datasets: Random undersampling with/witho...

2017-12-19 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/218#discussion_r157794392 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,322 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #218: Balanced Datasets: Random undersampling with/witho...

2017-12-19 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/218#discussion_r157773474 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,322 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #218: Balanced Datasets: Random undersampling with/witho...

2017-12-19 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/218#discussion_r157769789 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,322 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #218: Balanced Datasets: Random undersampling with/witho...

2017-12-19 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/218#discussion_r157770698 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,322 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #218: Balanced Datasets: Random undersampling with/witho...

2017-12-19 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/218#discussion_r157799149 --- Diff: src/ports/postgres/modules/sample/test/balance_sample.sql_in --- @@ -0,0 +1,103

[GitHub] madlib pull request #218: Balanced Datasets: Random undersampling with/witho...

2017-12-19 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/218#discussion_r157892828 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,322 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #218: Balanced Datasets: Random undersampling with/witho...

2017-12-19 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/218#discussion_r157893105 --- Diff: src/ports/postgres/modules/sample/test/balance_sample.sql_in --- @@ -93,8 +93,8 @@ SELECT assert(sum(c) <= 18, 'Wrong number of samples

[GitHub] madlib pull request #230: Balanced sets final

2018-02-02 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/230#discussion_r165537394 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,748 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #230: Balanced sets final

2018-02-02 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/230#discussion_r165734791 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,748 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #230: Balanced sets final

2018-02-02 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/230#discussion_r165733978 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,748 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #230: Balanced sets final

2018-02-02 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/230#discussion_r165527448 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,748 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #230: Balanced sets final

2018-02-02 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/230#discussion_r165479832 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,748 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #230: Balanced sets final

2018-02-02 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/230#discussion_r165516241 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,748 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #230: Balanced sets final

2018-02-02 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/230#discussion_r165503656 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,748 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #230: Balanced sets final

2018-02-02 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/230#discussion_r165546601 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,748 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #230: Balanced sets final

2018-02-02 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/230#discussion_r165526039 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,748 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #230: Balanced sets final

2018-02-02 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/230#discussion_r165505215 --- Diff: src/ports/postgres/modules/sample/balance_sample.sql_in --- @@ -0,0 +1,355

[GitHub] madlib pull request #230: Balanced sets final

2018-02-02 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/230#discussion_r165530126 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,748 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #230: Balanced sets final

2018-02-02 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/230#discussion_r165736819 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,748 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #230: Balanced sets final

2018-02-02 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/230#discussion_r165511625 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,748 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #230: Balanced sets final

2018-02-02 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/230#discussion_r165520949 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,748 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #230: Balanced sets final

2018-02-02 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/230#discussion_r165520599 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,748 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #230: Balanced sets final

2018-02-02 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/230#discussion_r165748923 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,748 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #230: Balanced sets final

2018-02-02 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/230#discussion_r165522340 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,748 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #230: Balanced sets final

2018-02-02 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/230#discussion_r165741803 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,748 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #230: Balanced sets final

2018-02-02 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/230#discussion_r165515209 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,748 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #230: Balanced sets final

2018-02-02 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/230#discussion_r165523942 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,748 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #230: Balanced sets final

2018-02-02 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/230#discussion_r165737045 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,748 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #230: Balanced sets final

2018-02-02 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/230#discussion_r165529067 --- Diff: src/ports/postgres/modules/sample/balance_sample.py_in --- @@ -0,0 +1,748 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] madlib pull request #233: Install git on postgres centos 7 docker images.

2018-02-12 Thread kaknikhil
GitHub user kaknikhil opened a pull request: https://github.com/apache/madlib/pull/233 Install git on postgres centos 7 docker images. You can merge this pull request into a Git repository by running: $ git pull https://github.com/kaknikhil/madlib install_git_docker

[GitHub] madlib pull request #241: MiniBatch Pre-Processor: Add new module minibatch_...

2018-03-14 Thread kaknikhil
GitHub user kaknikhil opened a pull request: https://github.com/apache/madlib/pull/241 MiniBatch Pre-Processor: Add new module minibatch_preprocessing JIRA: MADLIB-1200 MiniBatch Preprocessor is a utility function to pre-process the input data for use with models that

[GitHub] madlib pull request #241: MiniBatch Pre-Processor: Add new module minibatch_...

2018-03-14 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/241#discussion_r174625748 --- Diff: src/ports/postgres/modules/utilities/minibatch_preprocessing.py_in --- @@ -0,0 +1,559 @@ +# coding=utf-8 +# +# Licensed to the

[GitHub] madlib pull request #240: MLP: Fix step size initialization based on learnin...

2018-03-16 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/240#discussion_r175224785 --- Diff: src/ports/postgres/modules/convex/mlp_igd.py_in --- @@ -112,6 +112,7 @@ def mlp(schema_madlib, source_table, output_table, independent_varname

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175621915 --- Diff: src/modules/convex/algo/igd.hpp --- @@ -90,20 +90,27 @@ IGD::transition(state_type &state, for (int curr_epoch=0; curr_e

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175625333 --- Diff: src/modules/convex/algo/igd.hpp --- @@ -90,20 +90,27 @@ IGD::transition(state_type &state, for (int curr_epoch=0; curr_e

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175626817 --- Diff: src/modules/convex/mlp_igd.cpp --- @@ -130,6 +145,90 @@ mlp_igd_transition::run(AnyType &args) { return s

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175917822 --- Diff: src/ports/postgres/modules/convex/mlp_igd.py_in --- @@ -222,67 +243,83 @@ def mlp(schema_madlib, source_table, output_table, independent_varname

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175890376 --- Diff: src/ports/postgres/modules/convex/mlp_igd.py_in --- @@ -1457,3 +1660,85 @@ def mlp_predict_help(schema_madlib, message): return

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175895832 --- Diff: src/ports/postgres/modules/convex/mlp_igd.py_in --- @@ -72,107 +73,127 @@ def mlp(schema_madlib, source_table, output_table, independent_varname

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175873168 --- Diff: src/modules/convex/task/mlp.hpp --- @@ -111,6 +117,57 @@ class MLP { template double MLP::lambda = 0; +template +double

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175921215 --- Diff: src/ports/postgres/modules/convex/mlp_igd.py_in --- @@ -222,67 +243,83 @@ def mlp(schema_madlib, source_table, output_table, independent_varname

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175931022 --- Diff: src/ports/postgres/modules/convex/mlp_igd.py_in --- @@ -491,10 +571,28 @@ def _update_temp_model_table(args, iteration, temp_output_table

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175628144 --- Diff: src/modules/convex/mlp_igd.cpp --- @@ -130,6 +145,90 @@ mlp_igd_transition::run(AnyType &args) { return s

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175877947 --- Diff: src/modules/convex/task/mlp.hpp --- @@ -111,6 +117,57 @@ class MLP { template double MLP::lambda = 0; +template +double

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175929883 --- Diff: src/ports/postgres/modules/convex/mlp_igd.py_in --- @@ -292,26 +329,33 @@ def mlp(schema_madlib, source_table, output_table, independent_varname

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175620624 --- Diff: src/modules/convex/algo/igd.hpp --- @@ -90,20 +90,27 @@ IGD::transition(state_type &state, for (int curr_epoch=0; curr_e

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175871655 --- Diff: src/modules/convex/mlp_igd.cpp --- @@ -170,6 +289,24 @@ mlp_igd_final::run(AnyType &args) { return s

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175627929 --- Diff: src/modules/convex/mlp_igd.cpp --- @@ -130,6 +145,90 @@ mlp_igd_transition::run(AnyType &args) { return s

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175922864 --- Diff: src/ports/postgres/modules/convex/mlp_igd.py_in --- @@ -292,26 +329,33 @@ def mlp(schema_madlib, source_table, output_table, independent_varname

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175889157 --- Diff: src/ports/postgres/modules/convex/mlp_igd.py_in --- @@ -590,51 +664,103 @@ def _validate_warm_start(output_table, summary_table

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175627412 --- Diff: src/modules/convex/mlp_igd.cpp --- @@ -130,6 +145,90 @@ mlp_igd_transition::run(AnyType &args) { return s

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175872591 --- Diff: src/modules/convex/task/mlp.hpp --- @@ -111,6 +117,57 @@ class MLP { template double MLP::lambda = 0; +template +double

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175923655 --- Diff: src/ports/postgres/modules/convex/mlp_igd.py_in --- @@ -33,11 +34,12 @@ from convex.utils_regularization import __utils_normalize_data_grouping

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175894372 --- Diff: src/ports/postgres/modules/convex/mlp_igd.py_in --- @@ -1457,3 +1660,85 @@ def mlp_predict_help(schema_madlib, message): return

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175893520 --- Diff: src/ports/postgres/modules/convex/mlp_igd.py_in --- @@ -1457,3 +1660,85 @@ def mlp_predict_help(schema_madlib, message): return

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175891761 --- Diff: src/ports/postgres/modules/convex/mlp_igd.py_in --- @@ -72,107 +73,127 @@ def mlp(schema_madlib, source_table, output_table, independent_varname

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175923217 --- Diff: src/ports/postgres/modules/convex/mlp_igd.py_in --- @@ -292,26 +329,33 @@ def mlp(schema_madlib, source_table, output_table, independent_varname

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-20 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r175888098 --- Diff: src/ports/postgres/modules/convex/mlp_igd.py_in --- @@ -590,51 +664,103 @@ def _validate_warm_start(output_table, summary_table

[GitHub] madlib pull request #243: MLP: Add minibatch gradient descent solver

2018-03-21 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/243#discussion_r176262391 --- Diff: src/ports/postgres/modules/convex/test/mlp.sql_in --- @@ -340,6 +181,51 @@ INSERT INTO iris_data VALUES (149,ARRAY[6.2,3.4,5.4,2.3],'

[GitHub] madlib issue #247: SVM: Revert minibatch-related work

2018-03-22 Thread kaknikhil
Github user kaknikhil commented on the issue: https://github.com/apache/madlib/pull/247 @iyerr3 I compared this PR with the original commit and it looks good. Just a couple of minor questions 1. This change to model.hpp was not reverted. It looks like it's not relat

[GitHub] madlib issue #247: SVM: Revert minibatch-related work

2018-03-22 Thread kaknikhil
Github user kaknikhil commented on the issue: https://github.com/apache/madlib/pull/247 @iyerr3 Thanks for the detailed explanation. I agree, we should avoid a redundant commit and take care of this in a future commit. ---

[GitHub] madlib issue #244: Changes for Personalized Page Rank : Jira:1084

2018-03-23 Thread kaknikhil
Github user kaknikhil commented on the issue: https://github.com/apache/madlib/pull/244 @hpandeycodeit Can you add more description in the final commit (maybe use the same description as the PR). This is the convention we use for our commit messages. ``` Commit Title

[GitHub] madlib issue #249: RF: Use NULL::integer[] when no continuous features

2018-03-27 Thread kaknikhil
Github user kaknikhil commented on the issue: https://github.com/apache/madlib/pull/249 The changes look good. +1 for adding an install check test. I noticed that `con_features` is populated by `decision_tree.py:_classify_features` which does something like `con_types

[GitHub] madlib pull request #253: MLP: Add install check tests for minibatch with gr...

2018-03-28 Thread kaknikhil
GitHub user kaknikhil opened a pull request: https://github.com/apache/madlib/pull/253 MLP: Add install check tests for minibatch with grouping This PR adds install check tests for MLP minibatch with grouping. You can merge this pull request into a Git repository by running

[GitHub] madlib pull request #250: MLP: Allow one-hot encoded dependent var for class...

2018-03-29 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/250#discussion_r177928648 --- Diff: src/ports/postgres/modules/convex/mlp_igd.py_in --- @@ -667,7 +678,8 @@ def _validate_dependent_var(source_table, dependent_varname

[GitHub] madlib pull request #250: MLP: Allow one-hot encoded dependent var for class...

2018-03-29 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/250#discussion_r177913869 --- Diff: src/ports/postgres/modules/convex/mlp_igd.py_in --- @@ -856,8 +868,16 @@ def mlp_predict(schema_madlib, model_table, data_table, id_col_name

[GitHub] madlib pull request #250: MLP: Allow one-hot encoded dependent var for class...

2018-03-29 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/250#discussion_r177912305 --- Diff: src/ports/postgres/modules/convex/mlp_igd.py_in --- @@ -667,7 +678,8 @@ def _validate_dependent_var(source_table, dependent_varname

[GitHub] madlib pull request #250: MLP: Allow one-hot encoded dependent var for class...

2018-03-29 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/250#discussion_r177912480 --- Diff: src/ports/postgres/modules/convex/mlp_igd.py_in --- @@ -667,7 +678,8 @@ def _validate_dependent_var(source_table, dependent_varname

[GitHub] madlib pull request #250: MLP: Allow one-hot encoded dependent var for class...

2018-03-29 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/250#discussion_r177912997 --- Diff: src/ports/postgres/modules/convex/mlp_igd.py_in --- @@ -667,7 +678,8 @@ def _validate_dependent_var(source_table, dependent_varname

[GitHub] madlib issue #253: MLP: Add install check tests for minibatch with grouping

2018-03-29 Thread kaknikhil
Github user kaknikhil commented on the issue: https://github.com/apache/madlib/pull/253 Closed by ab7166ff4fc55311ec29bb8b54d17becd9bb1750 ---

[GitHub] madlib pull request #253: MLP: Add install check tests for minibatch with gr...

2018-03-29 Thread kaknikhil
Github user kaknikhil closed the pull request at: https://github.com/apache/madlib/pull/253 ---

[GitHub] madlib pull request #254: Enable grouping for minibatch preprocessing

2018-04-02 Thread kaknikhil
GitHub user kaknikhil opened a pull request: https://github.com/apache/madlib/pull/254 Enable grouping for minibatch preprocessing This PR enables grouping for the minibatch preprocessor module. Other changes 1. Added install check test for special chars. 2

[GitHub] madlib pull request #260: minibatch preprocessor improvements

2018-04-10 Thread kaknikhil
GitHub user kaknikhil opened a pull request: https://github.com/apache/madlib/pull/260 minibatch preprocessor improvements This PR makes two improvements to the preprocessor code 1. Check for all character types for dependent col 2. Create temp table for standardization

[GitHub] madlib pull request #260: minibatch preprocessor improvements

2018-04-11 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/260#discussion_r180850923 --- Diff: src/ports/postgres/modules/utilities/minibatch_preprocessing.py_in --- @@ -397,8 +408,9 @@ class MiniBatchStandardizer

[GitHub] madlib pull request #260: minibatch preprocessor improvements

2018-04-11 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/260#discussion_r180851151 --- Diff: src/ports/postgres/modules/utilities/minibatch_preprocessing.py_in --- @@ -387,6 +397,7 @@ class MiniBatchStandardizer: ) as

[GitHub] madlib pull request #272: MLP: Add momentum and nesterov to gradient updates...

2018-05-24 Thread kaknikhil
GitHub user kaknikhil opened a pull request: https://github.com/apache/madlib/pull/272 MLP: Add momentum and nesterov to gradient updates. JIRA: MADLIB-1210 We refactored the minibatch code to separate out the momentum and model update functions. We initially were using

[GitHub] madlib pull request #268: DT: Don't use NULL value to get dep_var type

2018-05-25 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/268#discussion_r185970102 --- Diff: src/ports/postgres/modules/utilities/validate_args.py_in --- @@ -368,8 +375,10 @@ def get_expr_type(expr, tbl): SELECT pg_typeof({0

[GitHub] madlib pull request #271: Madpack: Make install, reinstall and upgrade atomi...

2018-05-30 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/271#discussion_r191581891 --- Diff: src/madpack/madpack.py --- @@ -95,6 +95,16 @@ def _internal_run_query(sql, show_error): return run_query(sql, con_args, show_error

[GitHub] madlib pull request #271: Madpack: Make install, reinstall and upgrade atomi...

2018-05-30 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/271#discussion_r191581553 --- Diff: src/madpack/madpack.py --- @@ -95,6 +95,16 @@ def _internal_run_query(sql, show_error): return run_query(sql, con_args, show_error

[GitHub] madlib pull request #271: Madpack: Make install, reinstall and upgrade atomi...

2018-05-30 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/271#discussion_r191581799 --- Diff: src/madpack/madpack.py --- @@ -95,6 +95,16 @@ def _internal_run_query(sql, show_error): return run_query(sql, con_args, show_error

[GitHub] madlib pull request #271: Madpack: Make install, reinstall and upgrade atomi...

2018-05-30 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/271#discussion_r191586806 --- Diff: src/madpack/madpack.py --- @@ -131,10 +141,73 @@ def _get_relative_maddir(maddir, port): return maddir

[GitHub] madlib pull request #271: Madpack: Make install, reinstall and upgrade atomi...

2018-05-30 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/271#discussion_r191580904 --- Diff: src/madpack/madpack.py --- @@ -95,6 +95,16 @@ def _internal_run_query(sql, show_error): return run_query(sql, con_args, show_error

[GitHub] madlib pull request #271: Madpack: Make install, reinstall and upgrade atomi...

2018-05-30 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/271#discussion_r191607552 --- Diff: src/madpack/madpack.py --- @@ -824,6 +873,246 @@ def parse_arguments(): # Get the arguments return parser.parse_args

[GitHub] madlib pull request #271: Madpack: Make install, reinstall and upgrade atomi...

2018-05-30 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/271#discussion_r191607127 --- Diff: src/madpack/madpack.py --- @@ -824,6 +873,246 @@ def parse_arguments(): # Get the arguments return parser.parse_args

[GitHub] madlib pull request #271: Madpack: Make install, reinstall and upgrade atomi...

2018-05-30 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/271#discussion_r191589562 --- Diff: src/madpack/madpack.py --- @@ -238,6 +311,88 @@ def _run_sql_file(schema, maddir_mod_py, module, sqlfile, return retval

[GitHub] madlib pull request #271: Madpack: Make install, reinstall and upgrade atomi...

2018-05-30 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/271#discussion_r191603261 --- Diff: src/madpack/madpack.py --- @@ -131,10 +141,73 @@ def _get_relative_maddir(maddir, port): return maddir

[GitHub] madlib pull request #271: Madpack: Make install, reinstall and upgrade atomi...

2018-05-30 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/271#discussion_r191585342 --- Diff: src/madpack/madpack.py --- @@ -131,10 +141,73 @@ def _get_relative_maddir(maddir, port): return maddir

[GitHub] madlib pull request #271: Madpack: Make install, reinstall and upgrade atomi...

2018-05-30 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/271#discussion_r191608639 --- Diff: src/madpack/madpack.py --- @@ -824,6 +873,246 @@ def parse_arguments(): # Get the arguments return parser.parse_args

[GitHub] madlib pull request #271: Madpack: Make install, reinstall and upgrade atomi...

2018-05-30 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/271#discussion_r191588071 --- Diff: src/madpack/madpack.py --- @@ -131,10 +141,73 @@ def _get_relative_maddir(maddir, port): return maddir

[GitHub] madlib pull request #271: Madpack: Make install, reinstall and upgrade atomi...

2018-05-30 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/271#discussion_r191598228 --- Diff: src/madpack/upgrade_util.py --- @@ -1299,18 +1303,19 @@ def _clean_function(self): pattern = re.compile(r"""CREA

[GitHub] madlib pull request #271: Madpack: Make install, reinstall and upgrade atomi...

2018-05-30 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/271#discussion_r191604335 --- Diff: src/madpack/upgrade_util.py --- @@ -1299,18 +1303,19 @@ def _clean_function(self): pattern = re.compile(r"""CREA

[GitHub] madlib pull request #271: Madpack: Make install, reinstall and upgrade atomi...

2018-05-30 Thread kaknikhil
Github user kaknikhil commented on a diff in the pull request: https://github.com/apache/madlib/pull/271#discussion_r191592112 --- Diff: src/madpack/madpack.py --- @@ -238,6 +311,88 @@ def _run_sql_file(schema, maddir_mod_py, module, sqlfile, return retval

  1   2   >