[GitHub] incubator-madlib pull request: Matrix: Fix multiple input/output i...

2015-12-04 Thread iyerr3
GitHub user iyerr3 opened a pull request: https://github.com/apache/incubator-madlib/pull/7 Matrix: Fix multiple input/output issues JIRA: MADLIB-932 This commit contains fixes for following issues: - 'inf' or 'infinity' was read by Python as a float value. This has

[GitHub] incubator-madlib pull request: SVM: Add cross validation support a...

2015-11-24 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/4#discussion_r45804084 --- Diff: src/ports/postgres/modules/svm/svm.py_in --- @@ -66,211 +253,118 @@ def svm(schema_madlib, source_table, model_table

[GitHub] incubator-madlib pull request: SVM: Add cross validation support a...

2015-11-24 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/4#discussion_r45806336 --- Diff: src/ports/postgres/modules/svm/svm.py_in --- @@ -440,55 +595,68 @@ def _process_epsilon(is_svc, args): def _extract_params(schema_madlib

[GitHub] incubator-madlib pull request: SVM: Add cross validation support a...

2015-11-24 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/4#discussion_r45806649 --- Diff: src/ports/postgres/modules/svm/svm.py_in --- @@ -440,55 +595,68 @@ def _process_epsilon(is_svc, args): def _extract_params(schema_madlib

[GitHub] incubator-madlib pull request: SVM: Add cross validation support a...

2015-11-25 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/4#discussion_r45920065 --- Diff: src/ports/postgres/modules/svm/svm.py_in --- @@ -389,11 +521,20 @@ def _process_epsilon(is_svc, args): as_rel_source = '_src

[GitHub] incubator-madlib pull request: SVM: Add cross validation support a...

2015-11-25 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/4#discussion_r45919749 --- Diff: src/ports/postgres/modules/svm/svm.py_in --- @@ -66,211 +253,118 @@ def svm(schema_madlib, source_table, model_table

[GitHub] incubator-madlib pull request: Prediction Metrics: New module

2016-05-31 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/42#discussion_r65259215 --- Diff: src/ports/postgres/modules/stats/pred_metrics.py_in --- @@ -0,0 +1,562 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] incubator-madlib pull request: Prediction Metrics: New module

2016-05-31 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/42#discussion_r65259392 --- Diff: src/ports/postgres/modules/stats/pred_metrics.py_in --- @@ -0,0 +1,562 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] incubator-madlib pull request: Prediction Metrics: New module

2016-05-31 Thread iyerr3
Github user iyerr3 commented on the pull request: https://github.com/apache/incubator-madlib/pull/42 Along with casting the columns to int in binary classification, we also need to change docs/online-help/tests to reflect that boolean columns allowed for observation columns

[GitHub] incubator-madlib pull request: Prediction Metrics: New module

2016-05-27 Thread iyerr3
Github user iyerr3 commented on the pull request: https://github.com/apache/incubator-madlib/pull/42#issuecomment-23823 I have made some changes and added validation and online help functions (in my [private fork branch](https://github.com/iyerr3/incubator-madlib/commits/feature

[GitHub] incubator-madlib pull request #47: Feature: Pivot Function

2016-06-21 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/47#discussion_r67946909 --- Diff: src/ports/postgres/modules/utilities/pivot.sql_in --- @@ -0,0 +1,202

[GitHub] incubator-madlib pull request #49: Feature: Sessionize funtion - Phase 2

2016-06-21 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/49#discussion_r67954685 --- Diff: src/ports/postgres/modules/utilities/sessionize.py_in --- @@ -35,41 +36,83 @@ def sessionize(schema_madlib, source_table, output_table

[GitHub] incubator-madlib pull request #47: Feature: Pivot Function

2016-06-22 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/47#discussion_r68128267 --- Diff: src/ports/postgres/modules/utilities/pivot.py_in --- @@ -0,0 +1,201 @@ +# coding=utf-8 +# +# Licensed to the Apache Software

[GitHub] incubator-madlib pull request #47: Feature: Pivot Function

2016-06-15 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/47#discussion_r67259935 --- Diff: doc/mainpage.dox.in --- @@ -1,12 +1,12 @@ /** @mainpage -Apache MADlib (incubating) is an open-source library for scalable

[GitHub] incubator-madlib pull request: fix hypothesis test docs and exampl...

2016-02-29 Thread iyerr3
Github user iyerr3 commented on the pull request: https://github.com/apache/incubator-madlib/pull/23#issuecomment-190345079 Merged (0545cdfc499354820af670b627a2769ae6af75b1) --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well

[GitHub] incubator-madlib pull request: SVM: Add Polynomial Kernel

2016-02-26 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/22#discussion_r54282752 --- Diff: methods/array_ops/src/pg_gp/array_ops.c --- @@ -702,8 +702,14 @@ array_sub(PG_FUNCTION_ARGS){ PG_FUNCTION_INFO_V1(array_mult

[GitHub] incubator-madlib pull request: SVM: Add Polynomial Kernel

2016-02-26 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/22#discussion_r54290851 --- Diff: methods/array_ops/src/pg_gp/array_ops.c --- @@ -702,8 +702,14 @@ array_sub(PG_FUNCTION_ARGS){ PG_FUNCTION_INFO_V1(array_mult

[GitHub] incubator-madlib pull request: Path: Return results for each match

2016-03-10 Thread iyerr3
Github user iyerr3 commented on the pull request: https://github.com/apache/incubator-madlib/pull/29#issuecomment-195136869 @fmcquillan99 Please pull this branch to review/update documentation. @mktal Please give this a review. --- If your project is set up for it, you can

[GitHub] incubator-madlib pull request: Path: Return results for each match

2016-03-18 Thread iyerr3
Github user iyerr3 commented on the pull request: https://github.com/apache/incubator-madlib/pull/29#issuecomment-198561189 @fmcquillan99 - Added a new commit to ensure that the symbol order is maintained. The expected behavior is "if a row matches with two symbols, the s

[GitHub] incubator-madlib pull request: Path: Return results for each match

2016-03-21 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/29#discussion_r56901766 --- Diff: src/ports/postgres/modules/utilities/path.py_in --- @@ -387,11 +395,16 @@ def _parse_symbol_str(symbol_expr, pattern_expr

[GitHub] incubator-madlib pull request: Recursive_partitioning: Fix random ...

2016-03-25 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/32#discussion_r57494868 --- Diff: src/ports/postgres/modules/recursive_partitioning/decision_tree.py_in --- @@ -669,6 +669,8 @@ def _get_n_and_deplist(training_table_name

[GitHub] incubator-madlib pull request: misc doc changes for 1 dot 9

2016-03-25 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/33#discussion_r57504858 --- Diff: src/ports/postgres/modules/summary/summary.sql_in --- @@ -285,6 +294,16 @@ string should be double-quoted; in this case the input would

[GitHub] incubator-madlib pull request: misc doc changes for 1 dot 9

2016-03-25 Thread iyerr3
Github user iyerr3 commented on the pull request: https://github.com/apache/incubator-madlib/pull/33#issuecomment-201615733 LGTM pending some of the spelling corrections --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well

[GitHub] incubator-madlib pull request: misc doc changes for 1 dot 9

2016-03-25 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/33#discussion_r57504745 --- Diff: src/ports/postgres/modules/pca/pca.sql_in --- @@ -324,13 +341,20 @@ string should be double-quoted; in this case the input would

[GitHub] incubator-madlib pull request: misc doc changes for 1 dot 9

2016-03-25 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/33#discussion_r57503961 --- Diff: src/ports/postgres/modules/elastic_net/elastic_net.sql_in --- @@ -130,7 +130,13 @@ the dependent variable expression to the excluded string

[GitHub] incubator-madlib pull request: Path: Return results for each match

2016-03-24 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/29#discussion_r57402068 --- Diff: src/ports/postgres/modules/utilities/path.py_in --- @@ -108,8 +107,11 @@ def path(schema_madlib, source_table, output_table, partition_expr

[GitHub] incubator-madlib pull request: Path: Return results for each match

2016-03-24 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/29#discussion_r57402003 --- Diff: src/ports/postgres/modules/utilities/path.py_in --- @@ -118,140 +120,175 @@ def path(schema_madlib, source_table, output_table

[GitHub] incubator-madlib pull request: Path: Return results for each match

2016-03-24 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/29#discussion_r57401060 --- Diff: src/ports/postgres/modules/utilities/path.py_in --- @@ -118,140 +120,175 @@ def path(schema_madlib, source_table, output_table

[GitHub] incubator-madlib pull request: Release1.9alpha: Minor fixes for RC...

2016-03-07 Thread iyerr3
Github user iyerr3 commented on the pull request: https://github.com/apache/incubator-madlib/pull/27#issuecomment-193432184 Thanks, Roman. I can take care of the merge. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well

[GitHub] incubator-madlib pull request: SVM: Add class weights for use with...

2016-05-23 Thread iyerr3
GitHub user iyerr3 opened a pull request: https://github.com/apache/incubator-madlib/pull/43 SVM: Add class weights for use with unbalanced data JIRA: MADLIB-998 Added 'class_weight' in the 'params' argument. It can either be a string or a dictionary-like mapping

[GitHub] incubator-madlib pull request: SVM: Add class weights for use with...

2016-05-23 Thread iyerr3
Github user iyerr3 commented on the pull request: https://github.com/apache/incubator-madlib/pull/43#issuecomment-221046547 Testing with scikit learn revealed odd behavior. See figures below. 1) With equal class size the results are similar between scikit and madlib. 2

[GitHub] incubator-madlib pull request: SVM: Add class weights for use with...

2016-05-23 Thread iyerr3
Github user iyerr3 commented on the pull request: https://github.com/apache/incubator-madlib/pull/43#issuecomment-221083387 Things are better when we don't use the default step size. Using init_stepsize=0.1 instead of the default (0.01): ![with_weights_higher_init

[GitHub] incubator-madlib pull request: SVM: Add class weights for use with...

2016-05-23 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/43#discussion_r64303826 --- Diff: src/ports/postgres/modules/svm/svm.py_in --- @@ -781,6 +785,46 @@ def _random_feature_map(schema_madlib, source_table, dependent_varname

[GitHub] incubator-madlib pull request: Prediction Metrics: New module

2016-05-10 Thread iyerr3
Github user iyerr3 commented on the pull request: https://github.com/apache/incubator-madlib/pull/42#issuecomment-218301178 General comments: - The distance functions (mean_*_error) all have the same structure except the distance metric. I suggest refactoring the table creation

[GitHub] incubator-madlib pull request #54: Pivoting: Phase 2

2016-07-22 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/54#discussion_r71961433 --- Diff: src/ports/postgres/modules/utilities/pivot.py_in --- @@ -58,66 +61,255 @@ def pivot(schema_madlib, source_table, out_table

[GitHub] incubator-madlib pull request #54: Pivoting: Phase 2

2016-07-22 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/54#discussion_r71961026 --- Diff: src/ports/postgres/modules/utilities/pivot.py_in --- @@ -58,66 +61,255 @@ def pivot(schema_madlib, source_table, out_table

[GitHub] incubator-madlib pull request #54: Pivoting: Phase 2

2016-07-22 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/54#discussion_r71961445 --- Diff: src/ports/postgres/modules/utilities/pivot.py_in --- @@ -58,66 +61,255 @@ def pivot(schema_madlib, source_table, out_table

[GitHub] incubator-madlib pull request #54: Pivoting: Phase 2

2016-07-22 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/54#discussion_r71960956 --- Diff: src/ports/postgres/modules/utilities/pivot.py_in --- @@ -58,66 +61,255 @@ def pivot(schema_madlib, source_table, out_table

[GitHub] incubator-madlib pull request #54: Pivoting: Phase 2

2016-07-29 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/54#discussion_r72859110 --- Diff: src/ports/postgres/modules/utilities/pivot.py_in --- @@ -58,66 +61,256 @@ def pivot(schema_madlib, source_table, out_table

[GitHub] incubator-madlib pull request #54: Pivoting: Phase 2

2016-07-29 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/54#discussion_r72856491 --- Diff: src/ports/postgres/modules/utilities/pivot.py_in --- @@ -58,66 +61,256 @@ def pivot(schema_madlib, source_table, out_table

[GitHub] incubator-madlib pull request #54: Pivoting: Phase 2

2016-07-29 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/54#discussion_r72855881 --- Diff: src/ports/postgres/modules/utilities/pivot.py_in --- @@ -58,66 +61,256 @@ def pivot(schema_madlib, source_table, out_table

[GitHub] incubator-madlib pull request #54: Pivoting: Phase 2

2016-07-29 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/54#discussion_r72856933 --- Diff: src/ports/postgres/modules/utilities/pivot.py_in --- @@ -58,66 +61,256 @@ def pivot(schema_madlib, source_table, out_table

[GitHub] incubator-madlib issue #54: Pivoting: Phase 2

2016-07-29 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/54 Even for the other two, I suggest keeping the static parts all together with only the dynamic part created in the loops. That avoids having the `SELECT` clause separated from the `FROM

[GitHub] incubator-madlib issue #53: Sessionize function: Minor changes to online hel...

2016-07-14 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/53 Merged with e421cd5. @njayaram2 this PR can be closed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] incubator-madlib pull request #48: SVM: Novelty detection using 1-class SVM

2016-06-29 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/48#discussion_r69024778 --- Diff: src/ports/postgres/modules/svm/svm.py_in --- @@ -676,27 +985,32 @@ def svm_help(schema_madlib, message, is_svc, **kwargs): def svm

[GitHub] incubator-madlib issue #93: Allow encode_categorical_variables() to use the ...

2017-02-01 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/93 Thanks, Satoshi (@snaga). The changes look good - I merged this and updated the docs to reflect the new parameter. --- If your project is set up for it, you can reply to this email

[GitHub] incubator-madlib issue #95: Release v1.10:

2017-02-03 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/95 If the upgrade from versions prior to v1.8 is not supported then we should delete the corresponding changelist files to avoid confusion. --- If your project is set up for it, you can reply

[GitHub] incubator-madlib issue #95: Release v1.10:

2017-02-03 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/95 IMO, if we're not supporting upgrade from prior to v1.8 then we should force users to follow an upgrade chain. On Fri, Feb 3, 2017 at 12:39 PM, Orhan Kislal <notific

[GitHub] incubator-madlib pull request #91: Build: Use only major version for GPDB 5,...

2017-01-23 Thread iyerr3
GitHub user iyerr3 opened a pull request: https://github.com/apache/incubator-madlib/pull/91 Build: Use only major version for GPDB 5, HAWQ 2 GPDB, starting 5.0, and HAWQ, starting 2.0, are using semantic versioning for releases. This implies a binary compatibility between

[GitHub] incubator-madlib issue #84: PCA: Add grouping support to PCA

2017-01-20 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/84 This was merged with commit [02a7ef45](02a7ef453aa16e19eaa1e044ba18fe955fde0bf0) --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub

[GitHub] incubator-madlib issue #76: Include boost::format in MathToolkit_impl.hpp.

2017-01-26 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/76 Merged with 0e00a27. Need to close this PR (either by @lifepillar or via a future commit message) --- If your project is set up for it, you can reply to this email and have your reply

[GitHub] incubator-madlib pull request #89: K-means: support for array input

2017-01-26 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/89#discussion_r98118503 --- Diff: src/ports/postgres/modules/kmeans/kmeans.py_in --- @@ -34,6 +38,34 @@ def kmeans_validate_src(schema_madlib, rel_source, **kwargs

[GitHub] incubator-madlib issue #103: Docker Build for MADlib

2017-02-17 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/103 Looks good to me. I was able to run through the steps. Only comment would be to update documentation to reflect the `madlib` docker hub account. --- If your project is set up for it, you

[GitHub] incubator-madlib pull request #106: Build: Fix module sort order for PGXN in...

2017-02-24 Thread iyerr3
GitHub user iyerr3 opened a pull request: https://github.com/apache/incubator-madlib/pull/106 Build: Fix module sort order for PGXN installation JIRA: MADLIB-1024 PGXN installation involves creating a single extension sql file that contains all the SQL commands run

[GitHub] incubator-madlib pull request #101: Multiple: Add casting to allow compilati...

2017-02-10 Thread iyerr3
GitHub user iyerr3 opened a pull request: https://github.com/apache/incubator-madlib/pull/101 Multiple: Add casting to allow compilation with GCC 6+ JIRA: MADLIB-1025 GCC 6+ introduced stricter rules for implicit casting where loss of information is possible

[GitHub] incubator-madlib pull request #102: Build: Add error for missing server incl...

2017-02-15 Thread iyerr3
GitHub user iyerr3 opened a pull request: https://github.com/apache/incubator-madlib/pull/102 Build: Add error for missing server includedir JIRA: MADLIB-1065 Missing server headers is a common error with new contributors. This leads to a non-related error with CMake

[GitHub] incubator-madlib pull request #105: Graph:

2017-02-27 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/105#discussion_r103327658 --- Diff: doc/design/modules/graph.tex --- @@ -86,3 +131,49 @@ \section{Single Source Shortest Path} \label{sec:graph:sssp

[GitHub] incubator-madlib pull request #105: Graph:

2017-02-27 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/105#discussion_r103325238 --- Diff: doc/design/modules/graph.tex --- @@ -33,9 +34,53 @@ This module implements various graph algorithms that are used in a number

[GitHub] incubator-madlib pull request #:

2016-10-06 Thread iyerr3
Github user iyerr3 commented on the pull request: https://github.com/apache/incubator-madlib/commit/d62bfbe5234d9e0185cd8acda711fe6dabfd7574#commitcomment-19326675 I don't expect any specific performance or usage changes. We should do a full round of performance tests to confirm

[GitHub] incubator-madlib issue #70: Build: Fix version parsing in madpack

2016-10-25 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/70 Apologies for the rebased branch. New changes include: ``` Changes: - Fix madpack to assume revision string can be any valid Semantic versioning scheme. - Change

[GitHub] incubator-madlib issue #74: Correct the version used in MADlib rpm installat...

2016-11-10 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/74 Merged with commit 82e56a4 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] incubator-madlib issue #70: Build: Fix version parsing in madpack

2016-10-24 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/70 Need final approval to close this PR. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] incubator-madlib issue #70: Build: Fix version parsing in madpack

2016-10-25 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/70 See recent update to enable versions of the form "1.9.1-rc". Per semantic versioning, all identifier labels should be a suffix starting with a hyphen. This is considered a val

[GitHub] incubator-madlib issue #74: Correct the version used in MADlib rpm installat...

2016-11-28 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/74 @njayaram2: Could you please close this PR (forgot to close it with the commit). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub

[GitHub] incubator-madlib pull request #77: Cross validation in Elastic Net

2016-11-28 Thread iyerr3
GitHub user iyerr3 opened a pull request: https://github.com/apache/incubator-madlib/pull/77 Cross validation in Elastic Net Note: Currently this is rebased on top of the grouping PR (#69) and has the commits from that PR. When that PR is merged, this should be corrected to only

[GitHub] incubator-madlib issue #70: Build: Fix version parsing in madpack

2016-10-30 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/70 Thanks for the comment, @stephendotcarter. I thought I had changed that to int last week but clearly I didn't push to remote. Please check the updated code - I've also added some

[GitHub] incubator-madlib pull request #69: Elastic Net: Add grouping support

2016-11-03 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/69#discussion_r86223466 --- Diff: src/ports/postgres/modules/utilities/in_mem_group_control.py_in --- @@ -54,6 +54,18 @@ class BaseState(object

[GitHub] incubator-madlib issue #70: Build: Fix version parsing in madpack

2016-10-31 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/70 Thanks! Updated code. On Sun, Oct 30, 2016 at 9:39 AM, Stephen Carter <notificati...@github.com> wrote: > There is another line where we need to convert ba

[GitHub] incubator-madlib pull request #70: Build: Fix version parsing in madpack

2016-10-17 Thread iyerr3
GitHub user iyerr3 opened a pull request: https://github.com/apache/incubator-madlib/pull/70 Build: Fix version parsing in madpack Primary change is line 359. Other changes are whitespace fixes. You can merge this pull request into a Git repository by running: $ git pull

[GitHub] incubator-madlib pull request #70: Build: Fix version parsing in madpack

2016-10-17 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/70#discussion_r83716925 --- Diff: src/madpack/madpack.py --- @@ -357,7 +356,7 @@ def _get_rev_num(rev): @param rev version text ""&quo

[GitHub] incubator-madlib pull request #82: Encode categorical variables

2016-12-21 Thread iyerr3
GitHub user iyerr3 opened a pull request: https://github.com/apache/incubator-madlib/pull/82 Encode categorical variables JIRA: MADLIB-1038 Major overhaul of the dummy/one-hot encoding of categorical variables with new name and updated arguments. Older function has been

[GitHub] incubator-madlib pull request #77: Cross validation in Elastic Net

2016-12-21 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/77#discussion_r93540854 --- Diff: src/ports/postgres/modules/validation/internal/cross_validation.py_in --- @@ -336,22 +339,31 @@ class CrossValidator(object

[GitHub] incubator-madlib issue #110: Build: Update pom version for rat check

2017-03-23 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/110 retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] incubator-madlib issue #110: Build: Update pom version for rat check

2017-03-23 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/110 retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] incubator-madlib pull request #107: Build: Avoid downloading mathjax during ...

2017-03-15 Thread iyerr3
Github user iyerr3 closed the pull request at: https://github.com/apache/incubator-madlib/pull/107 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] incubator-madlib issue #107: Build: Avoid downloading mathjax during make do...

2017-03-15 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/107 Merged with 7be68936f2cf09e44fd9a8ae3e893db73dc99b26 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] incubator-madlib pull request #108: Pivot: Add support for array output

2017-03-15 Thread iyerr3
GitHub user iyerr3 opened a pull request: https://github.com/apache/incubator-madlib/pull/108 Pivot: Add support for array output JIRA: MADLIB-1066 When total pivoted columns exceed 1600, an array output becomes essential. This commit adds support to get each pivoted

[GitHub] incubator-madlib pull request #123: MADLIB-1076. Review LICENSE file and REA...

2017-04-20 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/123#discussion_r112568270 --- Diff: licenses/MADlib.txt --- @@ -1,10 +0,0 @@ -Portions of this software Copyright (c) 2010-2013 by EMC Corporation. All rights reserved

[GitHub] incubator-madlib issue #116: Unnest 2d array

2017-04-20 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/116 Jenkins, OK to test. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] incubator-madlib pull request #123: MADLIB-1076. Review LICENSE file and REA...

2017-04-20 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/123#discussion_r112544856 --- Diff: licenses/MADlib.txt --- @@ -1,10 +0,0 @@ -Portions of this software Copyright (c) 2010-2013 by EMC Corporation. All rights reserved

[GitHub] incubator-madlib pull request #119: Multiple: Minor changes for GPDB5 and HA...

2017-04-20 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/119#discussion_r112535164 --- Diff: src/ports/postgres/modules/graph/sssp.py_in --- @@ -432,11 +433,17 @@ def graph_sssp(schema_madlib, vertex_table, vertex_id, edge_table

[GitHub] incubator-madlib pull request #119: Multiple: Minor changes for GPDB5 and HA...

2017-04-20 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/119#discussion_r112534924 --- Diff: src/ports/postgres/modules/graph/sssp.py_in --- @@ -314,9 +314,13 @@ def graph_sssp(schema_madlib, vertex_table, vertex_id, edge_table

[GitHub] incubator-madlib pull request #119: Multiple: Minor changes for GPDB5 and HA...

2017-04-20 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/119#discussion_r112535321 --- Diff: src/ports/postgres/modules/elastic_net/test/elastic_net_install_check.sql_in --- @@ -840,27 +840,27 @@ SELECT elastic_net_train

[GitHub] incubator-madlib pull request #124: Bugfix/jenkins xml report

2017-04-20 Thread iyerr3
GitHub user iyerr3 opened a pull request: https://github.com/apache/incubator-madlib/pull/124 Bugfix/jenkins xml report You can merge this pull request into a Git repository by running: $ git pull https://github.com/iyerr3/incubator-madlib bugfix/jenkins_xml_report

[GitHub] incubator-madlib issue #124: Bugfix/jenkins xml report

2017-04-20 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/124 Note: Only two files have changed with this commit. For some reason github/master has not updated to upstream (apache). --- If your project is set up for it, you can reply to this email

[GitHub] incubator-madlib pull request #117: Decision Tree: Update defaults for max_d...

2017-04-18 Thread iyerr3
GitHub user iyerr3 opened a pull request: https://github.com/apache/incubator-madlib/pull/117 Decision Tree: Update defaults for max_depth, num_splits Reduce the defaults for max_depth to 7 and num_splits to 20 to ensure we don't run out of memory when initializing tree

[GitHub] incubator-madlib issue #124: Bugfix/jenkins xml report

2017-04-21 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/124 retest this please. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] incubator-madlib pull request #107: Build: Avoid downloading mathjax during ...

2017-03-07 Thread iyerr3
GitHub user iyerr3 opened a pull request: https://github.com/apache/incubator-madlib/pull/107 Build: Avoid downloading mathjax during make doc You can merge this pull request into a Git repository by running: $ git pull https://github.com/iyerr3/incubator-madlib infra

[GitHub] incubator-madlib issue #115: Task: Skip install-check for pmml

2017-04-14 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/115 jenkins, ok to test --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] incubator-madlib issue #115: Task: Skip install-check for pmml

2017-04-14 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/115 +1 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] incubator-madlib issue #117: Decision Tree: Update defaults for max_depth, n...

2017-04-18 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/117 Reading the commit description again, I would rephrase it as "Reduce the defaults for max_depth to 7 and num_splits to 20 to **minimize the chances of running out of m

[GitHub] incubator-madlib pull request #116: Unnest 2d array

2017-04-18 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/116#discussion_r112040453 --- Diff: methods/array_ops/src/pg_gp/array_ops.sql_in --- @@ -636,3 +663,30 @@ CREATE OR REPLACE FUNCTION MADLIB_SCHEMA.array_cum_prod(x anyarray

[GitHub] incubator-madlib issue #117: Decision Tree: Update defaults for max_depth, n...

2017-04-18 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/117 No, that's a separate JIRA: MADLIB-1057 <https://issues.apache.org/jira/browse/MADLIB-1057>. This one is just about setting the defaults to a more reasonable value consi

[GitHub] incubator-madlib issue #119: Multiple: Minor changes for GPDB5 and HAWQ2.2 s...

2017-04-18 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/119 OK to test --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] incubator-madlib pull request #120: DT: Assign memory only for reachable nod...

2017-04-18 Thread iyerr3
GitHub user iyerr3 opened a pull request: https://github.com/apache/incubator-madlib/pull/120 DT: Assign memory only for reachable nodes JIRA: MADLIB-1057 TreeAccumulator assigns a matrix to track the statistics of rows reaching the last layer of nodes. This matrix

[GitHub] incubator-madlib issue #117: Decision Tree: Update defaults for max_depth, n...

2017-04-18 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/117 The docs/latest corresponds to latest release (1.10) and won't be updated till the next release. We also have [docs/master](http://madlib.incubator.apache.org/docs/master/), which can

[GitHub] incubator-madlib issue #120: DT: Assign memory only for reachable nodes

2017-04-18 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/120 @ivannovick Those are good questions. Short answer is it's problem (data) dependent - the memory reduction depends on how sparse the tree is. I can run some experiments

[GitHub] incubator-madlib pull request #156: DT: Add option to treat NULL as category

2017-07-28 Thread iyerr3
GitHub user iyerr3 opened a pull request: https://github.com/apache/incubator-madlib/pull/156 DT: Add option to treat NULL as category This commit adds an option to treat NULL as a level in the categorical feature. The level is added as a string (instead of a NULL value

[GitHub] incubator-madlib pull request #156: DT: Add option to treat NULL as category

2017-08-02 Thread iyerr3
Github user iyerr3 commented on a diff in the pull request: https://github.com/apache/incubator-madlib/pull/156#discussion_r131048326 --- Diff: src/ports/postgres/modules/recursive_partitioning/decision_tree.py_in --- @@ -825,22 +855,34 @@ def _get_bins(schema_madlib

[GitHub] incubator-madlib issue #157: Multiple: Check optimizer_control before updati...

2017-08-01 Thread iyerr3
Github user iyerr3 commented on the issue: https://github.com/apache/incubator-madlib/pull/157 Added more details in the 2nd commit message which will be used in the final merge. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub

  1   2   >