GitHub user kaknikhil opened a pull request:
https://github.com/apache/madlib/pull/254
Enable grouping for minibatch preprocessing
This PR enables grouping for the minibatch preprocessor module.
Other changes
1. Added install check test for special chars.
2. Improved error messages and created a reusable function for
testing column dimension in install check.
3. Added an optional flag to `utils_ind_var_scales_grouping` so as to
create a persistent x_mean table that will be reused as the
standardization table by the preprocessor module.
4. Added unittests for `input_tbl_valid` and `output_tbl_valid` in
validate_args.py_in
5. Raise custom exception for mocked plpy error.
Co-authored-by: Jingyi Mei <[email protected]>
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/madlib/madlib
feature/minibatch-preprocessing-grouping
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/madlib/pull/254.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #254
----
commit 32eb0d5fd55a502eef654a6309bc15c1fbf548d0
Author: Nikhil Kak <nkak@...>
Date: 2018-03-23T18:29:07Z
MiniBatch Pre-Processor: Add support for grouping
This commit enables grouping for the minibatch preprocessor module.
Other changes
1. Added install check test for special chars.
2. Improved error messages and created a reusable function for
testing column dimension in install check.
3. Add a new optional flag to utils_ind_var_scales_grouping so as to
create a persistent x_mean table that will be reused as the
standardization table by the preprocessor module.
Co-authored-by: Jingyi Mei <[email protected]>
commit e5be55d5ce5e23f04955f3b69ae23175b5d0d500
Author: Nikhil Kak <nkak@...>
Date: 2018-03-30T03:09:53Z
Add unit test file for validate args
This commit adds a new unittest file for the validate_args python file.
The only two functions tested right now are input_tbl_valid and
output_tbl_valid.
commit a4d8b69624a19d7a184a7878b1f043cf87618c4d
Author: Nikhil Kak <nkak@...>
Date: 2018-03-30T18:29:22Z
UnitTests: Raise custom exception for mocked plpy error.
Before this commit, all the unit tests that wanted to assert that
plpy.error was called had to assert that an Exception was raised. This
was too generic and did not distinguish between an exception coming from
the plpy mock class vs any other exception.
With this commit, we now raise a custom plpy exception so that we don't
need to assert for the equality of the error messages. Asserting for the
exception is proof enough that plpy.error was called.
----
---