GitHub user iyerr3 opened a pull request:
https://github.com/apache/incubator-madlib/pull/138
Summary: Add param to determine num of cols per run
JIRA: MADLIB-1117
Summary used a hard-coded parameter of a maximum of 15 columns per run.
This was put in place to avoid out-of-memory errors in most cases.
This, however, limits the run time since higher number of columns can be
summarized in a single run for a simpler data set (one which leads to
smaller sketch data structures).
This commit adds a new parameter allowing users to set this limit,
while retaining the old default of 15 columns.
Closes #138
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/iyerr3/incubator-madlib
feature/summary_add_parameter
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/incubator-madlib/pull/138.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #138
----
commit 1cca783b63111d004662f314cef67e9be8bb9a92
Author: Rahul Iyer <[email protected]>
Date: 2017-06-05T23:36:50Z
Summary: Add param to determine num of cols per run
JIRA: MADLIB-1117
Summary used a hard-coded parameter of a maximum of 15 columns per run.
This was put in place to avoid out-of-memory errors in most cases.
This, however, limits the run time since higher number of columns can be
summarized in a single run for a simpler data set (one which leads to
smaller sketch data structures).
This commit adds a new parameter allowing users to set this limit,
while retaining the old default of 15 columns.
Closes #138
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---