GitHub user kaknikhil commented on a diff in the pull request:
https://github.com/apache/madlib/pull/334#discussion_r234051175
--- Diff: src/ports/postgres/modules/utilities/minibatch_preprocessing.py_in ---
@@ -508,8 +514,13 @@ class MiniBatchDocumentation:
  dependent_varname, -- TEXT. Name of the dependent variable column
  independent_varname, -- TEXT. Name of the independent variable column
- buffer_size -- INTEGER. Number of source input rows to
- pack into batch
+ grouping_col -- TEXT. Default NULL. An expression list used
+ to group the input dataset into discrete groups
+ buffer_size -- INTEGER. Default computed automatically.
+ Number of source input rows to pack into batch
--- End diff --
s/batch/buffer/ in the `buffer_size` description, i.e. "Number of source input rows to pack into buffer".
---