GitHub user kaknikhil commented on a diff in the pull request:
https://github.com/apache/madlib/pull/334#discussion_r234051252
--- Diff: src/ports/postgres/modules/utilities/minibatch_preprocessing.py_in ---
@@ -487,10 +487,16 @@ class MiniBatchDocumentation:
----------------------------------------------------------------
SUMMARY
----------------------------------------------------------------
- MiniBatch Preprocessor is a utility function to pre process the input
- data for use with models that support mini-batching as an optimization
+ The mini-batch preprocessor is a utility that prepares input data for
+ use by models that support mini-batch as an optimization option. (This
+ is currently only the case for Neural Networks.) It is effectively a
+ packing operation that builds arrays of dependent and independent
+ variables from the source data table.
- #TODO add more here
+ The advantage of using mini-batching is that it can perform better than
+ stochastic gradient descent (default MADlib optimizer) because it uses
+ more than one training example at a time, typically resulting faster
--- End diff --
Missing the word `in`: this should read `resulting in faster .....`
---