Github user viirya commented on a diff in the pull request:
https://github.com/apache/spark/pull/20285#discussion_r162224903
--- Diff: docs/ml-features.md ---
@@ -1283,6 +1283,56 @@ for more details on the API.
</div>
</div>
+## VectorSizeHint
+
+It can sometimes be useful to explicitly specify the size of the vectors a column of
+`VectorType`. For example, `VectorAssembler` uses size information from its input columns to
+produce size information and metadata for its output column. While in some cases this information
+can be obtained by inspecting the contents of the column, in a streaming dataframe the contents are
+not available until the stream is started. `VectorSizeHint` allows a user to explicitly specify the
--- End diff --
nit: a user -> an user
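
For context, the idea in the quoted paragraph can be sketched in plain Python. This is only an illustration, not Spark's implementation: `ColumnMeta`, `with_size_hint`, and `assembled_size` are hypothetical names invented here. It shows how a size declared up front lets a downstream step compute its output size without reading any rows, which is what matters when the data is a stream that has not started.

```python
# Illustrative only: plain Python, not Spark's implementation.
# All names below are hypothetical stand-ins.
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ColumnMeta:
    """Hypothetical stand-in for a vector column's metadata."""
    name: str
    size: Optional[int] = None  # None means the size is not yet known


def with_size_hint(col: ColumnMeta, size: int) -> ColumnMeta:
    """Attach a declared size without looking at any rows."""
    if size <= 0:
        raise ValueError("size must be positive")
    return ColumnMeta(col.name, size)


def assembled_size(cols: List[ColumnMeta]) -> int:
    """Output size of a hypothetical assembler: the sum of the input sizes."""
    unknown = [c.name for c in cols if c.size is None]
    if unknown:
        raise ValueError(f"size unknown for columns: {unknown}")
    return sum(c.size for c in cols)
```

Without the hint, `assembled_size` has nothing to work with and raises. With it, the output size is known before any data arrives.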
---