GitHub user holdenk commented on a diff in the pull request:
https://github.com/apache/spark/pull/14830#discussion_r76582404
--- Diff: examples/src/main/python/ml/binarizer_example.py ---
@@ -17,9 +17,10 @@
from __future__ import print_function
-from pyspark.sql import SparkSession
# $example on$
from pyspark.ml.feature import Binarizer
+from pyspark.sql import SparkSession
--- End diff --
Some of the example files are used to generate the website
documentation, and the "example on" and "example off" tags determine
which parts get pulled into the website. In this case that is done so we
don't repeat the same boilerplate imports in every example and instead
show only the ones specific to that example. Moving the `SparkSession`
import below the `# $example on$` tag, as this diff does, would pull it
into the rendered snippet. You can take a look at `./docs/ml-features.md`,
which includes this file, to see how it's used in markdown, and at the
generated website documentation at
http://spark.apache.org/docs/latest/ml-features.html#binarizer .
The instructions for building the docs locally are in `./docs/README.md`.
Let me know if you need any help with that. The documentation build is
sometimes overlooked since many of the developers don't often build it
manually.
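
To make the tag behaviour concrete, here is a minimal Python sketch of the idea. It is only an illustration, not the code the docs build actually runs: the function name `extract_example` and the `sample` text are hypothetical, and the closing marker is assumed to be written `# $example off$` by analogy with the `# $example on$` line in the diff.

```python
def extract_example(source):
    """Return only the lines between the example on/off markers.

    Illustrative sketch; the marker spellings below are assumptions
    based on the `# $example on$` line shown in the diff.
    """
    kept = []
    inside = False
    for line in source.splitlines():
        marker = line.strip()
        if marker == "# $example on$":
            inside = True       # start copying lines into the docs snippet
        elif marker == "# $example off$":
            inside = False      # stop copying until the next "on" marker
        elif inside:
            kept.append(line)
    return "\n".join(kept)


# Hypothetical file contents: the SparkSession import sits outside the
# tagged region, so it is left out of the extracted snippet.
sample = """\
from __future__ import print_function

from pyspark.sql import SparkSession
# $example on$
from pyspark.ml.feature import Binarizer
# $example off$
"""

print(extract_example(sample))
```

With the import order from before this diff, only the `Binarizer` import lands in the snippet; with the order proposed in the diff, the `SparkSession` import would be copied as well.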