Github user BenFradet commented on a diff in the pull request:
https://github.com/apache/spark/pull/10006#discussion_r46471157
--- Diff: docs/ml-features.md ---
@@ -1591,6 +1591,65 @@ transformer.transform(df).show()
</div>
+## SQLTransformer
+
+`SQLTransformer` implements the transforms which are defined by SQL
statement.
+Currently we only support SQL syntax like `"SELECT ... FROM __THIS__ ..."`
+where `"__THIS__"` represents the underlying table of the input dataset.
+The select clause specifies the fields, constants, and expressions to
display in
+the output, it can be any select clause that Spark SQL supported. Users
can also
+use Spark SQL build-in function and UDFs to operate on these selected
columns.
+For example, `SQLTransformer` support statements like:
--- End diff --
supports
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]