Github user mengxr commented on the pull request:
https://github.com/apache/spark/pull/3099#issuecomment-62693414
I tried to hide APIs as much as I can while maintaining the code at a level
where user can actually try creating, configuring, and tuning a pipeline. All
major classes are marked as "AlphaComponent". The schema transformation layer
is hidden. `Identifiable` and `UnaryTransformer` are marked `private[ml]`. As a
result, I copied `Tokenizer` to `ml.feature`. Attached is a list of public
classes. @mateiz
~~~
org.apache.spark.ml
Estimator
Evaluator
Model
Pipeline
PipelineModel
PipelineStage
Transformer
org.apache.spark.ml.classification
LogisticRegression
LogisticRegressionModel
org.apache.spark.ml.evaluation
BinaryClassificationEvaluator
org.apache.spark.ml.feature
HashingTF
StandardScaler
StandardScalerModel
Tokenizer
org.apache.spark.ml.param
BooleanParam
DoubleParam
FloatParam
IntParam
LongParam
Param
ParamMap
ParamPair
Params
org.apache.spark.ml.tuning
CrossValidator
CrossValidatorModel
ParamGridBuilder
~~~
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]