[
https://issues.apache.org/jira/browse/SPARK-10595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-10595:
----------------------------------
Shepherd: Feynman Liang
> Various ML programming guide cleanups post 1.5
> ----------------------------------------------
>
> Key: SPARK-10595
> URL: https://issues.apache.org/jira/browse/SPARK-10595
> Project: Spark
> Issue Type: Documentation
> Components: Documentation, ML, MLlib
> Affects Versions: 1.5.0
> Reporter: Joseph K. Bradley
> Assignee: Joseph K. Bradley
> Priority: Minor
>
> Various ML guide cleanups.
> * ml-guide.md: Make it easier to access the algorithm-specific guides.
> * LDA user guide: EM often begins with useless topics, but running longer
> generally improves them dramatically. E.g., 10 iterations on a Wikipedia
> dataset produces useless topics, but 50 iterations produces very meaningful
> topics.
> * mllib-feature-extraction.html#elementwiseproduct: “w” parameter should be
> “scalingVec”
> * Clean up Binarizer user guide a little.
> * Document in Pipeline that users should not put an instance into the
> Pipeline in more than 1 place.
> * spark.ml Word2Vec user guide: clean up grammar/writing
> * Chi Sq Feature Selector docs: Improve text in doc.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]