GitHub user BryanCutler opened a pull request:
https://github.com/apache/spark/pull/14308
[SPARK-16260][EXAMPLES][ML] Improve ML Example Outputs
## What changes were proposed in this pull request?
Improve example outputs to better reflect the functionality that is being
presented. This mostly consisted of modifying what was printed at the end of
the example, such as calling show() with truncate=False, but sometimes required
minor tweaks in the example data to get relevant output. Explicitly set
parameters when they are used as part of the example. Fixed Java examples that
failed to run because of using old-style MLlib Vectors or problem with schema.
Synced examples between different APIs.
## How was this patch tested?
Ran each example for Scala, Python, and Java and made sure output was
legible on a terminal of width 100.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/BryanCutler/spark
ml-examples-improve-output-SPARK-16260
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/14308.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #14308
----
commit 7b4496b16517b01c01abd6aebe84b53876265b82
Author: Bryan Cutler <[email protected]>
Date: 2016-07-19T00:22:12Z
finished going through about a third of examples
commit 6e4ed29e704e4805fff312b561f8e41919e014eb
Author: Bryan Cutler <[email protected]>
Date: 2016-07-20T00:23:25Z
Fixed more examples, about half done now
commit 26718e9da96de142e4bb3078ffdacbf94e4c3d47
Author: Bryan Cutler <[email protected]>
Date: 2016-07-20T18:19:45Z
more progress up to NaiveBayes example
commit ff066ce1ad3391c707cd21b4802c5843a70a2da9
Author: Bryan Cutler <[email protected]>
Date: 2016-07-21T00:26:59Z
further progress up to PCA example
commit 53a29411c5969d1bc25ace3817cc927213fcb0b7
Author: Bryan Cutler <[email protected]>
Date: 2016-07-21T04:28:12Z
continued throught examples up to Tf Idf
commit 38c319945e854939f86b8e3f67ebcb04d0be532f
Author: Bryan Cutler <[email protected]>
Date: 2016-07-21T20:22:31Z
finished remaining ml examples
commit a8093bec8fc4090711e6d7b56001a288db03235d
Author: Bryan Cutler <[email protected]>
Date: 2016-07-21T20:37:22Z
fixed style checks
commit afe2b2ad3069363de62a6f25cd1e4ac706b9e6b8
Author: Bryan Cutler <[email protected]>
Date: 2016-07-21T20:57:36Z
fixed Java import ordering
commit b7384cef97f89730f4f400873c8369775bbe994e
Author: Bryan Cutler <[email protected]>
Date: 2016-07-21T21:09:41Z
minor cleanup
commit ae2249a3396f6585c504986234d664dd23f9c401
Author: Bryan Cutler <[email protected]>
Date: 2016-07-21T21:33:35Z
made accurracy reporting consistent
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]