GitHub user BryanCutler opened a pull request:

    https://github.com/apache/spark/pull/14308

    [SPARK-16260][EXAMPLES][ML]  Improve ML Example Outputs

    ## What changes were proposed in this pull request?
    Improve example outputs to better reflect the functionality that is being 
presented.  This mostly consisted of modifying what was printed at the end of 
the example, such as calling show() with truncate=False, but sometimes required 
minor tweaks in the example data to get relevant output.  Explicitly set 
parameters when they are used as part of the example.  Fixed Java examples that 
failed to run because of using old-style MLlib Vectors or problem with schema.  
Synced examples between different APIs.
    
    ## How was this patch tested?
    Ran each example for Scala, Python, and Java and made sure output was 
legible on a terminal of width 100.


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/BryanCutler/spark 
ml-examples-improve-output-SPARK-16260

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/14308.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #14308
    
----
commit 7b4496b16517b01c01abd6aebe84b53876265b82
Author: Bryan Cutler <[email protected]>
Date:   2016-07-19T00:22:12Z

    finished going through about a third of examples

commit 6e4ed29e704e4805fff312b561f8e41919e014eb
Author: Bryan Cutler <[email protected]>
Date:   2016-07-20T00:23:25Z

    Fixed more examples, about half done now

commit 26718e9da96de142e4bb3078ffdacbf94e4c3d47
Author: Bryan Cutler <[email protected]>
Date:   2016-07-20T18:19:45Z

    more progress up to NaiveBayes example

commit ff066ce1ad3391c707cd21b4802c5843a70a2da9
Author: Bryan Cutler <[email protected]>
Date:   2016-07-21T00:26:59Z

    further progress up to PCA example

commit 53a29411c5969d1bc25ace3817cc927213fcb0b7
Author: Bryan Cutler <[email protected]>
Date:   2016-07-21T04:28:12Z

    continued throught examples up to Tf Idf

commit 38c319945e854939f86b8e3f67ebcb04d0be532f
Author: Bryan Cutler <[email protected]>
Date:   2016-07-21T20:22:31Z

    finished remaining ml examples

commit a8093bec8fc4090711e6d7b56001a288db03235d
Author: Bryan Cutler <[email protected]>
Date:   2016-07-21T20:37:22Z

    fixed style checks

commit afe2b2ad3069363de62a6f25cd1e4ac706b9e6b8
Author: Bryan Cutler <[email protected]>
Date:   2016-07-21T20:57:36Z

    fixed Java import ordering

commit b7384cef97f89730f4f400873c8369775bbe994e
Author: Bryan Cutler <[email protected]>
Date:   2016-07-21T21:09:41Z

    minor cleanup

commit ae2249a3396f6585c504986234d664dd23f9c401
Author: Bryan Cutler <[email protected]>
Date:   2016-07-21T21:33:35Z

    made accurracy reporting consistent

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to