[
https://issues.apache.org/jira/browse/MAHOUT-1838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15264651#comment-15264651
]
ASF GitHub Bot commented on MAHOUT-1838:
----------------------------------------
GitHub user andrewpalumbo opened a pull request:
https://github.com/apache/mahout/pull/229
MAHOUT-1838: Simple 2d plotting of a sampled DRM
This is a bare-bones beginning to some mahout plots. Takes a `DRM` and
creates a scatter plot of a sample of the first two columns.
Tested in the shell and works when the sample size is hard coded. But am
currently having some problems with the simple calculation of the sample
percent- simple fix I'm sure..
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/andrewpalumbo/mahout visiualization
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/mahout/pull/229.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #229
----
commit bf0a461abf32178b476a107b9752facafcb3575b
Author: Andrew Palumbo <[email protected]>
Date: 2016-04-27T22:35:46Z
initial commit
commit 09418724f5c2fadbf288900dba6e4ec259847677
Author: Andrew Palumbo <[email protected]>
Date: 2016-04-28T04:19:28Z
crashing in shell
commit 80198df57b3ee690b04caa77a14136a0b89d2b37
Author: Andrew Palumbo <[email protected]>
Date: 2016-04-28T04:45:36Z
crashing in shell: error: bad symbolic reference. A signature in
mplot2d.class refers to term smile
commit 09859df4047bfa092980bc4f755fb3a865b10f39
Author: Andrew Palumbo <[email protected]>
Date: 2016-04-29T03:31:05Z
working plot
commit 51f6fd97c7cfcca846d37fabf866916a60d9a8db
Author: Andrew Palumbo <[email protected]>
Date: 2016-04-29T04:38:43Z
operational plotting
commit f8fc304529ce0036e9ffbff5367cc1e30542018a
Author: Andrew Palumbo <[email protected]>
Date: 2016-04-29T19:18:09Z
remove unnecessary deps. refactoring (spelling), licsense, etc
commit fa614825269be070f7202a4b921f1fea1c4a2b52
Author: Andrew Palumbo <[email protected]>
Date: 2016-04-29T20:01:02Z
add default kryo serilizer mb to the spark-shell. problem calculating
sample percent in mplot2d- getting 0 back
----
> Provide and plotting capabilities for Mahout mtrices and DRMs
> --------------------------------------------------------------
>
> Key: MAHOUT-1838
> URL: https://issues.apache.org/jira/browse/MAHOUT-1838
> Project: Mahout
> Issue Type: New Feature
> Components: visiualization
> Affects Versions: 0.12
> Reporter: Andrew Palumbo
> Assignee: Andrew Palumbo
> Fix For: 0.13
>
> Attachments: drmSamplePlot2d.png
>
>
> Add basic 2d and 3d plotting capabilities to Mahout using the Smile library:
> https://github.com/haifengl/smile
> I've prototyped an {{mplot2d}} class here to sample a certain percentage of a
> DRM's data using {{drmSampleKRows()}} and to create a 2d plot from the
> resulting x,y coordinate matrix, and tested from the spark-shell as a POC.
> Extending to 3d should be trivial:
> The Smile-Plot library has some very nice plotting features, all of which can
> easily integrated into mahout:
> http://haifengl.github.io/smile/index.html#gallery
> http://haifengl.github.io/smile/index.html#visualization
>
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)