[
https://issues.apache.org/jira/browse/MADLIB-1086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15971914#comment-15971914
]
ASF GitHub Bot commented on MADLIB-1086:
----------------------------------------
GitHub user rashmi815 opened a pull request:
https://github.com/apache/incubator-madlib/pull/116
Unnest 2d array
Array Operations: Add function to unnest 2-D arrays into rows of 1-D arrays
JIRA: MADLIB-1086
Function to unnest 2-D array by one level (i.e. into rows of 1-D arrays).
This is needed, for instance, in K-means, so that we can get one centroid
per row for follow on operations.
- Added function to array operations
- Added an example in k-means to demonstrate usage
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/rashmi815/incubator-madlib unnest_2d_array
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/incubator-madlib/pull/116.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #116
----
commit 18e562813702d12d620594598f471161a990fbbd
Author: Rashmi Raghu <[email protected]>
Date: 2017-04-15T00:08:17Z
Unnest function, install-check tests completed. Initial docs included
commit 2a4baffa29c8f976d3260931c1790cfc125e91f4
Author: Rashmi Raghu <[email protected]>
Date: 2017-04-15T06:20:01Z
Refactored names of function output columns
commit a3eae964adc84382fa674e4d95c486f472b14099
Author: Rashmi Raghu <[email protected]>
Date: 2017-04-17T23:45:32Z
Updated docs (array_ops and k-means) and minor update to install-check tests
----
> Unnest 2-D array by one level (i.e. into rows of 1-D arrays)
> ------------------------------------------------------------
>
> Key: MADLIB-1086
> URL: https://issues.apache.org/jira/browse/MADLIB-1086
> Project: Apache MADlib
> Issue Type: New Feature
> Components: Module: Utilities
> Reporter: Frank McQuillan
> Assignee: Rashmi Raghu
> Priority: Minor
> Fix For: v1.11
>
>
> Context
> Currently k-means returns the following
> {code}
> centroids |
> {{13.7533333333333,1.905,2.425,16.0666666666667,90.3333333333333,2.805,2.98,0.29,2.005,5.40663333333333,1.04166666666667,
> 3.31833333333333,1020.83333333333},
>
> {14.255,1.9325,2.5025,16.05,110.5,3.055,2.9775,0.2975,1.845,6.2125,0.9975,3.365,1378.75}}
> cluster_variance | {122999.110416013,30561.74805}
> objective_fn | 153560.858466013
> frac_reassigned | 0
> num_iterations | 3
> {code}
> Story
> As a data scientist, I want to unnest 2-D array by one level (i.e. into rows
> of 1-D arrays) in K-means, so that I can get one centroid per row for follow
> on operations.
> Acceptance
> 1) Add function to array operations
> http://madlib.incubator.apache.org/docs/latest/group__grp__array.html
> 2) Add an example in k-means
> http://madlib.incubator.apache.org/docs/latest/group__grp__kmeans.html
> to demonstrate usage
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)