GitHub user yinxusen opened a pull request:
https://github.com/apache/spark/pull/5731
[SPARK-5895][ML] add vector slicer
JIRA issue [here](https://issues.apache.org/jira/browse/SPARK-5895).
I have some thoughts of `AttributeGroup`:
1. End-user is hard to add `AttributeGroup` or `Attribute` with `DataFrame`;
2. The constructor of `AttributeGroup` cannot covariance with subclass of
`Attribute`, say, it requires an `Array[Attribute]`, but I cannot give it an
`Array[NumericAttribute]`.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/yinxusen/spark SPARK-5895
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/5731.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #5731
----
commit aa5a0bfb4d5f69ad8c62be7e366957eef06a87af
Author: Xusen Yin <[email protected]>
Date: 2015-04-24T15:46:37Z
add vector slicer
commit 9ab97478d8974541d2b326eacb34cf0092349a67
Author: Xusen Yin <[email protected]>
Date: 2015-04-26T17:06:37Z
add vector slicer
commit 17171f8ffa3424265d90f956e836f2d3f5ad934f
Author: Xusen Yin <[email protected]>
Date: 2015-04-27T23:36:17Z
fix slicer
commit fd154d78188d5de87e12fec8e4fb454055378385
Author: Xusen Yin <[email protected]>
Date: 2015-04-28T02:45:48Z
add test suite of vector slicer
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]