GitHub user dwmclary opened a pull request:
https://github.com/apache/spark/pull/6919
Spark 7998 freq item api
Here's "a better frequent item API" which provides a DataFrame with each
ArrayBuffer expanded into a column. There's surely some improvement that could
be done here, but I think this is in the spirit of what the JIRA was asking for.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/dwmclary/spark SPARK-7998-freq-item-api
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/6919.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #6919
----
commit 55fc39c96c746a0d39f8789e0afe6ab09a3920e3
Author: dwmclary <[email protected]>
Date: 2015-06-19T14:38:49Z
adding freqItemsFrame
commit 1bbc32356e3def5da5ed6997d1dc00d1f488276b
Author: dwmclary <[email protected]>
Date: 2015-06-19T14:39:47Z
Merge branch 'master' of https://github.com/apache/spark into
SPARK-7998-freq-item-api
commit 4a173b800a703a9dfa37f4bada92db6d62813e09
Author: dwmclary <[email protected]>
Date: 2015-06-19T21:56:28Z
finished freqItemsFrame
commit 8dec609eb590f534040b9320fd22e8d5a139cc81
Author: dwmclary <[email protected]>
Date: 2015-06-19T21:58:03Z
Merge branch 'master' of https://github.com/apache/spark into
SPARK-7998-freq-item-api
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]