Frank McQuillan created MADLIB-1066:
---------------------------------------
Summary: CLONE - Pivoting - Phase 3
Key: MADLIB-1066
URL: https://issues.apache.org/jira/browse/MADLIB-1066
Project: Apache MADlib
Issue Type: Improvement
Components: Module: Utilities
Reporter: Frank McQuillan
Fix For: v2.0
Follow on to these JIRAs
https://issues.apache.org/jira/browse/MADLIB-908
https://issues.apache.org/jira/browse/MADLIB-1004
this capability is to carry over some good ideas from
https://issues.apache.org/jira/browse/MADLIB-1038
Candidate improvements:
* output column naming options
* adding an ‘*’ option and list of features to exclude
* pivot more than 1600 column limit, i.e., most MADlib algos take array input
so pivot should support array output
* Support non-STRICT functions in Greenplum and HAWQ; this was removed in 1.9.1
since it is not handled correctly. Does work OK for Postgres.
* others???
References
[1] Good data set
http://pbpython.com/pandas-pivot-table-explained.html
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)