Frank McQuillan created MADLIB-1066:
---------------------------------------

             Summary: CLONE - Pivoting - Phase 3
                 Key: MADLIB-1066
                 URL: https://issues.apache.org/jira/browse/MADLIB-1066
             Project: Apache MADlib
          Issue Type: Improvement
          Components: Module: Utilities
            Reporter: Frank McQuillan
             Fix For: v2.0


Follow on to these JIRAs
https://issues.apache.org/jira/browse/MADLIB-908
https://issues.apache.org/jira/browse/MADLIB-1004

this capability is to carry over some good ideas from
https://issues.apache.org/jira/browse/MADLIB-1038

Candidate improvements:
* output column naming options
* adding an ‘*’ option and list of features to exclude
* pivot more than 1600 column limit, i.e., most MADlib algos take array input 
so pivot should support array output
* Support non-STRICT functions in Greenplum and HAWQ; this was removed in 1.9.1 
since it is not handled correctly.  Does work OK for Postgres.
* others??? 

References

[1] Good data set
http://pbpython.com/pandas-pivot-table-explained.html



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to