[
https://issues.apache.org/jira/browse/MADLIB-1004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15365078#comment-15365078
]
ASF GitHub Bot commented on MADLIB-1004:
----------------------------------------
GitHub user orhankislal opened a pull request:
https://github.com/apache/incubator-madlib/pull/54
Pivoting: Phase 2
JIRA: MADLIB-1004
This phase of the pivoting development adds the following functionality:
- Multiple indices
- Multiple pivot columns
- Multiple value columns
- Multiple aggregate functions
- Value column specific aggregate functions
- Keep null values in pivot columns
- Fill null values in the output table
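For readers unfamiliar with these options, the features listed above can be illustrated with a small pure-Python sketch. This is not the MADlib implementation or its SQL interface: the function name pivot, its parameters (index, pivot_cols, value_aggs, keep_null, fill_value), and the output column naming scheme are all assumptions made for illustration only.

```python
from collections import defaultdict


def pivot(rows, index, pivot_cols, value_aggs, keep_null=False, fill_value=None):
    """Hypothetical pivot helper (illustration only, not MADlib's API).

    rows        list of dicts, one per input row
    index       column names that identify an output row
    pivot_cols  column names whose distinct values become output columns
    value_aggs  {value column: {aggregate name: function over a list}}
    keep_null   if True, a None pivot value gets its own output column
    fill_value  value for output cells that received no input values
    """
    cells = defaultdict(list)  # (index key, output column) -> collected values
    index_keys = {}            # dicts used as insertion-ordered sets
    out_cols = {}              # output column name -> aggregate function
    for row in rows:
        pivot_key = tuple(row[c] for c in pivot_cols)
        if None in pivot_key and not keep_null:
            continue  # assumed behaviour: drop rows with a null pivot value
        idx_key = tuple(row[c] for c in index)
        index_keys[idx_key] = None
        suffix = "_".join(
            f"{c}_{'null' if v is None else v}"
            for c, v in zip(pivot_cols, pivot_key)
        )
        # Each value column carries its own set of aggregate functions.
        for val_col, aggs in value_aggs.items():
            for agg_name, agg in aggs.items():
                col = f"{val_col}_{agg_name}_{suffix}"
                out_cols[col] = agg
                if row[val_col] is not None:
                    cells[(idx_key, col)].append(row[val_col])

    result = []
    for idx_key in index_keys:
        out = dict(zip(index, idx_key))
        for col, agg in out_cols.items():
            values = cells.get((idx_key, col))
            # Cells with no input values are filled rather than aggregated.
            out[col] = agg(values) if values else fill_value
        result.append(out)
    return result


def mean(values):
    return sum(values) / len(values)


# Made-up sample data for the example.
rows = [
    {"id": 1, "region": "east", "sales": 10, "qty": 1},
    {"id": 1, "region": "east", "sales": 5, "qty": 2},
    {"id": 1, "region": "west", "sales": 7, "qty": 3},
    {"id": 2, "region": None, "sales": 4, "qty": 4},
    {"id": 2, "region": "east", "sales": 1, "qty": 5},
]

table = pivot(
    rows,
    index=["id"],
    pivot_cols=["region"],
    value_aggs={"sales": {"sum": sum, "avg": mean}, "qty": {"max": max}},
    keep_null=True,
    fill_value=0,
)
for line in table:
    print(line)
```

Passing more than one name in index or pivot_cols exercises the multiple-index and multiple-pivot-column cases; with keep_null=False the row whose region is None would be skipped and no region_null columns would be produced.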
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/orhankislal/incubator-madlib feature/pivoting_take5
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/incubator-madlib/pull/54.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #54
----
commit aa7bce4387487c7d96f3fcccde1e95250996fbf7
Author: Orhan Kislal <[email protected]>
Date: 2016-07-06T21:00:29Z
Pivoting: Phase 2
JIRA: MADLIB-1004
This phase of the pivoting development adds the following functionality:
- Multiple indices
- Multiple pivot columns
- Multiple value columns
- Multiple aggregate functions
- Value column specific aggregate functions
- Keep null values in pivot columns
- Fill null values in the output table
----
> Pivoting - Phase 2 (advanced pivot)
> -----------------------------------
>
> Key: MADLIB-1004
> URL: https://issues.apache.org/jira/browse/MADLIB-1004
> Project: Apache MADlib
> Issue Type: New Feature
> Components: Module: Utilities
> Reporter: Frank McQuillan
> Fix For: v1.9.1
>
>
> Story
> As a data scientist, I want to perform *advanced* pivot operations on my data,
> so that I can prepare it for input to predictive analytics algorithms.
> Details
> * Advanced pivot for this story means the features defined as MVP in the
> requirements attached.
> In general, we are following Pandas ideas.
> References
> [1] General information on pivot tables (what pivoting is)
> https://en.wikipedia.org/wiki/Pivot_table
> [2] Pandas pivot tables and cross-tabulations
> http://pandas.pydata.org/pandas-docs/stable/reshaping.html#pivot-tables-and-cross-tabulations
> http://pandas.pydata.org/pandas-docs/stable/cookbook.html#cookbook-pivot
> http://pbpython.com/pandas-pivot-table-explained.html
> [3] GPDB pivot_sum function
> http://gpdb.docs.pivotal.io/4320/admin_guide/query.html#topic30
> [4] PostgreSQL tablefunc
> http://www.postgresql.org/docs/9.4/static/tablefunc.html
> [5] PDL tools pivoting routines
> http://pdl-tools.pa.pivotal.io/group__grp__pivot.html
> http://pdl-tools.pa.pivotal.io/group__grp__pivot01.html
> [6] Aster Pivot and Unpivot functions
> User Guide
> http://www.info.teradata.com/eDownload.cfm?itemid=122580002
> [7] PostgreSQL aggregates
> http://www.postgresql.org/docs/8.2/static/functions-aggregate.html
> [8] PostgreSQL basic statements/assignment operator
> http://www.postgresql.org/docs/8.2/static/plpgsql-statements.html
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)