[
https://issues.apache.org/jira/browse/MADLIB-1004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15389836#comment-15389836
]
Orhan Kislal edited comment on MADLIB-1004 at 7/22/16 8:49 PM:
---------------------------------------------------------------
I agree that a general discussion on the distribution policy will be beneficial
since it is not always clear which column will make a good candidate for
distribution. I'll hold off the commit I was planning for now but it should be
noted that if there are multiple index columns, just picking the first one may
skew the data significantly.
was (Author: okislal):
I agree that a general discussion on the distribution policy will be beneficial
since it is not always clear which column will make a good candidate for
distribution. I'll hold off the commit I was planning for now but it should be
noted that if there are multiple index columns, just picking the first one may
skew the data considerably.
> Pivoting - Phase 2 (advanced pivot)
> -----------------------------------
>
> Key: MADLIB-1004
> URL: https://issues.apache.org/jira/browse/MADLIB-1004
> Project: Apache MADlib
> Issue Type: New Feature
> Components: Module: Utilities
> Reporter: Frank McQuillan
> Fix For: v1.9.1
>
>
> Story
> As a data scientist, I want to perform *advanced* pivot operation on my data,
> so that I can prepare it for input to predictive analytics algorithms.
> Details
> * Advanced pivot for this story means the features defined as MVP in the
> requirements attached.
> In general, we are following Pandas ideas.
> References
> [1] Pivot table general information, like what is pivoting?
> https://en.wikipedia.org/wiki/Pivot_table
> [2] Pandas pivot tables and cross-tabulations
> http://pandas.pydata.org/pandas-docs/stable/reshaping.html#pivot-tables-and-cross-tabulations
> http://pandas.pydata.org/pandas-docs/stable/cookbook.html#cookbook-pivot
> http://pbpython.com/pandas-pivot-table-explained.html
> [3] GPDB pivot_sum function
> http://gpdb.docs.pivotal.io/4320/admin_guide/query.html#topic30
> [4] PostgreSQL tablefunc
> http://www.postgresql.org/docs/9.4/static/tablefunc.html
> [5] PDL tools pivoting routines
> http://pdl-tools.pa.pivotal.io/group__grp__pivot.html
> http://pdl-tools.pa.pivotal.io/group__grp__pivot01.html
> [6] Aster Pivot and Unpivot functions
> User Guide
> http://www.info.teradata.com/eDownload.cfm?itemid=122580002
> [7] PostgreSQL aggregates
> http://www.postgresql.org/docs/8.2/static/functions-aggregate.html
> [8] PostgreSQL basic statements/assignment operator,
> http://www.postgresql.org/docs/8.2/static/plpgsql-statements.html
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)