[jira] [Commented] (SPARK-19428) Ability to select first row of groupby

2017-10-07 Thread Sohum Sachdev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16195617#comment-16195617 ] Sohum Sachdev commented on SPARK-19428: --- [~lminer] This is a very interesting point you brought up.

[jira] [Commented] (SPARK-19428) Ability to select first row of groupby

2017-02-05 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15853272#comment-15853272 ] Herman van Hovell commented on SPARK-19428: --- You could also use a window function: {noformat}
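The comment preview above is cut off before its example, so the following is only a minimal plain-Python sketch of what a window-function approach typically amounts to: number the rows inside each group by some ordering, then keep the rows ranked first. The sample rows, the column layout (key, timestamp, value), and the helper name `first_per_group` are all hypothetical and are not taken from the thread or from Spark's API.

```python
from itertools import groupby
from operator import itemgetter

# Hypothetical sample rows: (group key, timestamp, value).
rows = [
    ("a", 3, "a-late"),
    ("a", 1, "a-early"),
    ("b", 2, "b-only"),
    ("a", 2, "a-mid"),
]


def first_per_group(rows, n=1):
    """Keep the n most recent rows per key.

    Roughly what ROW_NUMBER() OVER (PARTITION BY key ORDER BY ts DESC) <= n
    would select; this is an illustration, not Spark code.
    """
    # Sort by key, then by timestamp descending within each key.
    ordered = sorted(rows, key=lambda r: (r[0], -r[1]))
    result = []
    for _, group in groupby(ordered, key=itemgetter(0)):
        # enumerate plays the role of the row number inside the partition.
        result.extend(row for rank, row in enumerate(group, start=1) if rank <= n)
    return result


top1 = first_per_group(rows)
top2 = first_per_group(rows, n=2)
```

Passing a larger `n` gives the top-n generalization that comes up elsewhere in the thread.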

[jira] [Commented] (SPARK-19428) Ability to select first row of groupby

2017-02-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15853214#comment-15853214 ] Sean Owen commented on SPARK-19428: --- Yeah, I think it would require a custom aggregator after all as I

[jira] [Commented] (SPARK-19428) Ability to select first row of groupby

2017-02-04 Thread Luke Miner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15852946#comment-15852946 ] Luke Miner commented on SPARK-19428: --- I did not know of the existence of the {{first}} function for

[jira] [Commented] (SPARK-19428) Ability to select first row of groupby

2017-02-04 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15852929#comment-15852929 ] koert kuipers commented on SPARK-19428: --- generalizing to return top-x by some sorting is

[jira] [Commented] (SPARK-19428) Ability to select first row of groupby

2017-02-04 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15852895#comment-15852895 ] Hyukjin Kwon commented on SPARK-19428: -- Oh, the other comments show up later in my browser. Please

[jira] [Commented] (SPARK-19428) Ability to select first row of groupby

2017-02-04 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15852894#comment-15852894 ] Hyukjin Kwon commented on SPARK-19428: -- (FWIW, I think he meant... {code} >>> df =

[jira] [Commented] (SPARK-19428) Ability to select first row of groupby

2017-02-04 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15852893#comment-15852893 ] Hyukjin Kwon commented on SPARK-19428: -- [~lminer], another workaround might be (with {{from

[jira] [Commented] (SPARK-19428) Ability to select first row of groupby

2017-02-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15852892#comment-15852892 ] Sean Owen commented on SPARK-19428: --- Let's see, if as above you could get the nth most recent timestamp

[jira] [Commented] (SPARK-19428) Ability to select first row of groupby

2017-02-04 Thread Luke Miner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15852890#comment-15852890 ] Luke Miner commented on SPARK-19428: How could you do it that way? Normally the cutoff varies by

[jira] [Commented] (SPARK-19428) Ability to select first row of groupby

2017-02-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15852885#comment-15852885 ] Sean Owen commented on SPARK-19428: --- You can usually do this in two steps by finding the cutoff that
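The preview above is truncated, so the exact two steps being suggested are not visible. One common reading, sketched here in plain Python under that assumption, is to first compute a per-group cutoff (for example the latest timestamp) and then keep only the rows that match it. The data and the helper name `rows_at_cutoff` are hypothetical.

```python
# Hypothetical sample rows: (group key, timestamp, value).
events = [
    ("a", 3, "a-late"),
    ("a", 1, "a-early"),
    ("b", 2, "b-only"),
    ("a", 2, "a-mid"),
]


def rows_at_cutoff(rows):
    """Two-step selection: aggregate a cutoff per key, then filter on it."""
    # Step 1: the cutoff for each key is its maximum timestamp.
    cutoff = {}
    for key, ts, _ in rows:
        cutoff[key] = max(ts, cutoff.get(key, ts))
    # Step 2: keep the rows whose timestamp equals their key's cutoff.
    # Ties on the cutoff timestamp yield more than one row per key.
    return [row for row in rows if row[1] == cutoff[row[0]]]


latest = rows_at_cutoff(events)
```

Unlike a row-number ranking, this returns every row tied at the cutoff, so it gives exactly one row per group only when the ordering column is unique within each group.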

[jira] [Commented] (SPARK-19428) Ability to select first row of groupby

2017-02-04 Thread Luke Miner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15852884#comment-15852884 ] Luke Miner commented on SPARK-19428: That would be fantastic. Would it be possible to generalize it

[jira] [Commented] (SPARK-19428) Ability to select first row of groupby

2017-02-04 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15852882#comment-15852882 ] koert kuipers commented on SPARK-19428: --- getting a first element for each group (which is somewhat

[jira] [Commented] (SPARK-19428) Ability to select first row of groupby

2017-02-04 Thread Luke Miner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15852867#comment-15852867 ] Luke Miner commented on SPARK-19428: Unfortunately no, because that would just get me a row for a

[jira] [Commented] (SPARK-19428) Ability to select first row of groupby

2017-02-03 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15852582#comment-15852582 ] Takeshi Yamamuro commented on SPARK-19428: -- Thanks for the explanation!

[jira] [Commented] (SPARK-19428) Ability to select first row of groupby

2017-02-03 Thread Luke Miner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15851784#comment-15851784 ] Luke Miner commented on SPARK-19428: Couple of things. Sometimes I just want a random row from each

[jira] [Commented] (SPARK-19428) Ability to select first row of groupby

2017-02-02 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15851176#comment-15851176 ] Takeshi Yamamuro commented on SPARK-19428: -- What's this operation used for? > Ability to select