Any way you like. A fork will make things easier, but since we are talking about a fairly isolated experiment at this point, it is not required.
On Mon, May 5, 2014 at 9:40 PM, Saikat Kanjilal <[email protected]> wrote:

> How do you want to proceed on this, if it's expedient I was going to clone
> mahout into my own fork and do this work as before.
>
> Sent from my iPad
>
> > On May 5, 2014, at 9:36 PM, "Dmitriy Lyubimov" <[email protected]> wrote:
> >
> > Ok lets go one by one. Can you try and put mutate into scala dsl skeleton
> > of in-core dataframe that does nothing?
> >> On May 5, 2014 8:22 PM, "Saikat Kanjilal (JIRA)" <[email protected]> wrote:
> >>
> >> [ https://issues.apache.org/jira/browse/MAHOUT-1490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13990236#comment-13990236 ]
> >>
> >> Saikat Kanjilal commented on MAHOUT-1490:
> >> -----------------------------------------
> >>
> >> d.mutate( let("gain") equal { col("ArrDelay") - col("DepDelay") } ) seems
> >> rather verbose
> >>
> >> how about something like the following:
> >>
> >> d.mutate((v= gain) equal { col("ArrDelay") - col("DepDelay") } )
> >>
> >> in the above expression v= would essentially create a defined identifier
> >> called v
> >>
> >> I will put some more examples in the blog around select with this thinking
> >>
> >> Also what about the other functions I am proposing like running a function
> >> around a dataframe (like map) or the slicing functionality around R, should
> >> we keep those as part of this proposal?
> >>
> >>> Data frame R-like bindings
> >>> --------------------------
> >>>
> >>> Key: MAHOUT-1490
> >>> URL: https://issues.apache.org/jira/browse/MAHOUT-1490
> >>> Project: Mahout
> >>> Issue Type: New Feature
> >>> Reporter: Saikat Kanjilal
> >>> Assignee: Dmitriy Lyubimov
> >>> Fix For: 1.0
> >>>
> >>> Original Estimate: 20h
> >>> Remaining Estimate: 20h
> >>>
> >>> Create Data frame R-like bindings for spark
> >>
> >> --
> >> This message was sent by Atlassian JIRA
> >> (v6.2#6252)
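
For reference, here is a minimal sketch of what a do-nothing `mutate` skeleton along the lines quoted above might look like. Only the call shape `d.mutate( let("gain") equal { col("ArrDelay") - col("DepDelay") } )` comes from the JIRA comment. Every name below (`DataFrameDsl`, `InCoreDataFrame`, `Expr`, `Col`, `Minus`, `Assignment`, `Let`) is hypothetical and invented for illustration; none of it is existing Mahout API.

```scala
// Hypothetical skeleton: all names here are assumptions, not Mahout API.
object DataFrameDsl {

  // Tiny expression tree so that col("a") - col("b") builds a value
  // instead of computing anything.
  sealed trait Expr {
    def -(that: Expr): Expr = Minus(this, that)
  }
  final case class Col(name: String) extends Expr
  final case class Minus(left: Expr, right: Expr) extends Expr

  // A derived column: the target name plus the expression defining it.
  final case class Assignment(target: String, expr: Expr)

  // Returned by let("name") so that `let("gain") equal { ... }`
  // parses as an ordinary infix method call.
  final class Let(target: String) {
    def equal(expr: Expr): Assignment = Assignment(target, expr)
  }

  def let(target: String): Let = new Let(target)
  def col(name: String): Expr = Col(name)

  // In-core data frame placeholder. mutate accepts the assignments and
  // deliberately does nothing with them, returning the same frame.
  final class InCoreDataFrame {
    def mutate(assignments: Assignment*): InCoreDataFrame = this
  }
}
```

With `import DataFrameDsl._` and `val d = new InCoreDataFrame`, the expression from the JIRA comment type-checks unchanged and returns `d` itself. The alternative spelling `(v= gain) equal { ... }` is not valid Scala as written, so it is not covered by this sketch.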
