How do you want to proceed on this? If it's expedient, I was going to clone Mahout into my own fork and do this work there, as before.
Sent from my iPad

> On May 5, 2014, at 9:36 PM, "Dmitriy Lyubimov" <[email protected]> wrote:
>
> Ok lets go one by one. Can you try and put mutate into scala dsl skeleton
> of in-core dataframe that does nothing?
>> On May 5, 2014 8:22 PM, "Saikat Kanjilal (JIRA)" <[email protected]> wrote:
>>
>>
>> [
>> https://issues.apache.org/jira/browse/MAHOUT-1490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13990236#comment-13990236]
>>
>> Saikat Kanjilal commented on MAHOUT-1490:
>> -----------------------------------------
>>
>> d.mutate( let("gain") equal { col("ArrDelay") - col("DepDelay") } ) seems
>> rather verbose
>>
>> how about something like the following:
>>
>> d.mutate((v= gain) equal { col("ArrDelay") - col("DepDelay") } )
>>
>> in the above expression v= would essentially create a defined identifier
>> called v
>>
>> I will put some more examples in the blog around select with this thinking
>>
>> Also what about the other functions I am proposing like running a function
>> around a dataframe (like map) or the slicing functionality around R, should
>> we keep those as part of this proposal?
>>
>>> Data frame R-like bindings
>>> --------------------------
>>>
>>> Key: MAHOUT-1490
>>> URL: https://issues.apache.org/jira/browse/MAHOUT-1490
>>> Project: Mahout
>>> Issue Type: New Feature
>>> Reporter: Saikat Kanjilal
>>> Assignee: Dmitriy Lyubimov
>>> Fix For: 1.0
>>>
>>> Original Estimate: 20h
>>> Remaining Estimate: 20h
>>>
>>> Create Data frame R-like bindings for spark
>>
>>
>>
>> --
>> This message was sent by Atlassian JIRA
>> (v6.2#6252)
>>
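
To make the "skeleton that does nothing" request concrete, here is a rough sketch of how the quoted let("gain") equal { ... } form could be made to compile in plain Scala. Everything in it (Expr, Col, Minus, Assignment, Let, DataFrameDsl, and this DataFrame class) is a hypothetical name made up for illustration, not existing Mahout code, and mutate only records what was asked for rather than computing anything.

```scala
// Hypothetical skeleton; none of these types exist in Mahout.

// A tiny expression tree for column arithmetic.
sealed trait Expr {
  def -(that: Expr): Expr = Minus(this, that)
}
final case class Col(name: String) extends Expr
final case class Minus(left: Expr, right: Expr) extends Expr

// The result of: let("gain") equal { ... }
final case class Assignment(name: String, expr: Expr)

// Intermediate builder so that `let("gain") equal { ... }` parses as an infix call.
final class Let(name: String) {
  def equal(expr: => Expr): Assignment = Assignment(name, expr)
}

object DataFrameDsl {
  def col(name: String): Expr = Col(name)
  def let(name: String): Let = new Let(name)
}

// In-core data frame stub: holds no data, only the assignments requested so far.
final class DataFrame(val pending: Vector[Assignment] = Vector.empty) {
  // Does nothing yet beyond recording the requested new columns.
  def mutate(assignments: Assignment*): DataFrame =
    new DataFrame(pending ++ assignments)
}
```

With import DataFrameDsl._ in scope, d.mutate( let("gain") equal { col("ArrDelay") - col("DepDelay") } ) returns a new DataFrame whose pending list holds a single Assignment named "gain". Keeping the expression as a tree instead of evaluating it is just one possible design choice; it leaves room to plug in real evaluation later.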
