How do you want to proceed on this, if it's expedient I was going to clone 
mahout into my own fork and do this work as before.

Sent from my iPad

> On May 5, 2014, at 9:36 PM, "Dmitriy Lyubimov" <[email protected]> wrote:
> 
> Ok lets go one by one. Can you try and put mutate into scala dsl skeleton
> of in-core dataframe that does nothing?
>> On May 5, 2014 8:22 PM, "Saikat Kanjilal (JIRA)" <[email protected]> wrote:
>> 
>> 
>>    [
>> https://issues.apache.org/jira/browse/MAHOUT-1490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13990236#comment-13990236]
>> 
>> Saikat Kanjilal commented on MAHOUT-1490:
>> -----------------------------------------
>> 
>> d.mutate( let("gain") equal { col("ArrDelay") - col("DepDelay") } ) seems
>> rather verbose
>> 
>> how about something like the following:
>> 
>> d.mutate((v= gain) equal { col("ArrDelay") - col("DepDelay") } )
>> 
>> in the above expression v= would essentially create a defined identifier
>> called v
>> 
>> I will put some more examples in the blog around select with this thinking
>> 
>> Also what about the other functions I am proposing like running a function
>> around a dataframe (like map) or the slicing functionality around R, should
>> we keep those as part of this proposal?
>> 
>>> Data frame R-like bindings
>>> --------------------------
>>> 
>>>                Key: MAHOUT-1490
>>>                URL: https://issues.apache.org/jira/browse/MAHOUT-1490
>>>            Project: Mahout
>>>         Issue Type: New Feature
>>>           Reporter: Saikat Kanjilal
>>>           Assignee: Dmitriy Lyubimov
>>>            Fix For: 1.0
>>> 
>>>  Original Estimate: 20h
>>> Remaining Estimate: 20h
>>> 
>>> Create Data frame R-like bindings for spark
>> 
>> 
>> 
>> --
>> This message was sent by Atlassian JIRA
>> (v6.2#6252)
>> 

Reply via email to