Thanks will do my best to move this along. The next biggest effort will be to get the blog to contain use cases before any more coding can continue.
> Date: Thu, 19 Jun 2014 14:26:08 -0700 > Subject: Re: Mahout-1490 and blog > From: [email protected] > To: [email protected] > > I will help, sure. But there's no (and cannot be) a single committer > responsible for reviewing changes of this magnitude. You have started a > fairly ambitious undertaking when you filed this issue. If this issue is > tackled elegantly, this would be the single biggest thing that happened to > Mahout in a long time, and will immediately give this project an edge by > allowing scripting out a quick and customizable end2end application. So > please give it your best! :) > > > On Thu, Jun 19, 2014 at 2:11 PM, Saikat Kanjilal <[email protected]> > wrote: > > > I'll work on these suggestions, Dmitry I noticed you assigned 1490 to > > Grant, I was wondering if you are still going to have time to help our > > reviewing work on this item.Thanks > > > > > From: [email protected] > > > Subject: Re: Mahout-1490 and blog > > > Date: Tue, 17 Jun 2014 18:55:18 -0700 > > > To: [email protected] > > > > > > I agree with Dmitriy. > > > > > > I note also the formatting on a desktop browser could be improved(e.g., > > fixed-font code blocks), and that on my phone it looks "off." > > > > > > If we want to send people to this we will want to tidy things up I think. > > > > > > > > > On Jun 17, 2014, at 6:24 PM, Dmitriy Lyubimov <[email protected]> wrote: > > > > > > > > I would not bring overly complicated examples. > > > > Show us how we would add a new computed column, group, aggregate, type > > > > conversion manipulations, etc. > > > > > > > > Anything that helps us to understand what it is we can do in shortest > > > > format possible. Current text IMO doesn't do that. > > > > > > > > > > > > On Tue, Jun 17, 2014 at 6:21 PM, Saikat Kanjilal <[email protected]> > > > > wrote: > > > > > > > >> I will refocus the blog on concrete use cases tied into the workflow > > of > > > >> using a Dataframe within mahout oriented perhaps towards a set of > > > >> algorithms focused on clustering, does that sound like a reasonable > > first > > > >> step? > > > >> > > > >> Sent from my iPhone > > > >> > > > >>>> On Jun 17, 2014, at 6:15 PM, "Dmitriy Lyubimov" <[email protected]> > > > >>> wrote: > > > >>> > > > >>> This blog IMO desperately needs conrete use examples, not apis. > > > >>> I would also remove R-specific examples as inconsequential. > > > >>> > > > >>> > > > >>> On Tue, Jun 17, 2014 at 6:05 PM, Saikat Kanjilal < > > [email protected]> > > > >>> wrote: > > > >>> > > > >>>> Hi Folks,I'm currently adding some bits to an earlier commit I had > > done > > > >>>> for Mahout-1490 to add a thin set of APIs around a dataframe, I was > > > >> going > > > >>>> to refine the blog specified here ( > > > >> > > http://mlefforts.blogspot.com/2014/04/introduction-this-proposal-will.html > > > >> ) > > > >>>> to include these APIs but before I put any more effort and time into > > > >> this > > > >>>> as well as into the code I wanted to ask if there's actual interest > > in > > > >> the > > > >>>> developer/committer community to support this and move it into > > mahout. > > > >>>> Please let me know thoughts and feedback and a general direction so > > > >> that I > > > >>>> can either continue or redirect these efforts per the current need. > > > >>>> Thanks in advance. > > > >> > > > >
