Currently, you have to edit code the add new algorithms. We will make it configurable eventually, but at this stage we do not need a framework IMHO.
--Srinath On Thu, Jul 24, 2014 at 7:46 PM, Rajith Siriwardena <[email protected]> wrote: > > Okay got it! :) > > @supun : If you don't mind can you share the architecture diagram. I'm bit > interested in how it's done. how can we introduce new algorithms to the > component? > > Thanks, > Rajith > > > On Thu, Jul 24, 2014 at 10:33 AM, Supun Sethunga <[email protected]> wrote: > >> Hi Rajith, >> >> The service is a stand-alone carbon component, and not a UDAF. Hive will >> only be used to retrieve and store the data which will eventually used by >> the service. >> >> Regards, >> Supun >> >> >> On Thu, Jul 24, 2014 at 9:28 AM, Rajith Siriwardena <[email protected]> >> wrote: >> >>> Hi Supun, >>> >>> Are you using UDAFs for the purpose? Can explain a bit how it's done? >>> >>> >>> On Wed, Jul 23, 2014 at 5:30 PM, Supun Sethunga <[email protected]> wrote: >>> >>>> Hi, >>>> >>>> The back-end service for the $subject has been implemented. For the >>>> time being, it supports three algorithms (Naive Bayes, Logistic Regression >>>> and Multilayer Perceptrons). The service accepts a JSON object having the >>>> input data file path, output path where the created the model should be >>>> saved, the algorithm to be used for modeling (one of the above three >>>> algos), and the hyper-parameters and their values. >>>> >>>> Next goal would be to develop a carbon UI component, which allows a >>>> user to call the above service, in a form of a wizard. The wizard would >>>> provide a step-by-step guide to the user to create a ML model for any set >>>> of data. >>>> >>>> Appreciate any suggestions. >>>> >>>> Regards, >>>> Supun >>>> >>>> >>>> On Fri, Jul 4, 2014 at 7:39 AM, Srinath Perera <[email protected]> >>>> wrote: >>>> >>>>> Anjana and myself discussed. >>>>> >>>>> UDFs does not work. UDFs take an event as the input while mahout takes >>>>> a complete dataset. >>>>> >>>>> May be we can use UDAF (A = Aggregate), but has to explore more. >>>>> >>>>> >>>>> On Tue, Jul 1, 2014 at 10:03 AM, Anjana Fernando <[email protected]> >>>>> wrote: >>>>> >>>>>> I see, sure, I was thinking of doing all the operations, including >>>>>> the training operations using an UDF. Will come and meet you. >>>>>> >>>>>> Cheers, >>>>>> Anjana. >>>>>> >>>>>> >>>>>> On Tue, Jul 1, 2014 at 9:56 AM, Srinath Perera <[email protected]> >>>>>> wrote: >>>>>> >>>>>>> No, we need to get the data, preprocess them using hive, and send >>>>>>> all the data (not 1-2 values, rather say 10 millions values) to training >>>>>>> phase. Lets chat f2f a bit. >>>>>>> >>>>>>> --Srinath >>>>>>> >>>>>>> >>>>>>> On Tue, Jul 1, 2014 at 6:24 AM, Anjana Fernando <[email protected]> >>>>>>> wrote: >>>>>>> >>>>>>>> Hi, >>>>>>>> >>>>>>>> I was simply thinking, the UDF could directly mapped to some basic >>>>>>>> Mahout operation it implements, and the input/output should be given as >>>>>>>> parameters to the UDF, so probably, we can publish some input data >>>>>>>> beforehand to Cassandra etc.. and give the location of that data to the >>>>>>>> UDF, and the UDF will, as it is called, create the map/reduce jobs and >>>>>>>> execute. >>>>>>>> >>>>>>>> Cheers, >>>>>>>> Anjana. >>>>>>>> >>>>>>>> >>>>>>>> On Tue, Jul 1, 2014 at 9:18 AM, Srinath Perera <[email protected]> >>>>>>>> wrote: >>>>>>>> >>>>>>>>> +1 we wanted to explore that more. >>>>>>>>> >>>>>>>>> However, It is not a simple UDF as this is a stateful op where we >>>>>>>>> feed lot of data and start a separate map reduce process. Anjana, do >>>>>>>>> you >>>>>>>>> have any thoughts on how it can be done? >>>>>>>>> >>>>>>>>> >>>>>>>>> On Tue, Jul 1, 2014 at 5:37 AM, Anjana Fernando <[email protected]> >>>>>>>>> wrote: >>>>>>>>> >>>>>>>>>> Hi, >>>>>>>>>> >>>>>>>>>> I'm just wondering if we have any way to integrate this to Hive >>>>>>>>>> itself (UDF?), to get results of an ML algorithm run, to a result >>>>>>>>>> there. A >>>>>>>>>> similar scenario is possible in Shark/MLlib integration. >>>>>>>>>> >>>>>>>>>> Cheers, >>>>>>>>>> Anjana. >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> On Mon, Jun 30, 2014 at 12:28 PM, Supun Sethunga <[email protected] >>>>>>>>>> > wrote: >>>>>>>>>> >>>>>>>>>>> Hi, >>>>>>>>>>> >>>>>>>>>>> Im working on the $subject, and the objective is to apply >>>>>>>>>>> Machine Learning algorithms on the data stored by WSO2 BAM. Apache >>>>>>>>>>> Mahout >>>>>>>>>>> will be used as the ML tool, for this purpose. >>>>>>>>>>> >>>>>>>>>>> As per the discussion I had With Srinath, the procedure for >>>>>>>>>>> $subject would be: >>>>>>>>>>> >>>>>>>>>>> - Test a Machine Learning algorithm using Mahout libraries >>>>>>>>>>> within Java. >>>>>>>>>>> - Implement a RESTful service which provides the above >>>>>>>>>>> functionality. >>>>>>>>>>> - Since Mahout also uses Hadoop, the above service can send >>>>>>>>>>> Map Reduce Jobs to the Hadoop built inside the BAM. >>>>>>>>>>> - Deploy the service as a Carbon Component on WSO2 BAM. >>>>>>>>>>> >>>>>>>>>>> The first step is completed for now. >>>>>>>>>>> Any feedback is highly appreciated. >>>>>>>>>>> >>>>>>>>>>> Thanks, >>>>>>>>>>> Supun >>>>>>>>>>> >>>>>>>>>>> -- >>>>>>>>>>> *Supun Sethunga* >>>>>>>>>>> Software Engineer >>>>>>>>>>> WSO2, Inc. >>>>>>>>>>> lean | enterprise | middleware >>>>>>>>>>> Mobile : +94 716546324 >>>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> -- >>>>>>>>>> *Anjana Fernando* >>>>>>>>>> Senior Technical Lead >>>>>>>>>> WSO2 Inc. | http://wso2.com >>>>>>>>>> lean . enterprise . middleware >>>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> -- >>>>>>>>> ============================ >>>>>>>>> Director, Research, WSO2 Inc. >>>>>>>>> Visiting Faculty, University of Moratuwa >>>>>>>>> Member, Apache Software Foundation >>>>>>>>> Research Scientist, Lanka Software Foundation >>>>>>>>> Blog: http://srinathsview.blogspot.com twitter:@srinath_perera >>>>>>>>> Site: http://people.apache.org/~hemapani/ >>>>>>>>> Photos: http://www.flickr.com/photos/hemapani/ >>>>>>>>> Phone: 0772360902 >>>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> -- >>>>>>>> *Anjana Fernando* >>>>>>>> Senior Technical Lead >>>>>>>> WSO2 Inc. | http://wso2.com >>>>>>>> lean . enterprise . middleware >>>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> -- >>>>>>> ============================ >>>>>>> Director, Research, WSO2 Inc. >>>>>>> Visiting Faculty, University of Moratuwa >>>>>>> Member, Apache Software Foundation >>>>>>> Research Scientist, Lanka Software Foundation >>>>>>> Blog: http://srinathsview.blogspot.com twitter:@srinath_perera >>>>>>> Site: http://people.apache.org/~hemapani/ >>>>>>> Photos: http://www.flickr.com/photos/hemapani/ >>>>>>> Phone: 0772360902 >>>>>>> >>>>>> >>>>>> >>>>>> >>>>>> -- >>>>>> *Anjana Fernando* >>>>>> Senior Technical Lead >>>>>> WSO2 Inc. | http://wso2.com >>>>>> lean . enterprise . middleware >>>>>> >>>>> >>>>> >>>>> >>>>> -- >>>>> ============================ >>>>> Director, Research, WSO2 Inc. >>>>> Visiting Faculty, University of Moratuwa >>>>> Member, Apache Software Foundation >>>>> Research Scientist, Lanka Software Foundation >>>>> Blog: http://srinathsview.blogspot.com twitter:@srinath_perera >>>>> Site: http://people.apache.org/~hemapani/ >>>>> Photos: http://www.flickr.com/photos/hemapani/ >>>>> Phone: 0772360902 >>>>> >>>> >>>> >>>> >>>> -- >>>> *Supun Sethunga* >>>> Software Engineer >>>> WSO2, Inc. >>>> lean | enterprise | middleware >>>> Mobile : +94 716546324 >>>> >>>> _______________________________________________ >>>> Architecture mailing list >>>> [email protected] >>>> https://mail.wso2.org/cgi-bin/mailman/listinfo/architecture >>>> >>>> >>> >>> >>> -- >>> *Rajith Siriwardana* >>> Software Engineer >>> WSO2 Inc. ; http://wso2.com >>> *lean. enterprise. middleware* >>> >>> ------------------------------------------------------ >>> *http://people.apache.org/~siriwardana >>> <http://people.apache.org/~siriwardana>* >>> >> >> >> >> -- >> *Supun Sethunga* >> Software Engineer >> WSO2, Inc. >> lean | enterprise | middleware >> Mobile : +94 716546324 >> > > > > -- > *Rajith Siriwardana* > Software Engineer > WSO2 Inc. ; http://wso2.com > *lean. enterprise. middleware* > > ------------------------------------------------------ > *http://people.apache.org/~siriwardana > <http://people.apache.org/~siriwardana>* > > _______________________________________________ > Architecture mailing list > [email protected] > https://mail.wso2.org/cgi-bin/mailman/listinfo/architecture > > -- ============================ Director, Research, WSO2 Inc. Visiting Faculty, University of Moratuwa Member, Apache Software Foundation Research Scientist, Lanka Software Foundation Blog: http://srinathsview.blogspot.com twitter:@srinath_perera Site: http://people.apache.org/~hemapani/ Photos: http://www.flickr.com/photos/hemapani/ Phone: 0772360902
_______________________________________________ Architecture mailing list [email protected] https://mail.wso2.org/cgi-bin/mailman/listinfo/architecture
