Hi Misgana,

Please find the answers inline.

> 1. Do I need only to work with carbon-ml repo or should the whole kernel be
> installed?

You don't need to build the kernel. Building carbon-ml [1] and then product-ml [2] is enough.

> The Build from source Documentation site varies from what i have setup(uses
> svn, downloads whole kernel). Should i follow this?

Just download the source code ([1] and [2]) and run a Maven clean install ("mvn clean install") from the source directory. As I mentioned above, there is no need to download or build the Carbon kernel.

> Could you suggest on how to setup my dev environment? Currently I
> installed Spark, converted my project to a maven project. But maven seems
> not to properly compile.

There is no need to install Spark. Spark is only used as an external library (jars). If you are using Eclipse or IntelliJ IDEA, import the source code as a Maven project and the IDE will resolve the dependencies automatically.

> I implemented Gradientboosted to core/spark/Algorithms.

What is the purpose of implementing Gradientboosted?

> Next step would be to modify *SupervisedSparkModelBuilder.*

I think it would be easier for you to first (once you have finished the steps above) write a simple standalone Java client that reads a simple dataset (a CSV file) and builds the ensemble model with Spark. You can then integrate that logic into SupervisedSparkModelBuilder, and eventually into the model-building workflow of WSO2 ML.

[1] https://github.com/wso2/carbon-ml
[2] https://github.com/wso2/product-ml
[3] https://docs.wso2.com/display/ML110/Building+from+Source

Regards,
Supun

On Thu, Apr 28, 2016 at 5:22 PM, misgana <[email protected]> wrote:

> Hi Supun,
>
> My current workflow looks like this:
> 1. Fork and clone carbon-ml repo from GitHub -- DONE
> 2. Setup Dev environment -- IN PROGRESS
> 3. Integrate GradientBoosted tree algorithm to carbon-ml -- IN PROGRESS
>
> Issues:
> 1. Do I need only to work with carbon-ml repo or should the whole kernel
> be installed?
> 2. The Build from source Documentation site varies from what i have
> setup(uses svn, downloads whole kernel). Should i follow this?
> 3. Could you suggest on how to setup my dev environment? Currently I
> installed Spark, converted my project to a maven project. But maven seems
> not to properly compile.
> 4. I implemented Gradientboosted to core/spark/Algorithms. Next step would
> be to modify *SupervisedSparkModelBuilder*.
> 5. Here I would check how this would be integrated in the whole framework
> and test on Iris dataset. (On this I need to do some reading)
>
> I would very much appreciate your guidance on this plan/work in progress.
>
> Best,
> Misgana
>
> On 26.04.2016 09:26, Misgana Negassi wrote:
>
> Hi Supun,
> I have forked carbon-ml to my repo [1] and currently I am familiarizing
> myself with the code and software architecture. I will make commits after
> trying out with a new algorithm.
>
> [1] https://github.com/zemoel/carbon-ml
>
> On 26.04.2016 06:47, Supun Sethunga wrote:
>
> Hi Misgana,
>
> As you progress, please keep us posted too. It would be nice if you
> can share your code as well (GitHub project). You can take a fork of repo
> [1], and start working on your fork.
>
> [1] https://github.com/wso2/carbon-ml
>
> On Mon, Apr 25, 2016 at 7:57 PM, Misgana Negassi <[email protected]> wrote:
>
>> Hi Supun,
>>
>> Thank you for accepting me for this project! I am excited to work on it
>> and start right away with the links you sent.
>>
>> Best,
>> Misgana
>>
>> On 25.04.2016 12:06, Supun Sethunga wrote:
>>
>> Hi Misgana,
>>
>> Congratulations on getting accepted for GSoC 2016! Hope you are
>> ready to get started with the project.
>>
>> To get more familiarized with the code, I'm sharing the implementations
>> of the current algorithms [1]. For your ensemble method, you need to add
>> three more cases (for the three types of ensembles) for the method [2].
>> You may try out adding a new algorithm to the existing flow, and see how it
>> works. Please feel free to raise any questions/issues you come across.
>>
>> [1] https://github.com/wso2/carbon-ml/tree/master/components/ml/org.wso2.carbon.ml.core/src/main/java/org/wso2/carbon/ml/core/spark/algorithms
>> [2] https://github.com/wso2/carbon-ml/blob/master/components/ml/org.wso2.carbon.ml.core/src/main/java/org/wso2/carbon/ml/core/spark/algorithms/SupervisedSparkModelBuilder.java#L101
>>
>> Regards,
>> Supun
>>
>> On Thu, Mar 24, 2016 at 9:31 PM, Misgana Negassi <[email protected]> wrote:
>>
>>> Hi Supun,
>>>
>>> Thank you for your support and advice in this proposal process!
>>>
>>> In case you are interested, I am attaching my report paper, which
>>> contains my work with ensemble methods, particularly Stacking.
>>>
>>> Best,
>>> Misgana
>>>
>>> On 24.03.2016 04:12, Supun Sethunga wrote:
>>>
>>> Looks good! Please go ahead and submit to GSoC.
>>>
>>> Thanks,
>>> Supun
>>>
>>> On Thu, Mar 24, 2016 at 4:02 AM, Misgana Negassi <[email protected]> wrote:
>>>
>>>> Hi Supun,
>>>>
>>>> I have added the changes you recommended. Could you kindly give me
>>>> feedback?
>>>>
>>>> Best,
>>>> Misgana
>>>>
>>>> On 23.03.2016 15:04, Supun Sethunga wrote:
>>>>
>>>> Hi Misgana,
>>>>
>>>> I went through your proposal. Overall it looks good. Here are a few
>>>> comments I would like to point out:
>>>>
>>>> - It's better to have some sort of an architecture diagram,
>>>> explaining your solution at a higher level.
>>>> - In the timeline, better to break down the "Week 1-3 (May 23 - June 20,
>>>> 2016)" into three sub-levels, and allocate timeslots for each of
>>>> the three methods (Stacking, Boosting and Bagging) separately. That
>>>> would make it easy for you to work on those methods separately, as
>>>> well as to track the progress.
>>>> - In the timeline, can you double check the "week" numbers? For
>>>> example, in [*Week 1-3 (May 23 - June 20, 2016)*], I guess it should be
>>>> "*Week 1-4*" (there are four weeks in the mentioned duration).
>>>> Similarly, check the others too.
>>>>
>>>> Please share the draft proposal with us once you fix those.
>>>>
>>>> Thanks,
>>>> Supun
>>>>
>>>> On Wed, Mar 23, 2016 at 7:17 PM, Misgana Negassi <[email protected]> wrote:
>>>>
>>>>> Hi Supun,
>>>>>
>>>>> I am attaching my proposal draft. I am very grateful for your comments.
>>>>>
>>>>> Thanks,
>>>>> Misgana
>>>>>
>>>>> On 23.03.2016 04:54, Supun Sethunga wrote:
>>>>>
>>>>> Hi Misgana,
>>>>>
>>>>> As we have mentioned in the project proposal as well, the main
>>>>> objective is to integrate ensemble support into the existing flow of the
>>>>> WSO2 Machine Learner. We are focusing on the three methods: Bagging,
>>>>> Boosting and Stacking. (One technique for each of these methods)
>>>>>
>>>>> If you haven't tried it out already, you can get to know the Machine
>>>>> Learner product by downloading it and running it (please use link [1] to
>>>>> download). The official documentation [2] and blog [3] will help you with
>>>>> how to use the product. You can also go through the source code of WSO2 ML
>>>>> ([4] and [5]), and get familiarized with the current implementations.
>>>>>
>>>>> In the meantime, as Nirmal mentioned, can you please send us the draft of
>>>>> the proposal so that we can review it and give you feedback?
>>>>>
>>>>> [1] http://wso2.com/products/machine-learner/
>>>>> [2] https://docs.wso2.com/display/ML100/Introducing+Machine+Learner
>>>>> [3] http://supunsetunga.blogspot.com/2015/09/building-your-first-predictive-model.html
>>>>> [4] https://github.com/wso2/carbon-ml
>>>>> [5] https://github.com/wso2/product-ml
>>>>>
>>>>> Thanks,
>>>>> Supun
>>>>>
>>>>> On Wed, Mar 23, 2016 at 7:20 AM, Nirmal Fernando <[email protected]> wrote:
>>>>>
>>>>>> Thanks, Misgana, for your interest in a WSO2 ML GSoC project. Whilst I
>>>>>> let Supun give you some more information on the project, I encourage you
>>>>>> to create a draft proposal and send it to us for review.
>>>>>>
>>>>>> On Wed, Mar 23, 2016 at 2:58 AM, Misgana Negassi <[email protected]> wrote:
>>>>>>
>>>>>>> Hello!
>>>>>>>
>>>>>>> I am Misgana, hailing from Freiburg, Germany and I am interested in
>>>>>>> working with you on the Ensemble methods. I have already implemented
>>>>>>> Stacking in Python (code available in github/zemoel) and compared it to
>>>>>>> other ensemble methods such as Ensemble Selection on AUC performance
>>>>>>> measures. The comparison also included using the above-mentioned methods
>>>>>>> as part of an automated machine learning platform (Autosklearn).
>>>>>>>
>>>>>>> I am currently working on my proposal and would be grateful for your
>>>>>>> reply.
>>>>>>>
>>>>>>> Misgana
>>>>>>
>>>>>> --
>>>>>>
>>>>>> Thanks & regards,
>>>>>> Nirmal
>>>>>>
>>>>>> Team Lead - WSO2 Machine Learner
>>>>>> Associate Technical Lead - Data Technologies Team, WSO2 Inc.
>>>>>> Mobile: +94715779733
>>>>>> Blog: http://nirmalfdo.blogspot.com/
>>>>>
>>>>> --
>>>>> *Supun Sethunga*
>>>>> Software Engineer
>>>>> WSO2, Inc.
>>>>> http://wso2.com/
>>>>> lean | enterprise | middleware
>>>>> Mobile : +94 716546324

--
*Supun Sethunga*
Software Engineer
WSO2, Inc.
http://wso2.com/
lean | enterprise | middleware
Mobile : +94 716546324
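
P.S. A minimal sketch of the standalone-client idea suggested in the reply at the top of this thread (read a small CSV dataset, build an ensemble, predict). It is only an illustration under stated assumptions: it deliberately uses no Spark and no WSO2 ML classes, and swaps in a hand-rolled bagging ensemble of decision stumps in plain Java so that the flow can be tried without any dependencies. The class BaggingSketch, its methods and the toy data are all hypothetical. In a real client the training step would presumably be delegated to Spark instead.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

// Plain-Java illustration only: no Spark, no WSO2 ML classes.
// All names here (BaggingSketch, Stump, TOY_CSV, ...) are hypothetical.
class BaggingSketch {

    // Toy dataset in "feature1,feature2,label" form; the label is 0 or 1.
    static final String TOY_CSV =
            "1.0,2.0,0\n" + "1.2,1.8,0\n" + "0.8,2.2,0\n" + "1.1,2.1,0\n"
          + "5.0,7.0,1\n" + "5.2,6.8,1\n" + "4.8,7.2,1\n" + "5.1,7.1,1\n";

    // A decision stump: predicts 1 when the chosen feature exceeds the threshold.
    // Simplification: it never learns the opposite direction.
    static final class Stump {
        final int feature;
        final double threshold;

        Stump(int feature, double threshold) {
            this.feature = feature;
            this.threshold = threshold;
        }

        int predict(double[] x) {
            return x[feature] > threshold ? 1 : 0;
        }
    }

    // Parses CSV text into numeric rows; the last column of each row is the label.
    static List<double[]> parseCsv(String csv) {
        List<double[]> rows = new ArrayList<>();
        for (String line : csv.split("\n")) {
            String trimmed = line.trim();
            if (trimmed.isEmpty()) {
                continue;
            }
            String[] parts = trimmed.split(",");
            double[] row = new double[parts.length];
            for (int i = 0; i < parts.length; i++) {
                row[i] = Double.parseDouble(parts[i].trim());
            }
            rows.add(row);
        }
        return rows;
    }

    // Tries every (feature, observed value) pair as a split and keeps the one
    // with the fewest misclassified rows in the given sample.
    static Stump trainStump(List<double[]> sample) {
        int numFeatures = sample.get(0).length - 1;
        Stump best = null;
        int bestErrors = Integer.MAX_VALUE;
        for (int f = 0; f < numFeatures; f++) {
            for (double[] candidate : sample) {
                Stump stump = new Stump(f, candidate[f]);
                int errors = 0;
                for (double[] row : sample) {
                    if (stump.predict(row) != (int) row[numFeatures]) {
                        errors++;
                    }
                }
                if (errors < bestErrors) {
                    bestErrors = errors;
                    best = stump;
                }
            }
        }
        return best;
    }

    // Bagging: train each stump on a bootstrap sample (drawn with replacement).
    static List<Stump> trainBagging(List<double[]> data, int numModels, long seed) {
        Random rng = new Random(seed);
        List<Stump> models = new ArrayList<>();
        for (int m = 0; m < numModels; m++) {
            List<double[]> sample = new ArrayList<>();
            for (int i = 0; i < data.size(); i++) {
                sample.add(data.get(rng.nextInt(data.size())));
            }
            models.add(trainStump(sample));
        }
        return models;
    }

    // Majority vote over all stumps; ties go to class 0.
    static int predict(List<Stump> models, double[] features) {
        int votes = 0;
        for (Stump stump : models) {
            votes += stump.predict(features);
        }
        return 2 * votes > models.size() ? 1 : 0;
    }

    public static void main(String[] args) {
        List<double[]> data = parseCsv(TOY_CSV);
        List<Stump> ensemble = trainBagging(data, 25, 42L);
        System.out.println("(0.5, 1.0) -> " + predict(ensemble, new double[] {0.5, 1.0}));
        System.out.println("(6.0, 8.0) -> " + predict(ensemble, new double[] {6.0, 8.0}));
    }
}
```

The two query points lie outside the range of the training values on both features, so every stump votes the same way for them whatever the bootstrap samples look like. Once this flow works end to end, the parse/train/predict steps map onto the pieces that would later move into SupervisedSparkModelBuilder.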
_______________________________________________
Dev mailing list
[email protected]
http://wso2.org/cgi-bin/mailman/listinfo/dev
