Hi Misgana,

Please find the answers inline.

1. Do I need only to work with carbon-ml repo or should the whole kernel be
> installed?

Don't need to build the kernal. Building carbon-ml [1] and then product-ml
[2] would be enough.

The Build from source Documentation site varies from what i have setup(uses
> svn, downloads whole kernel). Should i follow this?

Just download the source-code ([1] and [2]), and execute a "maven clean
install" from the source directory. As I mentioned earlier, no need to
download or build the Carbon Kernal.

Could  you suggest on how to setup my dev environment? Currently I
> installed Spark, converted my project to a maven project. But maven seems
> not to properly compile.

No need to install spark. Spark is only used as an external library (jars).
If you are using Eclipse/IntelliJ IDEA, import the source-code as a maven
project. IDE will automatically resolve the dependencies.

 I implemented Gradientboosted to core/spark/Algorithms.

What's the purpose of implementing Gradientboosted?

Next step would be to modify *SupervisedSparkModelBuilder.*

I think it would be easier for you to first (after finished with the above
steps) write a simple standalone java client, which reads a simple dataset
(a csv file) and build the ensemble model with Spark. Then you can
integrate that logic to the SupervisedSparkModelBuilder, and eventually to
the model-building workflow of WSO2 ML.

[1] https://github.com/wso2/carbon-ml
[2] https://github.com/wso2/product-ml
[3] https://docs.wso2.com/display/ML110/Building+from+Source

Regards,
Supun


On Thu, Apr 28, 2016 at 5:22 PM, misgana <[email protected]> wrote:

> Hi Supun,
>
> My current workflow looks like this:
> 1. Fork and clone carbon-ml repo form github  -- DONE
> 2. Setup Dev environment -- IN PROGRESS
> 3. Integrate GradientBoosted tree algorithm to carbon-ml  -- IN PROGRESS
>
> Issues:
> 1. Do I need only to work with carbon-ml repo or should the whole kernel
> be installed?
> 2. The Build from source Documentation site varies from what i have
> setup(uses svn, downloads whole kernel). Should i follow this?
> 3.Could  you suggest on how to setup my dev environment? Currently I
> installed Spark, converted my project to a maven project. But maven seems
> not to properly compile.
> 4. I implemented Gradientboosted to core/spark/Algorithms. Next step would
> be to modify
> *SupervisedSparkModelBuilder.  5. *Here I would check how this would be
> integrated in the whole framework and test on Iris dataset.(On this I need
> to do some reading)
>
> I would very appreciate your guidance on this plan/work in progress.
>
> Best,
> Misgana
>
>
>
> On 26.04.2016 09:26, Misgana Negassi wrote:
>
> Hi Supun,
> I have forked carbon-ml to my repo[1] and currently I am familiarizing
> myself with the code and software architecture. I will make commits after
> trying out with a new algorithm.
>
> [1] https://github.com/zemoel/carbon-ml
>
> On 26.04.2016 06:47, Supun Sethunga wrote:
>
> Hi Misgana,
>
> As you progress, please keep us posted too. It would be nice if you
> can share your code as well (Github project). You can take a fork of repo
> [1], and start working on your fork.
>
> [1] https://github.com/wso2/carbon-ml
>
> On Mon, Apr 25, 2016 at 7:57 PM, Misgana Negassi <
> <[email protected]>[email protected]> wrote:
>
>> Hi Supun,
>>
>> Thank you for accepting me for this project!I am excited to work on it
>> and start right away with the links you sent.
>>
>> Best,
>> Misgana
>>
>>
>>
>> On 25.04.2016 12:06, Supun Sethunga wrote:
>>
>> Hi Misgana,
>>
>> Congratulations for getting accepted for the gsoc 2016! Hope you are
>> ready to get started with the project.
>>
>> To get more familiarized with the code, I'm sharing the implementations
>> of the current algorithms [1]. For your ensemble method, you need to add
>> three more cases (for the three types of ensembles) for the method [2]. You
>> may try out adding a new algorithm to he existing flow, and see how it
>> works. Please feel free to raise any questions/issues you come across.
>>
>> [1]
>> <https://github.com/wso2/carbon-ml/tree/master/components/ml/org.wso2.carbon.ml.core/src/main/java/org/wso2/carbon/ml/core/spark/algorithms>
>> https://github.com/wso2/carbon-ml/tree/master/components/ml/org.wso2.carbon.ml.core/src/main/java/org/wso2/carbon/ml/core/spark/algorithms
>> [2]
>> <https://github.com/wso2/carbon-ml/blob/master/components/ml/org.wso2.carbon.ml.core/src/main/java/org/wso2/carbon/ml/core/spark/algorithms/SupervisedSparkModelBuilder.java#L101>
>> https://github.com/wso2/carbon-ml/blob/master/components/ml/org.wso2.carbon.ml.core/src/main/java/org/wso2/carbon/ml/core/spark/algorithms/SupervisedSparkModelBuilder.java#L101
>>
>> Regards,
>> Supun
>>
>> On Thu, Mar 24, 2016 at 9:31 PM, Misgana Negassi <
>> <[email protected]>[email protected]> wrote:
>>
>>> Hi Supun,
>>>
>>> Thank you for your support and advice in this proposal process!
>>>
>>> In the case you are interested, I am attaching my report paper with
>>> contains my work with ensemble methods particularly Stacking.
>>>
>>> Best,
>>> Misgana
>>>
>>>
>>> On 24.03.2016 04:12, Supun Sethunga wrote:
>>>
>>> Looks good! Please go ahead and submit to GSoC.
>>>
>>> Thanks,
>>> Supun
>>>
>>> On Thu, Mar 24, 2016 at 4:02 AM, Misgana Negassi <
>>> <[email protected]>[email protected]> wrote:
>>>
>>>> Hi Supun,
>>>>
>>>> I have added the changes you recommended. Could you kindly give me a
>>>> feedback?
>>>>
>>>> Best,
>>>> Misgana
>>>>
>>>> On 23.03.2016 15:04, Supun Sethunga wrote:
>>>>
>>>> Hi Misgana,
>>>>
>>>> I went through your proposal. Overall it looks good. Here are a few
>>>> comments I would like to point out:
>>>>
>>>>    - Its better to have some sort of an architecture diagram,
>>>>    explaining your solution in a higher level.
>>>>    - In the timeline, better to break down the "Week 1­3 (May 23 ­
>>>>    June 20, 2016)" into three sub-levels, and allocate timeslots for each 
>>>> of
>>>>    the three methods (Stacking, Boosting and Bagging) separately. That 
>>>> would
>>>>    make it easy for you to work on those methods separately, as well as to
>>>>    track the progress.
>>>>    - In the timeline, can you double check the "week" numbers..? for
>>>>    eg; in [*Week 1­-3 (May 23 ­ June 20, 2016*], I guess it should be 
>>>> "*Week
>>>>    1-4*" (there are four weeks in the mentioned duration). Similarly,
>>>>    check the others too.
>>>>
>>>> Please share us the draft proposal once you fix those.
>>>>
>>>> Thanks,
>>>> Supun
>>>>
>>>> On Wed, Mar 23, 2016 at 7:17 PM, Misgana Negassi <
>>>> <[email protected]>[email protected]> wrote:
>>>>
>>>>> Hi Supun,
>>>>>
>>>>> I am attaching my proposal draft. I am very grateful for your comments.
>>>>>
>>>>> Thanks,
>>>>> Misgana
>>>>>
>>>>>
>>>>> On 23.03.2016 04:54, Supun Sethunga wrote:
>>>>>
>>>>> Hi Misgana,
>>>>>
>>>>> As we have mentioned in the project proposal as well, the main
>>>>> objective is to integrate ensemble support for the existing flow of the
>>>>> WSO2 Machine Learner. We are focusing on the three methods: Bagging,
>>>>> Boosting and Stacking. (On technique per each of these methods)
>>>>>
>>>>> If you haven't tried out already, you can get to know the Machine
>>>>> Learner product by downloading it and running it (Please use link [1] to
>>>>> download). Official documentation [2] and blog [3] will help you on how to
>>>>> use the product. You can also go through the source code of WSO2 ML
>>>>> ([4] and [5]), and get familiarized with the current implementations.
>>>>>
>>>>> Meantime, as Nirmal mentioned, can you please send us the draft of the
>>>>> proposal so that we can review it and give you a feedback?
>>>>>
>>>>> [1]  <http://wso2.com/products/machine-learner/>
>>>>> http://wso2.com/products/machine-learner/
>>>>> [2]  <https://docs.wso2.com/display/ML100/Introducing+Machine+Learner>
>>>>> https://docs.wso2.com/display/ML100/Introducing+Machine+Learner
>>>>> [3]
>>>>> <http://supunsetunga.blogspot.com/2015/09/building-your-first-predictive-model.html>
>>>>> http://supunsetunga.blogspot.com/2015/09/building-your-first-predictive-model.html
>>>>> [4]  <https://github.com/wso2/carbon-ml>
>>>>> https://github.com/wso2/carbon-ml
>>>>> [5]  <https://github.com/wso2/product-ml>
>>>>> https://github.com/wso2/product-ml
>>>>>
>>>>> Thanks,
>>>>> Supun
>>>>>
>>>>> On Wed, Mar 23, 2016 at 7:20 AM, Nirmal Fernando < <[email protected]>
>>>>> [email protected]> wrote:
>>>>>
>>>>>> Thanks, Misgana for your interest in a WSO2 ML GSoC project. Whilst I
>>>>>> let Supun give you some more information on the project, I encourage you 
>>>>>> to
>>>>>> create a draft proposal and send us for review.
>>>>>>
>>>>>> On Wed, Mar 23, 2016 at 2:58 AM, Misgana Negassi <
>>>>>> <[email protected]>[email protected]> wrote:
>>>>>>
>>>>>>> Hallo!
>>>>>>>
>>>>>>> I am Misgana, hailing from Freiburg, Germany and I am interested in
>>>>>>> working with you on the Ensemble methods . I have already implemented
>>>>>>> Stacking in python(code available in github/zemoel) and compared it to
>>>>>>> other ensemble methods such as Ensemble Selection on AUC performance
>>>>>>> measures. The comparison also included using above mentioned methods as
>>>>>>> part of an automated machine learning platform(Autosklearn).
>>>>>>>
>>>>>>> I am currently working on my proposal and would be grateful for your
>>>>>>> reply.
>>>>>>>
>>>>>>> Misgana
>>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>> --
>>>>>>
>>>>>> Thanks & regards,
>>>>>> Nirmal
>>>>>>
>>>>>> Team Lead - WSO2 Machine Learner
>>>>>> Associate Technical Lead - Data Technologies Team, WSO2 Inc.
>>>>>> Mobile: +94715779733
>>>>>> Blog: <http://nirmalfdo.blogspot.com/>http://nirmalfdo.blogspot.com/
>>>>>>
>>>>>>
>>>>>>
>>>>>
>>>>>
>>>>> --
>>>>> *Supun Sethunga*
>>>>> Software Engineer
>>>>> WSO2, Inc.
>>>>> <http://wso2.com/>http://wso2.com/
>>>>> lean | enterprise | middleware
>>>>> Mobile : +94 716546324 <%2B94%20716546324>
>>>>>
>>>>>
>>>>>
>>>>
>>>>
>>>> --
>>>> *Supun Sethunga*
>>>> Software Engineer
>>>> WSO2, Inc.
>>>> <http://wso2.com/>http://wso2.com/
>>>> lean | enterprise | middleware
>>>> Mobile : +94 716546324 <%2B94%20716546324>
>>>>
>>>>
>>>>
>>>
>>>
>>> --
>>> *Supun Sethunga*
>>> Software Engineer
>>> WSO2, Inc.
>>> <http://wso2.com/>http://wso2.com/
>>> lean | enterprise | middleware
>>> Mobile : +94 716546324 <%2B94%20716546324>
>>>
>>>
>>>
>>
>>
>> --
>> *Supun Sethunga*
>> Software Engineer
>> WSO2, Inc.
>> <http://wso2.com/>http://wso2.com/
>> lean | enterprise | middleware
>> Mobile : +94 716546324 <%2B94%20716546324>
>>
>>
>>
>
>
> --
> *Supun Sethunga*
> Software Engineer
> WSO2, Inc.
> <http://wso2.com/>http://wso2.com/
> lean | enterprise | middleware
> Mobile : +94 716546324
>
>
>
>


-- 
*Supun Sethunga*
Software Engineer
WSO2, Inc.
http://wso2.com/
lean | enterprise | middleware
Mobile : +94 716546324
_______________________________________________
Dev mailing list
[email protected]
http://wso2.org/cgi-bin/mailman/listinfo/dev

Reply via email to