Hi All; I've submitted my proposal for STANBOL-1294. I hope that I have a chance to contribute STANBOL with GSoC.
Thanks; Furkan KAMACI 2014-03-20 15:57 GMT+02:00 Furkan KAMACI <furkankam...@gmail.com>: > Hi Rafa; > > Thanks for the explanations and ideas. I will start to write a proposal > for it. On the other hand could I learn that is there any mentor who is > volunteer for it :) > > Thanks; > Furkan KAMACI > > > 2014-03-20 15:47 GMT+02:00 Rafa Haro <rh...@apache.org>: > > Hi Furkan, >> >> El 20/03/14 14:10, Furkan KAMACI escribió: >> >> Hi; >>> >>> If anybody can suggest something about to make this issue more clear it >>> will be nice. >>> >>> Thanks; >>> Furkan KAMACI >>> >> Welcome to the Stanbol community :-). As you can check at STANBOL-1294, >> this issue is related to further improvements of the current Topic >> Classification engine in Stanbol. Although there are some clear points of >> improvement (mainly current missing features at STANBOL-197), it is still a >> high level idea that would be nice to discuss in detail here. Some of the >> possible expected new features would be the following: >> >> 1. Different implementations for managing the TrainingSet. In the current >> approach, the training set has to be stored in Solr and the users have to >> configure which fields will be used for training and which fields will be >> used as categories. It would be nice to have an abstract API for managing a >> TrainingSet in stanbol independent of the final backend which actually >> could be Solr or any other storage system. >> >> 2. Different implementations of the Classifier. Current classifier API is >> also completely coupled with the current implementation, therefore it >> should be refactored for allowing different implementations based on, for >> instance, different frameworks like OpenNLP and Apache Mahout >> >> 3. Change current TopicClassification engine for working with the new >> APIs. >> >> 4. Also, as Rupert pointed in another email, evaluation support would be >> also great. >> >> These are, of course, initial ideas, but we are looking forward to hear >> more suggestions. >> >> Cheers, >> Rafa >> >> >>> >>> 2014-03-20 14:51 GMT+02:00 Furkan KAMACI <furkankam...@gmail.com>: >>> >>> Hi; >>>> >>>> I'm attending a Master program at Computer Engineering for Machine >>>> Learning and NLP on Big Data at one of the top universities of Turkey. I >>>> am a Senior Java Developer and a team lead of a big project which uses >>>> Solr. On the other hand I am one of the most active people at Solr mail >>>> list and one of mail list moderators. >>>> >>>> I want to work for STANBOL-1294 Topic Classification Framework for >>>> Stanbol if I can catch up the deadline. >>>> >>>> Thanks; >>>> Furkan KAMACI >>>> >>>> >>>> >>>> >>>> >> >