It's really exciting to see freebase integration. Congratulations!
2013/5/28 Dileepa Jayakody <[email protected]> > Congrats Antonio! :) > > > On Tue, May 28, 2013 at 12:23 PM, Fabian Christ < > [email protected]> wrote: > > > Hi, > > > > this is very cool :) Congrats to Antonio and Dileepa! > > > > Best, > > - Fabian > > > > 2013/5/28 Antonio Perez <[email protected]>: > > > Hi All, > > > > > > Thanks a lot to let me participate in this project and to become part > of > > > this community. > > > I'll work hard in order to finish the project (I hope so) with your > help > > > and (why not) to become > > > in a new contributor to Stanbol :) > > > > > > My proposal abstract : Freebase Entity Disambiguation Engine In Apache > > > Stanbol > > > > > > > > > In the Web of Data, the information is structured and connected, being > > more > > > feasible to drive the users through the correct resources because these > > > resources are properly described in a language that a machine can > > > understand. In order to make the Web of Data a reality, it is necessary > > to > > > structure all the unstructured information. The structuring or semantic > > > enrichment process used to involve the recognition of complex entities > > and > > > concepts. The extracted entities need to be linked with “real world” > > > knowledge bases entries in order to acquire their semantics. The task > of > > > associating entity mentions in texts with Knowledge Bases entries is > > > commonly known as Entity Linking. The most complex issue of Entity > > Linking > > > is entity disambiguation resolution. Entity Disambiguation tries to > > resolve > > > the synonymy and homonymy over names’ mentions, i.e., the fact that an > > > entity can have many different names and the fact that the same name > can > > > refer to more than one entity. > > > > > > The main goal of the present proposal is to develop a disambiguation > > engine > > > for the Open-Source project Apache Stanbol using Freebase as Knowledge > > > Base. This is not an easy task to integrate Freebase in Stanbol as > > > Knowledge Base. Apache Stanbol provides a set of reusable components > for > > > Semantic Content Management. One of such component is a Content > Enhancer, > > > which can be used to extract concepts and entities from texts and link > > them > > > with any Knowledge Base registered in Stanbol. Apache Stanbol already > > > manages others semantics databases like DBpedia or Geonames. The GSoC > > > project would contribute with all the developments necessary to fully > > > support Freebase in Stanbol including disambiguation engines for this > > > Knowledge Base. > > > Freebase is an open, Creative Commons licensed repository of structured > > > data of almost 23 million entities. An entity is a single person, > place, > > or > > > thing. Freebase connects entities together as a graph. Freebase > contains > > at > > > this time of writing more than 37 million topics, 1,998 types, and more > > > than 30,000 properties. This is not a small database by any measure. > > > > > > Thanks > > > > > > Antonio > > > > > > > > > > > > > > > > > > On Tue, May 28, 2013 at 4:58 AM, Dileepa Jayakody < > > [email protected] > > >> wrote: > > > > > >> Hi All, > > >> > > >> Thanks a lot for the wishes and selecting me to become part of this > > amazing > > >> community. :) > > >> I hope to do my best in this summer project with all your help and > > >> guidance, and hopefully become a continuous contributor to Stanbol. > > >> > > >> My proposal abstract : FOAF Co-reference Based Entity Disambiguation > > Engine > > >> In Apache Stanbol > > >> > > >> The proposed project focuses on developing an 'Entity Disambiguation > > >> Engine' in Apache Stanbol by computing co-referent relations in > > >> friend-of-a-friend (FOAF) data-sets. The same entity (persons, > > >> organizations) can be referred by different names and vice-versa on > the > > web > > >> which leads to the 'named ambiguity' problem of entities. This problem > > can > > >> affect the accuracy and relevance of results inferred by semantic > > engines > > >> and leads to the requirement of using effective disambiguation > > techniques > > >> to process entities as part of the enhancement process in the semantic > > >> engines. This proposal focuses on using FOAF profiles as a datasource > > and > > >> process them to resolve name ambiguity problem in an effective way. > > >> > > >> > > >> FOAF is a vocabulary used to describe people, organizations and > groups > > in > > >> the form of linked data to form an entity network on the web. The > > >> relationship of these FOAF instances can be very useful to derive new > > >> knowledge about entities using semantic techniques. The co-reference > > >> analysis can use FOAF attributes such as mbox, homepage, weblog, as > > unique > > >> identifiers to match FOAF instances to identify co-referent clusters > and > > >> use it to disambiguate entities over the web. This project aims to > > develop > > >> a comprehensive disambiguation algorithm by identifying and clustering > > >> co-referent FOAF instances which describes the same entity over the > web. > > >> > > >> > > >> Thanks, > > >> > > >> Dileepa > > >> > > >> > > >> On Tue, May 28, 2013 at 1:01 AM, Rupert Westenthaler < > > >> [email protected]> wrote: > > >> > > >> > Hi all, > > >> > > > >> > Congratulations to Antonio and Dileepa! Great news for Stanbol. > Thanks > > >> > for your interest in Stanbol and the great proposals. This will be > an > > >> > exiting coding summer. > > >> > > > >> > Will try my best as a Mentor > > >> > > > >> > best > > >> > Rupert > > >> > > > >> > p.s. Antonio, Dileepa: It would be cool if you could provide a > summary > > >> > of your proposals here on the list ^^ > > >> > > > >> > > > >> > On Mon, May 27, 2013 at 9:21 PM, Rafa Haro <[email protected]> wrote: > > >> > > Nice!!! > > >> > > > > >> > > Congratulations to the students!! Now it's when the funny stuff > > >> starts!! > > >> > > > > >> > > > > >> > > El lunes, 27 de mayo de 2013, Andreas Kuckartz escribió: > > >> > > > > >> > >> A few minutes ago Google announced the selected GSoC projects. > > These > > >> two > > >> > >> Stanbol proposals were selected: > > >> > >> > > >> > >> Freebase Entity Disambiguation in Apache Stanbol > > >> > >> Antonio David Perez Morales > > >> > >> > > >> > >> > > >> > > > >> > > > http://www.google-melange.com/gsoc/project/google/gsoc2013/adperezmorales/10001 > > >> > >> > > >> > >> FOAF Co-reference Based Entity Disambiguation Engine In Apache > > Stanbol > > >> > >> Dileepa Jayakody > > >> > >> > > >> > > > >> > > > http://www.google-melange.com/gsoc/project/google/gsoc2013/dileepaj/14001 > > >> > >> > > >> > >> Congratulations to the two students! > > >> > >> > > >> > >> Cheers, > > >> > >> Andreas > > >> > >> > > >> > > > > >> > > -- > > >> > > > > >> > > ------------------------------ > > >> > > This message should be regarded as confidential. If you have > > received > > >> > this > > >> > > email in error please notify the sender and destroy it > immediately. > > >> > > Statements of intent shall only become binding when confirmed in > > hard > > >> > copy > > >> > > by an authorised signatory. > > >> > > > > >> > > Zaizi Ltd is registered in England and Wales with the registration > > >> number > > >> > > 6440931. The Registered Office is 222 Westbourne Studios, 242 > Acklam > > >> > Road, > > >> > > London W10 5JJ, UK. > > >> > > > >> > > > >> > > > >> > -- > > >> > | Rupert Westenthaler [email protected] > > >> > | Bodenlehenstraße 11 ++43-699-11108907 > > >> > | A-5500 Bischofshofen > > >> > > > >> > > > > > > -- > > > > > > ------------------------------ > > > This message should be regarded as confidential. If you have received > > this > > > email in error please notify the sender and destroy it immediately. > > > Statements of intent shall only become binding when confirmed in hard > > copy > > > by an authorised signatory. > > > > > > Zaizi Ltd is registered in England and Wales with the registration > number > > > 6440931. The Registered Office is 222 Westbourne Studios, 242 Acklam > > Road, > > > London W10 5JJ, UK. > > > > > > > > -- > > Fabian > > http://twitter.com/fctwitt > > >
