Hi there,
Thank you for your well informed message.

Please see inline.

On Tue, Mar 20, 2012 at 4:24 PM, Siwei Yu <[email protected]> wrote:

> Dear Sir/Madam,
>
> I'm a Chinese PhD student major in Semantic Web. I'm quite interested
> in contributing to DBpedia Spotlight in GSoC 2012. Among the project
> ideas, I would like to work on integrating DBpedia Spotlight as
> Enhancement Engine within Apache Stanbol. I have studied the User
> Manual [1] and the Technical Documentation [2] of DBpedia Spotlight,
> as well as the Engine Enhancement development guide from Apache
> Stanbol [3]. In the source code, SpotlightInterface [4] provides a
> good demonstration of the programmatic usage DBpedia Spotlight. After
> the document studying work, I can totally understand the goals and the
> approaches of this project, and I think I can make it this summer.
> Here're some questions:
>
> 1. The general procedure of DBpedia Spotlight can be broken down in
> four stages: (1) Spotting, (2) Candidate Selection, (3)
> Disambiguation, (4) Filtering. There are three web services for users.
> What stage(s) does each service work on? Am I right for the following
> mappings?
> - http://spotlight.dbpedia.org/rest/annotate : (1) (2) (3)
> - http://spotlight.dbpedia.org/rest/candidates : (1) (2)
> - http://spotlight.dbpedia.org/rest/disambiguate : (3)
> We can perform the additional stage (4) in each service for data
> filtering. Is that true?
>

The /annotate implements (4) fully. The /candidates and /disambiguate
should also do so, but we've noticed that we never quite finished rolling
it out. Also, /candidates does (3) as well, but it returns all of the
ranked disambiguations, while /annotate returns only the first. We need to
do some reorganization in our REST interfaces.


> 2. Should I develop one Enhancement Engine containing three services,
> or three engines (i.e. each service as an engine)? It's maybe related
> to the service function granularity. What's your opinion?
>

We could have one engine for each task separately, and an enhancement chain
should connect them together. We should also introduce a REST API /spot for
(1). We could perhaps make /candidates implement only (2) and make
/annotate accept a &verbose=on to act like the current /candidates does.

Besides all of this reorganization that has to happen, Rupert is the guy
from Stanbol that can help you position your application in that regard.


> 3. In [1], you mention that "core/SpotlightFactory" and
> "rest/Candidates.java" would help understanding the programmatic usage
> of DBpedia Spotlight. But I can not find them in svn. It's greatly
> appreciated if you point out their package paths. Or any other
> important source codes?
>

By the way, my name is Siwei Yu. I have good knowledge of semantic
> technologies, such as RDF, OWL, SPARQL. I'm also familiar with the
> mainstream Java based RDF/OWL processing tools like owlapi, Jena,
> Sesame, AllegroGraph. I have strong Java coding skills with of good
> knowledge of the software design patterns. My research background
> meets the requirements very well. I believe it'll be a wonderful
> summer working with the DBpedia Spotlight community.
>

It would be good if you leveraged some of your Semantic Web background in
your application. The idea of a /feedback API, which receives corrections
made by the users could fit well in this regard.


>
> Looking forward to hearing from you!
>
> [1] http://wiki.dbpedia.org/spotlight/usersmanual?v=i0m
> [2] http://wiki.dbpedia.org/spotlight/technicaldocumentation?v=3qy
> [3] http://incubator.apache.org/stanbol/docs/trunk/enhancer/engines/
> [4]
> http://dbp-spotlight.svn.sourceforge.net/viewvc/dbp-spotlight/trunk/rest/src/main/java/org/dbpedia/spotlight/web/rest/SpotlightInterface.java?revision=363&view=markup
>
> Yours,
> Siwei Yu
>
>
------------------------------------------------------------------------------
This SF email is sponsosred by:
Try Windows Azure free for 90 days Click Here 
http://p.sf.net/sfu/sfd2d-msazure
_______________________________________________
Dbp-spotlight-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbp-spotlight-users

Reply via email to