Hi Juhi,
Any particular interest among those two?
In the description we referenced a few papers which could be used as
potential ideas.
DBpedia Spotlight – Better Surface Form Matching:
We are currently having some problems with linguistic variations. One of
the ideas (but it is not fixed so feel free to bring yours as well :] )
could be to use the superstring matching from the Babelfy paper. But It
would also be nice to mix it with some of the methods described in: [1]
DBpedia Spotlight – Better Context Vectors:
The quality of the context vectors seems a bit weird for some entities.
Values are not normalised, among other issues.
The aim here is to improve context vectors/disambiguation.
Some of the ideas could be: use glove/word2vec entity vectors.
But also maybe bringing some ideas from discourse parsing like the ones
described in [2]
A good start if you are interested would probably be:
- taking a quick look at the mentioned papers to get some ideas
- play a bit with spotlight ( demo
http://dbpedia-spotlight.github.io/demo/ )
- try to set it up locally, (compile it, run it)
- trying to understand how the spotlight stores work. Probably the paper
describing how spotlight works should be a good overview [3]
[1] https://aclweb.org/anthology/P/P11/P11-1095.pdf
[2] http://www.aclweb.org/anthology/C14-1213
[3] http://blog.semantic-web.at/wp-content/uploads/2011/09/p1_mendes.pdf
On Wed, Mar 4, 2015 at 9:23 AM, Marco Fossati <[email protected]> wrote:
> Hi Juhi,
>
> you should send all your inquiries in the mailing list (in CC).
> Please check out the following page with our project ideas:
> http://wiki.dbpedia.org/gsoc2015/ideas
>
> Cheers!
>
> On 3/3/15 2:58 PM, JUHI TANDON wrote:
> > Hello Marco,
> >
> > I am Juhi Tandon pursuing my major in Computational Linguistics from
> > IIIT Hyderabad. I am an NLP enthusiast and as such I found the projects
> > these projects particularly interesting :
> >
> >
> > DBpedia Spotlight – Better Context Vectors and
> >
> >
> > DBpedia Spotlight – Better Surface Form Matching
> >
> > I would like to contribute to one of these projects as a part of GSOC
> > 2015 Program. If the mentors can please provide some insights on where
> > to begin from.
> >
> > Thanks and Regards,
> >
> > Juhi
>
> --
> Marco Fossati
> http://about.me/marco.fossati
> Twitter: @hjfocs
> Skype: hell_j
>
>
> ------------------------------------------------------------------------------
> Dive into the World of Parallel Programming The Go Parallel Website,
> sponsored
> by Intel and developed in partnership with Slashdot Media, is your hub for
> all
> things parallel software development, from weekly thought leadership blogs
> to
> news, videos, case studies, tutorials and more. Take a look and join the
> conversation now. http://goparallel.sourceforge.net/
> _______________________________________________
> Dbpedia-gsoc mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dbpedia-gsoc
>
------------------------------------------------------------------------------
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the
conversation now. http://goparallel.sourceforge.net/
_______________________________________________
Dbpedia-gsoc mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-gsoc