Hi Matt,
Thanks for your message. We usually move the dev-intensive discussions
to dbpedia-developers. Please feel free to subscribe and send us a patch
there. The current implementation also seems to have a bug that makes it
crash from time to time. I think some characters make it throw a
NullPointerException. If you're interested in taking a look at that, we'd
definitely appreciate it.
About the surface forms, we used DBpedia Spotlight's indexing process to
obtain those. What you see in the lookup index is a subset of:
http://dbpedia.org/Datasets/NLP
The classes used to generate the data are available here:
https://github.com/dbpedia-spotlight/dbpedia-spotlight/blob/master/index/src/main/scala/org/dbpedia/spotlight/util/ExtractCandidateMap.scala
https://github.com/dbpedia-spotlight/dbpedia-spotlight/blob/master/core/src/main/scala/org/dbpedia/spotlight/util/CreateLexicalizations.scala
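To give a rough idea of what a surface form is in this context, here is a
minimal Python sketch that collects (anchor text, link target) pairs from
wiki markup. It is not the Spotlight code: the function name, the regex and
the filtering rules are assumptions made for illustration, and the real
indexing process is more involved than this.

```python
import re
from collections import Counter

# Matches [[Target]] and [[Target|anchor text]]; links containing nested
# brackets are not handled by this simplified pattern.
WIKILINK = re.compile(r"\[\[([^\[\]|]+)(?:\|([^\[\]]*))?\]\]")


def surface_forms(wikitext):
    """Count (surface form, target page) pairs found in wiki links.

    Hypothetical helper for illustration only.
    """
    counts = Counter()
    for match in WIKILINK.finditer(wikitext):
        # Drop any section fragment, e.g. "Berlin#History" -> "Berlin".
        target = match.group(1).split("#", 1)[0].strip()
        # Skip same-page section links and namespaced links (File:, Category:, ...).
        if not target or ":" in target:
            continue
        # Without explicit anchor text, the page name is the surface form.
        anchor = (match.group(2) or target).strip()
        if not anchor:
            continue
        # Normalise the target the way page names appear in URIs:
        # first letter upper-cased, spaces replaced by underscores.
        page = (target[0].upper() + target[1:]).replace(" ", "_")
        counts[(anchor, page)] += 1
    return counts


if __name__ == "__main__":
    sample = (
        "[[Berlin]] is the capital of [[Germany|the Federal Republic]]. "
        "See [[File:Map.png|thumb]] and [[berlin|Berlin city]]."
    )
    for (form, page), n in sorted(surface_forms(sample).items()):
        print(form, "->", page, n)
```

Counting the pairs rather than just listing them makes it possible to drop
surface forms that were only seen once or twice, which tend to be noise.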
Cheers,
Pablo
On Sun, Nov 18, 2012 at 7:26 PM, Matthew Haynes <[email protected]> wrote:
> Hiya,
>
> Not sure if this is the correct place for development discussions or not
> so please correct me if I'm wrong.
>
> I had a bit of trouble building the dbpedia lookup tool as maven started
> to complain about not being able to find the nxparser jar despite it being
> in the lib directory. I have a small patch that upgrades the nxparser
> version and sets maven to pull in the jar from the project's maven repo on
> google code: http://code.google.com/p/nxparser/wiki/Maven
>
> What is the best way to submit this via Sourceforge?
>
> I'm also quite interested in trying to add some new functionality to the
> lookup tool, geospatial search in particular. To do this I first need to be
> able to build a new index though! I was wondering if anybody had any code
> or advice for building the surface forms? I have a simple bash (sed + perl)
> script that parses the links in the wikipedia dumps and seems to work OK,
> but the parsing is fairly simplistic when it comes to some of the wiki
> syntax. Would be very interested to know how the original surface forms
> were built for the current lookup index.
>
> Many thanks,
>
> Matt
>
> ------------------------------------------------------------------------------
> Monitor your physical, virtual and cloud infrastructure from a single
> web console. Get in-depth insight into apps, servers, databases, vmware,
> SAP, cloud infrastructure, etc. Download 30-day Free Trial.
> Pricing starts from $795 for 25 servers or applications!
> http://p.sf.net/sfu/zoho_dev2dev_nov
> _______________________________________________
> Dbpedia-discussion mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>
>
--
---
Pablo N. Mendes
http://pablomendes.com
Events: http://wole2012.eurecom.fr