Hi Pablo,
I would like to contribute to a fix, but am afraid that I lack the programmer's
knowledge. If there is any other way to contribute, please let me know.
What would be a sensible size for the pieces to feed to Spotlight? I will be
needing to split up a book at least in pages, and, even better still, into
alineas/paragraphs/Absaetze.
Kind regards,
Gerard
________________________________
Van: Pablo N. Mendes [[email protected]]
Verzonden: woensdag 24 april 2013 12:56
Aan: Kuys, Gerard
CC: DBpediaSpotlight Users
Onderwerp: Re: What is needed to to Entity Recognition for fairly long texts?
Hi Gerard,
The solution at the moment would be to cut the input text into pieces before
sending to the service. A fix on our side will come with the solution to the
following documented issue:
https://github.com/dbpedia-spotlight/dbpedia-spotlight/issues/166
Would you be interested in contributing a fix for this issue?
Cheers,
Pablo
On Wed, Apr 24, 2013 at 3:04 AM, Kuys, Gerard
<[email protected]<mailto:[email protected]>> wrote:
Hi all,
Since I am new on this mailing list, I don’t know whether or not I am asking
for things that already are documented somewhere. Apologies if that were the
case!
What I really would like to know, is if it is possible to do entity recognition
on fairly long texts (like the one enclosed). I might be embarking on a project
in which there are scores of this kind of texts, and if I had to wait for
DBpedia Spotlight for a day or two for a single text or part of it, the project
would turn out to be not feasible. Of course, I will restrict the types to be
searched as much as possible (Persons, Places), but then still it is taking too
much time. Dimitris was so kind to restart the Spotlight server and tell me so,
otherwise I would still be waiting for a result.
What is it you suggest me to do? And are there requirements for the text to be
improved, in order to make the spotting process easier?
Thanks for your answer,
Gerard Kuys
Nl.dbpedia.org<http://Nl.dbpedia.org>
Disclaimer
Dit bericht met eventuele bijlagen is vertrouwelijk en uitsluitend bestemd voor
de geadresseerde. Indien u niet de bedoelde ontvanger bent, wordt u verzocht de
afzender te waarschuwen en dit bericht met eventuele bijlagen direct te
verwijderen en/of te vernietigen. Het is niet toegestaan dit bericht en
eventuele bijlagen te vermenigvuldigen, door te sturen, openbaar te maken, op
te slaan of op andere wijze te gebruiken. Ordina N.V. en/of haar
groepsmaatschappijen accepteren geen verantwoordelijkheid of aansprakelijkheid
voor schade die voortvloeit uit de inhoud en/of de verzending van dit bericht.
This e-mail and any attachments are confidential and are solely intended for
the addressee. If you are not the intended recipient, please notify the sender
and delete and/or destroy this message and any attachments immediately. It is
prohibited to copy, to distribute, to disclose or to use this e-mail and any
attachments in any other way. Ordina N.V. and/or its group companies do not
accept any responsibility nor liability for any damage resulting from the
content of and/or the transmission of this message.
--
Pablo N. Mendes
http://pablomendes.com
Disclaimer
Dit bericht met eventuele bijlagen is vertrouwelijk en uitsluitend bestemd voor
de geadresseerde. Indien u niet de bedoelde ontvanger bent, wordt u verzocht de
afzender te waarschuwen en dit bericht met eventuele bijlagen direct te
verwijderen en/of te vernietigen. Het is niet toegestaan dit bericht en
eventuele bijlagen te vermenigvuldigen, door te sturen, openbaar te maken, op
te slaan of op andere wijze te gebruiken. Ordina N.V. en/of haar
groepsmaatschappijen accepteren geen verantwoordelijkheid of aansprakelijkheid
voor schade die voortvloeit uit de inhoud en/of de verzending van dit bericht.
This e-mail and any attachments are confidential and are solely intended for
the addressee. If you are not the intended recipient, please notify the sender
and delete and/or destroy this message and any attachments immediately. It is
prohibited to copy, to distribute, to disclose or to use this e-mail and any
attachments in any other way. Ordina N.V. and/or its group companies do not
accept any responsibility nor liability for any damage resulting from the
content of and/or the transmission of this message.
------------------------------------------------------------------------------
Try New Relic Now & We'll Send You this Cool Shirt
New Relic is the only SaaS-based application performance monitoring service
that delivers powerful full stack analytics. Optimize and monitor your
browser, app, & servers with just a few lines of code. Try New Relic
and get this awesome Nerd Life shirt! http://p.sf.net/sfu/newrelic_d2d_apr
_______________________________________________
Dbp-spotlight-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbp-spotlight-users