Hi Batica,
I do not understand the question. Too much info does not seem to be a bad
thing, if you have many ways to clean them up. You could, for example,
write a simple script that removes everything that doesn't pass a given
threshold.
Cheers,
Pablo
On Thu, Feb 9, 2012 at 1:15 PM, Batica Dzonic <[email protected]> wrote:
> Thank you Pablo, but I have another problem now.
> The DBpedia Spotlight lexicalizations dataset contains "too much
> information", for example:
> <http://dbpedia.org/resource/United_States> <
> http://lexvo.org/ontology#label> "American Country Music Singer"@en, and
> when I query local lookup, for example
> http://lookup.dbpedia.org/api/search.asmx/KeywordSearch?QueryString=music
> United
> States is first results (because this resource has the highest refCount)
> :)
>
> I suspect that the problem in these redundant data that I cited as an
> example.
> Am I wrong somewhere?
>
> From lexicalizations dataset I have extracted the following information,
> all triplets that contain <http://lexvo.org/ontology#label> in the
> following way:
>
> -In lexicallizations dataset
> <http://dbpedia.org/resource/United_States> <
> http://lexvo.org/ontology#label> "American Country Music Singer"@en <
> http://dbepdia.org/spotlight/id/United_States---American_Country_Music_Singer>
> .
>
> -In my surface_forms.nt
> <http://dbpedia.org/resource/United_States> <
> http://lexvo.org/ontology#label> "American Country Music Singer"@en
>
> And for refCounts:
> -In lexicallizations dataset:
> <http://dbpedia.org/resource/United_States> <
> http://dbpedia.org/spotlight/score#uriCount> "330045"^^<
> http://www.w3.org/2001/XMLSchema#integer> .
>
> -In my ref_counts.nt
> <http://dbpedia.org/resource/United_States> <
> http://dbpedia.org/property/refCount> "330045"^^<
> http://www.w3.org/2001/XMLSchema#integer> .
>
> To mention that everything else works fine.
> Is there another way to get to this information (ref_counts.nt and
> surface_forms.nt)
>
> Cheers,
> Batica
>
>
> --- On *Mon, 1/30/12, Pablo Mendes <[email protected]>* wrote:
>
>
> From: Pablo Mendes <[email protected]>
> Subject: Re: [Dbpedia-discussion] DBpedia Lookup lucene index - download
> link?
> To: "Batica Dzonic" <[email protected]>
> Cc: [email protected]
> Date: Monday, January 30, 2012, 2:15 PM
>
>
> The source code is here, under the same license as the extraction
> framework. Kudos to Max Jakob for (re)implementing it.
> http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/lookup/
>
> The string to URI associations are available from the DBpedia Spotlight
> lexicalizations dataset:
> http://spotlight.dbpedia.org/download/lexicalizations.tgz
>
> It is available under
> creativecommons.org/licenses/by/3.0/
> You could build your index from that after some grep/sed string
> manipulation.
>
> Cheers,
> Pablo
>
> On Mon, Jan 30, 2012 at 10:21 PM, Batica Dzonic
> <[email protected]<http://mc/[email protected]>
> > wrote:
>
> I need to setup DBpedia lookup on local machine.
> Do I have to build lucene dbpedia lookup index or can somene give me a
> link for dowloading prebuilt DBpedia Lucene Lookup index, if that even
> exists?
>
>
> ------------------------------------------------------------------------------
> Try before you buy = See our experts in action!
> The most comprehensive online learning library for Microsoft developers
> is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3,
> Metro Style Apps, more. Free future releases when you subscribe now!
> http://p.sf.net/sfu/learndevnow-dev2
> _______________________________________________
> Dbpedia-discussion mailing list
> [email protected]<http://mc/[email protected]>
> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>
>
>
------------------------------------------------------------------------------
Virtualization & Cloud Management Using Capacity Planning
Cloud computing makes use of virtualization - but cloud computing
also focuses on allowing computing to be delivered as a service.
http://www.accelacomm.com/jaw/sfnl/114/51521223/
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion