Georgi Kobilarov wrote:
>> This is the exact same source code and queries I'm using. Either the
>> amount of actors in Wikipedia reduced dramatically, the extractors are
>> broken or something is not right with the SPARQL endpoint.
>>
>
> Well, at least I can say it's neither the amount of actors in Wikipedia
> nor the dbpedia extractors.
>
> grep -c http://dbpedia.org/ontology/Actor instancetype_en.nt
> 30912
>
> Best,
> Georgi
>
> --
> Georgi Kobilarov
> Freie Universität Berlin
> www.georgikobilarov.com
>
>
>> -----Original Message-----
>> From: Michael Haas [mailto:[email protected]]
>> Sent: Friday, July 03, 2009 8:44 PM
>> To: Kingsley Idehen
>> Cc: DBpedia
>> Subject: Re: [Dbpedia-discussion] Current state of the SPARQL endpoint
>>
>> Kingsley Idehen wrote:
>>
>>
>>> Michael,
>>>
>>> Please retry:
>>>
>>> SELECT ?name WHERE { ?a
>>> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type>
>>> <http://dbpedia.org/ontology/Actor> . ?a
>>> <http://www.w3.org/2000/01/rdf-schema#label> ?name }
>>>
>>>
>> Thanks for the follow-up. However, It's still not working. With
>>
> DBpedia
>
>> 3.2, I got ~25000 results:
>>
>> l...@moar:~/uni/IE/group3$ wc -l oldseeds/actors.txt
>> 26932 oldseeds/actors.txt
>>
>>
>> Now, it's more like ~150:
>>
>> l...@moar:~/uni/IE/group3$ wc -l seeds/actors.txt
>> 132 seeds/actors.txt
>>
>> (Please note these numbers are after post-processing, ie removing
>> duplicates etc).
>>
>> This is the exact same source code and queries I'm using. Either the
>> amount of actors in Wikipedia reduced dramatically, the extractors are
>> broken or something is not right with the SPARQL endpoint.
>>
>>
>>
>>> The issue has nothing to do with "Anytime Query" feature and
>>>
>> everything
>>
>>> to do with misconfiguration
>>> of the Virtuoso cluster instance.
>>>
>> Thanks for the update - you previously suggested it was the "Anytime
>> Query", that's why I was looking into that.
>>
>> Regards,
>>
>> Michael
>>
>>
>>
>>
>>
> -----------------------------------------------------------------------
>
>> -------
>> _______________________________________________
>> Dbpedia-discussion mailing list
>> [email protected]
>> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>>
>
>
Michael,
Using a non standard SPARQL extension to solve a most basic database
problem i.e. count stuff:
SELECT count(distinct ?name)
WHERE { ?a a <http://dbpedia.org/ontology/Actor> .
?a <http://www.w3.org/2000/01/rdf-schema#label> ?name
}
Gives you a total of: 98,641
You can also look at the VoiD graph via stats oriented URIs like:
1. http://dbpedia.org/void/page/Actor109765278Stat
Then try.
define sql:signal-void-variables 1 define input:default-graph-uri
<http://dbpedia.org> SELECT ?name WHERE { ?a
<http://www.w3.org/1999/02/22-rdf-syntax-ns#type>
<http://dbpedia.org/ontology/Actor> . ?a
<http://www.w3.org/2000/01/rdf-schema#label> ?name } ORDER BY ?name
LIMIT 100 OFFSET 10000
We have a window of 40,000 records (amply generous since you are clearly
crawling this data).
--
Regards,
Kingsley Idehen Weblog: http://www.openlinksw.com/blog/~kidehen
President & CEO
OpenLink Software Web: http://www.openlinksw.com
------------------------------------------------------------------------------
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion