Georgi Kobilarov wrote:
>> This is the exact same source code and queries I'm using. Either the
>> amount of actors in Wikipedia reduced dramatically, the extractors are
>> broken or something is not right with the SPARQL endpoint.
>>     
>
> Well, at least I can say it's neither the amount of actors in Wikipedia
> nor the dbpedia extractors.
>
> grep -c http://dbpedia.org/ontology/Actor instancetype_en.nt
> 30912
>
> Best,
> Georgi
>
> --
> Georgi Kobilarov
> Freie Universität Berlin
> www.georgikobilarov.com
>
>   
>> -----Original Message-----
>> From: Michael Haas [mailto:[email protected]]
>> Sent: Friday, July 03, 2009 8:44 PM
>> To: Kingsley Idehen
>> Cc: DBpedia
>> Subject: Re: [Dbpedia-discussion] Current state of the SPARQL endpoint
>>
>> Kingsley Idehen wrote:
>>
>>     
>>> Michael,
>>>
>>> Please retry:
>>>
>>> SELECT ?name WHERE { ?a
>>> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type>
>>> <http://dbpedia.org/ontology/Actor> . ?a
>>> <http://www.w3.org/2000/01/rdf-schema#label> ?name }
>>>
>>>       
>> Thanks for the follow-up. However, It's still not working. With
>>     
> DBpedia
>   
>> 3.2, I got ~25000 results:
>>
>> l...@moar:~/uni/IE/group3$ wc -l oldseeds/actors.txt
>> 26932 oldseeds/actors.txt
>>
>>
>> Now, it's more like ~150:
>>
>> l...@moar:~/uni/IE/group3$ wc -l seeds/actors.txt
>> 132 seeds/actors.txt
>>
>> (Please note these numbers are after post-processing, ie removing
>> duplicates etc).
>>
>> This is the exact same source code and queries I'm using. Either the
>> amount of actors in Wikipedia reduced dramatically, the extractors are
>> broken or something is not right with the SPARQL endpoint.
>>
>>
>>     
>>> The issue has nothing to do with "Anytime Query" feature and
>>>       
>> everything
>>     
>>> to do with misconfiguration
>>> of the Virtuoso cluster instance.
>>>       
>> Thanks for the update - you previously suggested it was the "Anytime
>> Query", that's why I was looking into that.
>>
>> Regards,
>>
>> Michael
>>
>>
>>
>>
>>     
> -----------------------------------------------------------------------
>   
>> -------
>> _______________________________________________
>> Dbpedia-discussion mailing list
>> [email protected]
>> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>>     
>
>   

Michael,

Using a non standard SPARQL extension to solve a most basic database 
problem i.e. count stuff:
 SELECT  count(distinct ?name)
WHERE { ?a a <http://dbpedia.org/ontology/Actor> .
                   ?a <http://www.w3.org/2000/01/rdf-schema#label> ?name
               }

Gives you a total of: 98,641

You can also look at the VoiD graph via stats oriented URIs like:

1. http://dbpedia.org/void/page/Actor109765278Stat


Then try.

define sql:signal-void-variables 1 define input:default-graph-uri
<http://dbpedia.org> SELECT ?name WHERE { ?a
<http://www.w3.org/1999/02/22-rdf-syntax-ns#type>
<http://dbpedia.org/ontology/Actor> . ?a
<http://www.w3.org/2000/01/rdf-schema#label> ?name } ORDER BY ?name
LIMIT 100 OFFSET 10000

We have a window of 40,000 records (amply generous since you are clearly 
crawling this data).


-- 


Regards,

Kingsley Idehen       Weblog: http://www.openlinksw.com/blog/~kidehen
President & CEO 
OpenLink Software     Web: http://www.openlinksw.com





------------------------------------------------------------------------------
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to