Re: [Wikidata] Recognizing deleted resource in the Wikidata SPARQL endpoint

2016-03-19 Thread Stas Malyshev
Hi!

> I am currently running an experiment to figure out how many Wikidata
> entries refer to identifiers in our dataset (i.e. using property P727)
> but I am receiving in the results entries that have apparently been
> deleted/deprecated (e.g. http://www.wikidata.org/entity/Q18573617)... is

Could you send me the query and the items you see that are wrong?

> there a way to detect them using SPARQL, perhaps some meta-property or
> some information in a statement, or is it simply because the endpoint is
> not in sync with the main repo. 

Short answer - unfortunately, no. Longer answer in
https://phabricator.wikimedia.org/T128947#2104017


-- 
Stas Malyshev
smalys...@wikimedia.org

___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] Recognizing deleted resource in the Wikidata SPARQL endpoint

2016-03-19 Thread Mbch331

Op 18-3-2016 om 19:30 schreef Stas Malyshev:

Hi!


I am currently running an experiment to figure out how many Wikidata
entries refer to identifiers in our dataset (i.e. using property P727)
but I am receiving in the results entries that have apparently been
deleted/deprecated (e.g. http://www.wikidata.org/entity/Q18573617)... is

Could you send me the query and the items you see that are wrong?


there a way to detect them using SPARQL, perhaps some meta-property or
some information in a statement, or is it simply because the endpoint is
not in sync with the main repo.

Short answer - unfortunately, no. Longer answer in
https://phabricator.wikimedia.org/T128947#2104017


That ticket is a restricted task, so not everybody can see the longer 
answer.


Mbch331

___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] Recognizing deleted resource in the Wikidata SPARQL endpoint

2016-03-18 Thread Hugo Manguinhas
Hi Stas, Mbch331,

Thanks both for your help!

Is there an alternative to SPARQL that can either check for deleted records or 
give only non-deleted ones? currently, I am checking if they are deleted by 
making a "ping" using HTTP head requests but this takes a huge amount of time 
since I need to check about 70k resources.

... btw, the SPARQL query that I am running is this one:
SELECT ?wdt ?cho WHERE { ?wdt wdt:P727 ?cho }

Thanks again!

Best regards,
Hugo


From: Wikidata [wikidata-boun...@lists.wikimedia.org] on behalf of Mbch331 
[mbch331.wikipe...@gmail.com]
Sent: 18 March 2016 20:41
To: Discussion list for the Wikidata project.
Subject: Re: [Wikidata] Recognizing deleted resource in the Wikidata SPARQL 
endpoint

Op 18-3-2016 om 19:30 schreef Stas Malyshev:
> Hi!
>
>> I am currently running an experiment to figure out how many Wikidata
>> entries refer to identifiers in our dataset (i.e. using property P727)
>> but I am receiving in the results entries that have apparently been
>> deleted/deprecated (e.g. http://www.wikidata.org/entity/Q18573617)... is
> Could you send me the query and the items you see that are wrong?
>
>> there a way to detect them using SPARQL, perhaps some meta-property or
>> some information in a statement, or is it simply because the endpoint is
>> not in sync with the main repo.
> Short answer - unfortunately, no. Longer answer in
> https://phabricator.wikimedia.org/T128947#2104017
>
>
That ticket is a restricted task, so not everybody can see the longer
answer.

Mbch331

___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] Recognizing deleted resource in the Wikidata SPARQL endpoint

2016-03-18 Thread Stas Malyshev
Hi!

> Is there an alternative to SPARQL that can either check for deleted
> records or give only non-deleted ones? currently, I am checking if
> they are deleted by making a "ping" using HTTP head requests but this
> takes a huge amount of time since I need to check about 70k
> resources.

Well, SPARQL data store is not supposed to contain any deleted
entries... But looks like there's some bug there. If you give me the
list of the "bad" entries, it's easy to update them. Considerable harder
is to find *why* they weren't updated in the first place. I'm still
looking into it.

-- 
Stas Malyshev
smalys...@wikimedia.org

___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata