Am 01.07.2016 um 01:42 schrieb Nuria Ruiz: > Is this data always requested via http from an api endpoint that will hit a > varnish cache? (Daniel can probably answer this)
Yes. Special:EntityData is a regular special page, and action=wbgetentities is a regular MW web API request, as your example shows. > If the data you are interested in can be inferred from these requests there is > no additional data gathering needed. Yay! > Nor does it tell us how > often statements/RDF triples show up in the Wikidata Query Service. I'm no expert on the query service, adding Stas for that. As far as I know, SPARQL queries go through Varnish directly to BlazeGraph. In any case, they are not processed by MediaWiki at all. Tracking how often an entity is mentioned in a GET request to the SPARQL service should be possible based on the varnish request logs, with a bit of regex magic. POST requests are more tricky, I suppose. However, I don't think we are logging the contents of responses at all. I suppose that would have to be build into BlazeGraph somehow. And even if we did that, that would only tell use which entities were present in a result, not which entities were used to answer a query. E.g. if you list all instances of a class (including subclasses), the entities representing the classes are essential to answering the query, but they are not present in the result (and only the top-most class is present in the query). -- Daniel Kinzler Senior Software Developer Wikimedia Deutschland Gesellschaft zur Förderung Freien Wissens e.V. _______________________________________________ Analytics mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/analytics
