On Fri, Dec 12, 2014 at 2:41 AM, Ricordisamoa
<[email protected]> wrote:
> Il 11/12/2014 23:28, Dan Garry ha scritto:
>>
>> THIS IS AWESOME
>>
>> Do you know when we are going to be able to start querying this via an API
> in production?
>>
>> The Mobile Apps Team would love to consume this data, as opposed to the
> present data exposed via the CommonsMetadata API (which is scraped, eugh).
>
> As far as I understand the information Guillaume is talking about is exactly
> the one scraped by CommonsMetadata.
> See https://tools.wmflabs.org/mrmetadata/how_it_works.html:
> «The script needs to go through all file description pages of a wiki, and
> check for machine-readable metadata by querying the CommonsMetadata
> extension.»

That's correct. However, just to be clear, CommonsMetadata doesn't
just scrape the HTML (or the wikitext), it scrapes the HTML to look
for the machine-readable markers, and exposes that information through
the API.

Until we have Structured Data (which is /at least/ a year out),
CommonsMetadata is still the best way to access that information.

-- 
Guillaume Paumier

_______________________________________________
Wikitech-ambassadors mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikitech-ambassadors

Reply via email to