For the other people who are reading this: I also got this question. Solved this by doing a query on the database, see https://quarry.wmflabs.org/query/14350

Parsing wikitext is generally messy. Quite a few identifier templates on Commons (like https://commons.wikimedia.org/wiki/Template:Rijksmonument ) set a tracker category and use the identifier as the sorting key. This way it's possible to keep track of what identifier is used on what page (see https://www.mediawiki.org/wiki/Manual:Categorylinks_table for the database layout). In this case no tracker category was set so the externallinks table was used as a fallback ( https://www.mediawiki.org/wiki/Manual:Externallinks_table ).

Maarten


On 25-11-16 15:11, Hugo Manguinhas wrote:
Hi everyone,

I am new to the Commons API and would like to know how to get (in a machine 
readable way) the metadata found within the Summary section of a page.

In particular, given a File page like this one: 
https://commons.wikimedia.org/wiki/File:African_Dusky_Nightjar_(Caprimulgus_pectoralis)_(W1CDR0000386_BD28).ogghttps://commons.wikimedia.org/wiki/File:African_Dusky_Nightjar_(Caprimulgus_pectoralis)_(W1CDR0000386_BD28).ogg

I would like to get the "Europeana link" part... it is enough for me to get the 
data as Wiki markup, but parsing the whole HTML would be too much :S

... btw, is there any way to query for such data? I have been using the API 
Sandbox (https://en.wikipedia.org/wiki/Special:ApiSandbox ) but could not find 
a method that could do this...

Your help is really appreciated! Thank you in advance!

Best regards,
Hugo
_______________________________________________
Commons-l mailing list
Commons-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/commons-l


_______________________________________________
Commons-l mailing list
Commons-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/commons-l

Reply via email to