Hi! Yes, you can grab it from the API. Here is an sample of request [1]. I don’t know if pywikipedia has a nice wrapper for it but you can do this request easily by directly using low level pywikipedia code.
Thomas [1] https://commons.wikimedia.org/w/api.php?action=query&prop=imageinfo&format=jsonfm&iiprop=metadata&iilimit=5&titles=File%3A%E0%B4%9C%E0%B4%BE%E0%B4%A4%E0%B4%BF%E0%B4%95%E0%B5%8D%E0%B4%95%E0%B5%81%E0%B4%AE%E0%B5%8D%E0%B4%AE%E0%B4%BF.pdf Le 6 déc. 2013 à 19:00, ബാലശങ്കർ സി <[email protected]> a écrit : > Hi all, > I am from ml.wikisource.org and I am having a doubt regarding Mediawiki API > and PDF files. I want to know if I can use Pywikipedia to grab the text layer > of a pdf file (in the file namespace, obviously) . Is the mediawiki API > handling any such functionality? Thanks in advance. > > Regards, > Balasankar C > http://balasankarc.in > _______________________________________________ > Wikisource-l mailing list > [email protected] > https://lists.wikimedia.org/mailman/listinfo/wikisource-l
_______________________________________________ Wikisource-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikisource-l
