> For now, I'm doing this in a personal PAWS notebook and I plan to run it on around 800 enwiki and 200 eswiki articles. But I would like to share the notebook with others in the future, so they can use it for their own list of articles, and I may try to make it into a Toolforge tool eventually.
While accessing this via the cloudelastic replicas would certainly be more performant, for a set of 1k articles, requesting them sequentially through the public MediaWiki APIs should be doable. A query such as this will return the weighted tags: https://en.wikipedia.org/wiki/Special:ApiSandbox#action=query&format=json&prop=cirrusdoc&titles=Email&formatversion=2&cdincludes=weighted_tags That API does report that its output is an internal format and subject to change, but that internal format is exactly what we would see when talking to cloudelastic directly.
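In case it helps for the notebook, here is a rough Python sketch of that sequential approach. It reuses the query parameters from the sandbox link above. The function names, the User-Agent string, the 0.1 second pause, and the assumed response layout (`query.pages[].cirrusdoc[].source.weighted_tags`) are my own assumptions for illustration, so check them against a real response before relying on this.

```python
import json
import time
import urllib.parse
import urllib.request

# Standard Action API endpoint; the wiki language code is filled in per call.
API_URL = "https://{wiki}.wikipedia.org/w/api.php"
# Placeholder User-Agent (assumption): replace with your own contact details.
USER_AGENT = "weighted-tags-sketch/0.1 (placeholder contact; replace me)"


def build_url(title, wiki="en"):
    """Build the same query as the ApiSandbox link, for a single title."""
    params = {
        "action": "query",
        "format": "json",
        "prop": "cirrusdoc",
        "titles": title,
        "formatversion": "2",
        "cdincludes": "weighted_tags",
    }
    return API_URL.format(wiki=wiki) + "?" + urllib.parse.urlencode(params)


def extract_weighted_tags(response):
    """Collect weighted_tags from a decoded response.

    Assumes the layout query.pages[].cirrusdoc[].source.weighted_tags.
    The API describes this as an internal format, so it may change.
    """
    tags = []
    for page in response.get("query", {}).get("pages", []):
        for doc in page.get("cirrusdoc", []):
            tags.extend(doc.get("source", {}).get("weighted_tags", []))
    return tags


def fetch_json(url):
    """Perform one GET request and decode the JSON body."""
    request = urllib.request.Request(url, headers={"User-Agent": USER_AGENT})
    with urllib.request.urlopen(request) as reply:
        return json.load(reply)


def fetch_all(titles, wiki="en", fetch=fetch_json, delay=0.1):
    """Request each title in turn, pausing briefly between requests."""
    results = {}
    for title in titles:
        results[title] = extract_weighted_tags(fetch(build_url(title, wiki)))
        time.sleep(delay)
    return results
```

Something like `fetch_all(["Email"], wiki="en")` would then return a dict mapping each title to its list of raw tag strings, and the same call with `wiki="es"` would cover the eswiki articles. The `fetch` argument is only there so the parsing can be tried against a saved response without hitting the API.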
