Thank you very much ! Are there any tutorials for scraping the dump ? To be
specific , I am most comfortable with python .


On Fri, Mar 21, 2014 at 6:54 PM, Brad Jorsch (Anomie) <[email protected]
> wrote:

> On Fri, Mar 21, 2014 at 1:23 PM, Radhika Gaonkar <[email protected]>wrote:
>
>>     I am very new to the mediawiki-api . I am planning on using this to
>> extract geo - information about places . I have been referring to this
>> tutorial by scraper wiki
>> https://blog.scraperwiki.com/2011/12/how-to-scrape-and-parse-wikipedia/. 
>> Though I am not sure if I should be using this api or the offline dump
>>
>
> Don't scrape the live wiki. You may scrape a dump if you'd like.
>
> Note that, if the only "geo information" you need are the coordinates, you
> can use the API to query them, like
> https://en.wikipedia.org/w/api.php?action=query&prop=coordinates&titles=Washington,%20D.C
> .
>
> and what is the difference between the two data sets ?
>>
>
> Mainly, the dumps are generated about once per month so they're not
> completely up-to-date.
>
>
> --
> Brad Jorsch (Anomie)
> Software Engineer
> Wikimedia Foundation
>
> _______________________________________________
> Mediawiki-api mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/mediawiki-api
>
>


-- 
Radhika Gaonkar
Final year B.E.( Hons) Computer Science
BITS Pilani K. K . Birla Goa Campus
_______________________________________________
Mediawiki-api mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/mediawiki-api

Reply via email to