Hi Erick, What you are calling "macros" are actually called "templates" in the Wikipedia (or, more generally, MediaWiki) context. In Portuguese, the name is "Predefinição."
You can find all the templates on a wiki through the "Special:AllPages" page, and then choose the "Template" or "Predefinição" namespace. Here is a direct link. I suspect you will find there are many thousands of templates; breaking that down into those that supply significant text (which I would guess are the majority) vs. those that do not will probably be a difficult task; sorry, I don't have much to offer in that area. http://pt.wikipedia.org/w/index.php?title=Especial:Todas_as_páginas&namespace=10 I hope this helps! -Pete -- Pete Forsyth [[User:Peteforsyth]] [email protected] 503-383-9454 mobile On Feb 14, 2012, at 1:14 PM, Erick Fonseca wrote: > Greetings, > > I'm writing a script to read Wikipedia dump files and generate raw > text from them, much like it would appear in a web browser. > At first, I ignored all macros, discarding anything between {{ ... }}, > but I soon learned that some macros generate useful text. Now I need a > comprehensive list of all existing macros, to know which of them I > should treat. > As I believe some macros are language dependent, I am dealing with the > Portuguese Wikipedia. > > Thank you for any help, > Erick > > _______________________________________________ > Wikipedia-l mailing list > [email protected] > https://lists.wikimedia.org/mailman/listinfo/wikipedia-l _______________________________________________ Wikipedia-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikipedia-l
