Amir, Gerard: The easiest way to test locally at the moment is to use the standalone 'mw-ocg-bundler' and 'mw-ocg-latexer' node packages. There are good installation instructions in the READMEs, see:
https://npmjs.org/package/mw-ocg-bundler https://npmjs.org/package/mw-ocg-latexer and let me know if I need to document anything better. This will let you pull individual articles from an arbitrary wiki, and then typeset them with xelatex. There is currently good support for quite a number of languages. My standard test case contains: http://ar.wikipedia.org/wiki/ليونيل_ميسي http://ar.wikipedia.org/wiki/بشير_الثاني_الشهابي http://ar.wikipedia.org/wiki/حمزة_بن_عبد_المطلب http://ar.wikipedia.org/wiki/إسطنبول http://ar.wikipedia.org/wiki/الحرب_الإنجليزية_الزنجبارية http://de.wikipedia.org/wiki/Papier http://en.wikipedia.org/wiki/Durian http://es.wikipedia.org/wiki/Latas_de_sopa_Campbell http://fa.wikipedia.org/wiki/کعبه_زرتشت http://fr.wikipedia.org/wiki/Trachylepis_atlantica http://he.wikipedia.org/wiki/ספרטה http://hi.wikipedia.org/wiki/रामायण http://it.wikipedia.org/wiki/La_vita_è_meravigliosa http://ja.wikipedia.org/wiki/熊野三山本願所 http://ja.wikipedia.org/wiki/金星の日面通過 http://ko.wikipedia.org/wiki/조화진동자 http://ml.wikipedia.org/wiki/മലയാളം http://pl.wikipedia.org/wiki/Efekt_potwierdzenia http://pt.wikipedia.org/wiki/Scaphyglottis http://ru.wikipedia.org/wiki/Битва_при_Платеях http://simple.wikipedia.org/wiki/Taoism http://vi.wikipedia.org/wiki/Vệ_tinh_tự_nhiên_của_Sao_Thiên_Vương http://zh.wikipedia.org/wiki/納粹德國海軍 and a few other English articles. That said, I don't read most of these languages, so I've mostly been trying to ensure that our output matches the HTML displayed by the wiki. It is quite possible I've chosen bad-looking fonts, or that there are other details that could be improved. (For example, the way that Vietnamese stacked accents was bad for a while; I've fixed that now.) Comments eagerly requested! --scott ps. there are a number of minor issues with citations in RTL languages, even in our standard HTML rendering on the wikis; it appears that our citation templates should be more aggressive about adding <bdi> tags or lang attributes to ensure that citations of LTR sources in an RTL article are displayed as nicely as possible. If these fixes are made to the source, the latex output should inherit them. _______________________________________________ Wikitech-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikitech-l
