Re: [Wikitext-l] Markup cleansing by clearing all linguistic elements

Federico Leva (Nemo) Tue, 10 Jun 2014 07:24:41 -0700

Gabriel Wicke, 10/06/2014 02:30:

If you haven't heard of it, thenhttps://www.mediawiki.org/wiki/Parsoid
might be useful. It lets you work on HTML instead of wikitext, and can
convert that HTML back to wikitext.


I'm also curious how the this work will interact with
https://www.mediawiki.org/wiki/Content_translation, which is also based on
Parsoid.

There is no interaction because PageMigration doesn't need to manipulateHTML. :)

The question might have been unclear: what would be interesting (ifeasily available) is the ability to input a wikitext and get as output*only* the wikitext "markup" i.e. everything except the "linguistic"plain text (with some approximation). So for the example athttps://www.mediawiki.org/wiki/API:Parsing_wikitext#Example_2

[[foo]] [[API:Query|bar]] [http://www.example.com/ baz] -> [[]][[API:Query|]] [http://www.example.com/ ]


or something like that.

AFAIK there are solutions to get the plain text, e.g. people often wantto look up the text of a Wiktionary entry from the API (with varyingdegrees of success), but I'm not sure if there is something available todo the opposite or one would need to build it on top of those existingtools, by "subtraction".


Nemo

_______________________________________________
Wikitext-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikitext-l

Re: [Wikitext-l] Markup cleansing by clearing all linguistic elements

Reply via email to