https://bugzilla.wikimedia.org/show_bug.cgi?id=7

Gabriel Wicke <[email protected]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |[email protected]

--- Comment #14 from Gabriel Wicke <[email protected]> 2012-08-09 17:40:41 
UTC ---
We have a fairly complete PEG tokenizer grammar in Parsoid
(http://www.mediawiki.org/wiki/Parsoid), which describes the context-free
portions of wikitext. Context-sensitive portions are handled in token stream
transformers. The PEG parse tree is flattened to a token stream so that we can
support unbalanced template expansions, and finally converted into a DOM using
a tree builder library according to the error recovery algorithms described in
the HTML5 spec.

The grammar is interspersed with actions and uses syntactic scope flags to
compress the grammar productions a bit, so it is not the most readable grammar
ever. Unrolling productions for all scope permutations might not help that much
either, as this would increase the size of the grammar a lot.

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
You are on the CC list for the bug.

_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to