Re: Doxia Parsing API?

Vincent Massol Mon, 31 Dec 2007 02:46:33 -0800


On Dec 27, 2007, at 11:20 AM, Vincent Massol wrote:

Hi Juan,
Thanks for your email and sorry for my late answer, I've just seen the mails now.
I've started using the Confluence parser as a starting point for writing the XWiki parser. Re the speed, the Confluence parser also generates a Block Tree but I'm not sure how this affects performance negatively.

I can answer that... It'll matter for large documents since users would not start to see any output before the end of the parsing. Modifying the parser to call traverse() whenever a block is created would be very easy to do though. I think I might add a flag for the XWiki parser to decide what to do.
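
As a rough illustration of that flag idea (this is not Doxia or XWiki code; EventSink, Block, ParagraphBlock and the streaming flag are made-up stand-ins, and blocks are simply paragraphs split on blank lines), a parser could either emit each block as soon as it is created or keep the whole tree and emit at the end:

```java
import java.util.ArrayList;
import java.util.List;

public class StreamingParseSketch {
    /** Hypothetical, simplified stand-in for an event consumer such as a Sink. */
    interface EventSink {
        void text(String s);
    }

    /** Hypothetical stand-in for a parsed block that can emit its own events. */
    interface Block {
        void traverse(EventSink sink);
    }

    static final class ParagraphBlock implements Block {
        private final String content;

        ParagraphBlock(String content) {
            this.content = content;
        }

        @Override
        public void traverse(EventSink sink) {
            sink.text(content);
        }
    }

    /** Splits the source on blank lines; the flag decides when blocks are emitted. */
    static void parse(String source, EventSink sink, boolean streaming) {
        List<Block> tree = new ArrayList<>();
        for (String chunk : source.split("\\n\\s*\\n")) {
            String trimmed = chunk.trim();
            if (trimmed.isEmpty()) {
                continue;
            }
            Block block = new ParagraphBlock(trimmed);
            if (streaming) {
                block.traverse(sink); // emit right away, before parsing the rest
            } else {
                tree.add(block);      // keep it for a traversal after parsing
            }
        }
        // In buffered mode nothing has reached the sink until this point.
        for (Block block : tree) {
            block.traverse(sink);
        }
    }

    /** Helper that gathers the emitted events into a list. */
    static List<String> collect(String source, boolean streaming) {
        List<String> out = new ArrayList<>();
        parse(source, out::add, streaming);
        return out;
    }

    public static void main(String[] args) {
        System.out.println(collect("first\n\nsecond", true));
        System.out.println(collect("first\n\nsecond", false));
    }
}
```

Both modes deliver the same events in the same order; the only difference is when the sink first receives something, which is what matters for large documents.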

However there are some cases where the full parsing is required. For example the XWiki TOC macro requires the full parsing to be done since it needs to know all the section headers. Of course a second-level parsing could also be done, looking only for headers, but that would affect the performance a bit. So for all macros that require the document structure for rendering we need the full parsing to be done first. However it's hard to know quickly if the document contains macros that work on the document structure and thus we might have to parse the whole doc anyway...
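
To make the TOC case concrete, here is a small sketch under invented assumptions: the "{toc}" marker and the "= " heading prefix are placeholders, not actual XWiki syntax, and the document is treated as a plain list of lines. A TOC placed at the top can only be rendered once every later heading has been seen:

```java
import java.util.ArrayList;
import java.util.List;

public class TocTwoPassSketch {
    static final String TOC_MACRO = "{toc}";   // hypothetical macro marker
    static final String HEADING_PREFIX = "= "; // hypothetical heading syntax

    /** First pass: scan the whole document, looking only for headings. */
    static List<String> collectHeadings(List<String> lines) {
        List<String> headings = new ArrayList<>();
        for (String line : lines) {
            if (line.startsWith(HEADING_PREFIX)) {
                headings.add(line.substring(HEADING_PREFIX.length()));
            }
        }
        return headings;
    }

    /** Second pass: render, expanding the TOC macro from the collected headings. */
    static List<String> render(List<String> lines) {
        List<String> headings = collectHeadings(lines); // needs the full document
        List<String> out = new ArrayList<>();
        for (String line : lines) {
            if (line.equals(TOC_MACRO)) {
                for (String heading : headings) {
                    out.add("* " + heading);
                }
            } else {
                out.add(line);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> doc = new ArrayList<>();
        doc.add("{toc}");
        doc.add("= Intro");
        doc.add("some text");
        doc.add("= End");
        for (String line : render(doc)) {
            System.out.println(line);
        }
    }
}
```

The heading scan is the extra pass mentioned above: it is cheap, but it still has to read to the end of the document before the first line of output can be produced.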


Thanks
-Vincent

FWIW I've run some quick tests between the JavaCC-generated parser for XWiki that is in the wikimodel parser vs the "hand-written" Confluence parser in Doxia (since Confluence and XWiki are of similar complexity for their syntaxes) and the result I got so far is that the "hand-written" parser is faster, so I've gone ahead and used the "hand-written" Confluence parser as a starting point.
Thanks again
-Vincent

On Dec 19, 2007, at 5:01 PM, Juan F. Codagnone wrote:
Hi Vincent,

On Wednesday 19 December 2007, Vincent Massol wrote:
...
I'd like to implement a Doxia parser for XWiki. However I've noticed
there's no standard in Doxia yet for parsing. Actually looking at
Doxia confluence, twiki and Apt I see each does it with its own code.
However the Confluence and TWiki implementations are very similar,
each defining Block, BlockParser, etc.
...
content). Does anyone have any idea how the Confluence parser compares
for example with, say, a JavaCC-generated parser?
The confluence parser was made after the twiki parser by Jason.
When I first wrote the twiki parser I felt that it was easier to make an ad hoc parser instead of a generated one for a language that has many exceptions. (Also, I was reading a TDD book at that time and I wanted to get some practice, and the ad hoc parser was perfect.)

Here is the original post
http://mail-archives.apache.org/mod_mbox/maven-doxia-dev/200511.mbox/[EMAIL PROTECTED]
Two years later I think it was a good decision. One developer that never saw the original code was comfortable adding new language features and bug fixes.
In terms of a fast rendering mechanism, the twiki parser has a drawback: it first builds a block tree (like a DOM tree), and then the blocks generate the events for the Sink.

Juan.

--
Buenos Aires, Argentina 22°C with winds at 9 km/h E
