Hello -
I am using TreeBuilder to parse some HTML produced by Adobe's RoboHelp
application.  

It uses a "placeholder" element to enable content reuse:
<placeholder id="1234">

These elements have no text nodes and no end tag. They only have an id
attribute.

I am parsing the file and wish to output these elements "as is".

TreeBuilder is creating end tags for each placeholder element and
placing all of the end tags at the end of the document immediately
before the closing "body" tag:

</placeholder></placeholder></placeholder></body>

I've tried a few combinations of settings for $root->ignore_unknown,
$root->implicit_tags and $root->as_HTML(), and have tried adding
"placeholder" to %HTML::Tagset::optionalEndTag, but I have not found
anything that works yet.  

Any suggestions?
Thanks,
Dave

Reply via email to