The truly good solution would be a parser implemented in C and available in every standard installation of PHP. It could be used from other languages too.

I am keeping my fingers crossed, hoping that someone much more knowledgeable than I am will do this. :)


I have published [1] a simple pull parser (to be precise, a tokenizer) capable
of tokenizing both HTML and XML. It is very fast and simple, but it is written in C++. I believe its concept can be reused in PHP.
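
To make the "pull" idea concrete, here is a rough sketch in Python (chosen only for brevity). It is not the code from [1]: the class name, the token kinds and the regular expression are all assumptions made up for illustration, and it deliberately ignores attributes, comments, CDATA and entities. The point is only that the caller asks for one token at a time, rather than the parser pushing events at callbacks.

```python
import re

# Hypothetical token kinds for this sketch; not the names used in [1].
TAG_OPEN, TAG_CLOSE, TEXT, EOF = "TAG_OPEN", "TAG_CLOSE", "TEXT", "EOF"

# A tag is '<', an optional '/', a name, then anything up to the next '>'.
# Attributes are matched but not parsed in this sketch.
_TAG_RE = re.compile(r"<(/?)([A-Za-z][\w:.-]*)([^<>]*)>")


class PullTokenizer:
    """Hands out one token per call; the caller drives the loop."""

    def __init__(self, source):
        self.source = source
        self.pos = 0

    def next_token(self):
        src, pos = self.source, self.pos
        if pos >= len(src):
            return (EOF, "")
        m = _TAG_RE.match(src, pos)
        if m:
            self.pos = m.end()
            kind = TAG_CLOSE if m.group(1) else TAG_OPEN
            return (kind, m.group(2).lower())
        # Not a well-formed tag here: consume text up to the next '<'.
        # Searching from pos + 1 treats a stray '<' as text and
        # guarantees the tokenizer always makes progress.
        end = src.find("<", pos + 1)
        if end == -1:
            end = len(src)
        self.pos = end
        return (TEXT, src[pos:end])


def tokenize(source):
    """Convenience wrapper: pull tokens until EOF and collect them."""
    scanner = PullTokenizer(source)
    tokens = []
    while True:
        token = scanner.next_token()
        if token[0] == EOF:
            return tokens
        tokens.append(token)
```

Because the tokenizer keeps nothing but the source and a position, the same shape should carry over to PHP or any other language with little change; a real implementation would add attribute, comment and entity handling on top.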

[1] http://www.codeproject.com/cpp/HTML_XML_Scanner.asp

Andrew Fedoniouk.
http://terrainformatica.com



