I was hoping someone on here might be able to point me towards any sample code there might be out there for using the HTML parser within Gecko without an associated on-screen window.
What I would like to do is modify the HTML parser so that the text elements in the html page are printed out as the URL is being loaded, rather than running a parser (any parser) on the URL and then stepping through the document after parsing. (My employer has specific reasons to do it this way rather than waiting for parsing to conclude). The general idea is to print out the page almost as is, but making some changes in the text nodes (for example, repeating the text in each, would be a trivial example). I ahve the transformation code already, BUT I have been having problems finding an HTML parser that will allow me to modify its internals. I want it to work for the various encodings of Japanese that are out there as well as various latin scripts. I've had a look at the Mozilla code and it seems that messing round with it is a nontrivial matter - Id be interested in knowing if, in your opinion, what I want to achieve is feasible with a week or two's work, or if I would be better off looking elsewhere, and if so, if you know of anywhere on the web where someone has done something similar. I'd really appreciate any responses. The email address quoted for me is not my work email, it is a spamtrap, anyone who wants to email me directly, please do so at donnla at linearb dot co dot no spam thanks uk. Thank you, Donnla.
