Hi All, I created a native html parser based on libhubbub, a parser library used by the netsurf browser project. There were quite a few html pages that didn't parse correctly on tautologistics's html parser so I thought it might be easier pulling in a parser from an existing web browser. I considered using webkit & firefox, but those browsers had too many external dependencies. The parser can operate in blocking or non-blocking mode, and streamed (chunked) data. The wonderful jsdom library uses tautologistics/node-htmlparser by default, but one can choose this parser as the overriding default. The readme shows an example of how this is done.
Github: https://github.com/deanmao/node-hubbub To install: npm install hubbub -- Job Board: http://jobs.nodejs.org/ Posting guidelines: https://github.com/joyent/node/wiki/Mailing-List-Posting-Guidelines You received this message because you are subscribed to the Google Groups "nodejs" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/nodejs?hl=en?hl=en
