Hi James,
I seem to remember that you created a script to parse the HTML 5 specification and break it into pieces, probably using [html5lib][1]?
Do you have a handy link for the script? Did you choose to break on specific heading levels?

Best.

[1]: http://code.google.com/p/html5lib/

--
Karl Dubost - W3C
http://www.w3.org/QA/
Be Strict To Be Cool
