Hi James,
I kind of remember that you created a script to parse the HTML 5
specification and break it into pieces? Using [html5lib][1] probably?
I think that was me instead.
Do you have a handy link for the script?
http://html5.googlecode.com/svn/trunk/spec-splitter/spec-splitter.py
Did you choose to break on specific heading levels?
It breaks on <h2>, <h3>, and a few hard-coded extra headings ('video',
'the-canvas', 'the-command', 'tokenisation', 'tree-construction')
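
For illustration, here is a rough sketch of that splitting rule. It is not the code from spec-splitter.py: it uses a plain regular expression instead of a real HTML parser, it assumes the extra headings are identified by their id attribute, and the names SPLIT_TAGS, EXTRA_IDS and split_spec are made up for this example.

```python
import re

# Hypothetical constants mirroring the rule described in the message.
SPLIT_TAGS = {"h2", "h3"}
# The names come from the message; treating them as id values is an assumption.
EXTRA_IDS = {"video", "the-canvas", "the-command",
             "tokenisation", "tree-construction"}

# Matches an opening heading tag and captures its name and raw attributes.
HEADING_RE = re.compile(r"<(h[1-6])\b([^>]*)>", re.IGNORECASE)
# Pulls the value of a plain id attribute (not data-id and the like).
ID_RE = re.compile(r"(?<![\w-])id\s*=\s*[\"']?([^\"'\s>]+)", re.IGNORECASE)


def is_split_point(tag, attrs):
    """Return True if a heading with this tag and attributes starts a new piece."""
    if tag.lower() in SPLIT_TAGS:
        return True
    match = ID_RE.search(attrs)
    return bool(match and match.group(1) in EXTRA_IDS)


def split_spec(html):
    """Cut the document just before every heading that is a split point."""
    starts = [m.start() for m in HEADING_RE.finditer(html)
              if is_split_point(m.group(1), m.group(2))]
    bounds = [0] + starts + [len(html)]
    # Drop empty pieces, e.g. when the document starts with a split heading.
    return [html[a:b] for a, b in zip(bounds, bounds[1:]) if html[a:b].strip()]


if __name__ == "__main__":
    sample = ('<h1>Spec</h1><p>intro</p>'
              '<h2 id="a">A</h2><p>x</p>'
              '<h4 id="video">Video</h4><p>v</p>'
              '<h4 id="other">Other</h4><p>o</p>'
              '<h3>B</h3>')
    for piece in split_spec(sample):
        print(piece)
```

In the sample, the <h4 id="other"> heading stays inside the 'video' piece because it is neither an <h2>/<h3> nor one of the listed extras. A real splitter would work on a parsed tree so that each piece stays well-formed; the regex here only shows where the cuts fall.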
Best.
--
Philip Taylor
[EMAIL PROTECTED]