I found this piece of opensource software: http://memberwebs.com/nielsen/software/rtfx/ which is at least 10 times faster than any commercial products I've tried at turning an RTF file into an XML file which can then be parsed with various XML tools. I know python can be used to take apart an RTF file directly, but this tool seems to allow me to use other tools for automatic conversion with no coding.
Dave
Nice find! Yes, it converts the NCHS ICD-9-CM RTF files perfectly, and teh resulting XML has enough hints contained in teh arguments to the <para> tags to be able to extract the semantic meaning we need.
And rtfx runs on all platforms as a command line programme - perfect!
Tim C
smime.p7s
Description: S/MIME Cryptographic Signature
