On 7/22/2019 5:18 AM, Roger Shuttleworth wrote:
I have a set of well over 100 Word documents (I know...) that I would like to convert to simple XML (not the kind that Word exports!). They are all four pages long and pretty consistent in terms of structure, and paragraph styles are used for the most part, though not character styles. If you were me, what methods would you look at?

I have used structured FM for years and am familiar with DITA and DocBook. I know that there is a route from Word doc > FrameMaker > Structured FrameMaker > XML that would involve creating a conversion table, a DTD, and a structured application. I have done that in the past, though it was a few years ago. I realise that it would mean a lot of up-front work to get it working, as well as ensuring that styles are used fully and consistently in my source documents.
Roger,
   You now have three approaches to consider--conversion table, MIF2Go, and Word XML. Often in such projects, the developer's experience has a lot to do with the chosen route. I would probably start with a conversion table, and since you have done that before, it may be the most straightforward approach for you. I often touch up the structure produced by a conversion table with XSLT. I would be cautious about starting from Word XML because it is heavily focused on formatting details, so there would be a lot to ignore.
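
   As a rough illustration of the kind of touch-up meant here, the sketch below wraps each run of adjacent item elements in a list element, a typical fix when a conversion produces flat structure. It uses Python's standard library rather than XSLT, and the element names (topic, item, list) and the function name wrap_runs are hypothetical, not taken from any particular DTD or conversion table.

```python
import xml.etree.ElementTree as ET

def wrap_runs(parent, child_tag="item", wrapper_tag="list"):
    """Wrap each run of adjacent <child_tag> children of parent in a <wrapper_tag>.

    Element names are illustrative assumptions; substitute the ones
    your own conversion table actually produces.
    """
    new_children = []
    current = None  # wrapper for the run being collected, if any
    for child in list(parent):
        if child.tag == child_tag:
            if current is None:
                current = ET.Element(wrapper_tag)
                new_children.append(current)
            current.append(child)
        else:
            current = None  # any other element ends the run
            new_children.append(child)
    parent[:] = new_children  # replace the children in one step
    return parent

flat = ET.fromstring(
    "<topic><title>T</title><item>a</item><item>b</item>"
    "<para>p</para><item>c</item></topic>"
)
print(ET.tostring(wrap_runs(flat), encoding="unicode"))
```

   The same transformation in XSLT would need a grouping idiom, so for a one-off touch-up either tool is reasonable; the choice mostly comes down to which one you already know.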

   Your last clause, "ensuring that styles are used fully and consistently in my source documents," may well indicate where the bulk of the work has to be done. You say the documents are four pages each and use paragraph styles for the most part, but even in the best of practical cases there is probably a lot of work to do.
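
   One way to gauge how much style cleanup lies ahead is to tally the paragraph styles each document actually uses. The sketch below is only that, a sketch: it assumes the files are saved as .docx (not the older binary .doc), reads only the main body part word/document.xml, and reports style IDs rather than the display names shown in Word. The function name paragraph_styles and the "(default)" label are made up for the example.

```python
import zipfile
import xml.etree.ElementTree as ET
from collections import Counter

# WordprocessingML main namespace used inside .docx packages
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"

def paragraph_styles(docx_file):
    """Count paragraph style IDs in the main body of a .docx file.

    docx_file may be a path or a binary file-like object.
    Paragraphs with no explicit style are counted as "(default)".
    """
    with zipfile.ZipFile(docx_file) as z:
        root = ET.fromstring(z.read("word/document.xml"))
    counts = Counter()
    for p in root.iter(f"{{{W}}}p"):
        style = p.find(f"{{{W}}}pPr/{{{W}}}pStyle")
        if style is not None:
            counts[style.get(f"{{{W}}}val")] += 1
        else:
            counts["(default)"] += 1
    return counts
```

   Running this over all the files and comparing the counters would show quickly which documents stray from the expected set of styles, for example `paragraph_styles("report.docx")` for a hypothetical file name.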

   Also, you mention creating a DTD as part of a conversion table approach. Does the target XML you want to create use a DTD? A schema? Neither? Has it been designed? FM can export XML without a DTD, although tables, graphics, and cross-references may require one.

   And I will join the other respondents and offer to meet with you online to look at a conversion table approach.
    --Lynne


--
Lynne A. Price
Text Structure Consulting, Inc.
Specializing in structured FrameMaker consulting, application development, and training
[email protected]            http://www.txstruct.com
voice/fax: (510) 583-1505      cell phone: (510) 421-2284

_______________________________________________

This message is from the Framers mailing list
