On 10/24/2014 09:44 PM, Jeff Hooker wrote:
I’ve attached a sample problem file. The problem itself I can reproduce
across number of files sharing the same author.
Use paste from word to drop the document into a Docbook Article. It will
work reasonably well, considering the fact it it’s an offensively
terrible document.
In my opinion,
1) Your document is an ``average'' MS-Word document, not a ``terrible'' one.
2) Using XXE v6.1, "Paste from Word" works *quite* *well* on the
document you sent us (safe for the low-res, ugly, image files which are
generated by MS-Word itself, not by "Paste from Word").
The generated DocBook document would have been much better if your
MS-Word author used real headings, real figure captions, real hyperlinks.
Instead your MS-Word author created paragraphs having custom styles.
These custom styles are indeed nice to look at, but they do not convey
any structure information like most MS-Word standard styles do. Example:
your "Heading 1" custom style has no outline level.
Of course, this can be fixed by customizing "Paste from Word" as
explained in this document:
XMLmind XML Editor - How to adapt "Paste from Word" to your needs
http://www.xmlmind.com/xmleditor/_distrib/doc/pastefromword/index.html
Next, delete the title called “Remove This” and then Paste From Word
again. This time the first table will end up with two titles
This is clearly a bug as this makes the DocBook document invalid.
and the
second table will have no title at all. If there were further tables in
the document, all the table titles would be moved up by one, until they
attempted to cross a boundary marked by a Title.
Yes, that's right. Your description of the issue is 100% accurate. In
fact, "Paste from Word", while not giving the best possible results,
works as expected.
In MS-Word, a table or figure caption is simply paragraph styled as a
caption which may be found before or *after* the table or figure.
Currently "Paste from Word" uses a simple heuristic to solve the
following problem: do I have to attach this caption be to the table
immediately preceding the caption or to the table immediately following
the caption?
Adding a heading like your “Remove This” of course helps "Paste from
Word" to make the right choice.
Needless to say that we'll try to improve the heuristic in the next
version of XMLmind XML Editor.
Many thanks for reporting this issue. You are welcome to send us as many
MS-Word documents posing problems as you wish.
--
XMLmind XML Editor Support List
[email protected]
http://www.xmlmind.com/mailman/listinfo/xmleditor-support