https://issues.apache.org/bugzilla/show_bug.cgi?id=53556
Priority: P2
Bug ID: 53556
Assignee: [email protected]
Summary: Mispositioned Textboxes In Reading Doc Files Through
HWPF
Severity: major
Classification: Unclassified
OS: Linux
Reporter: [email protected]
Hardware: PC
Status: NEW
Version: 3.8
Component: HWPF
Product: POI
Created attachment 29070
--> https://issues.apache.org/bugzilla/attachment.cgi?id=29070&action=edit
This is the document which i was unable to read properly.
I tried reading doc and docx files using Apache POI 3.8. It worked fine until i
encountered textboxes.
If the format of the document is like this:
paragraph 1
textbox 1
paragraph 2
textbox 2
paragraph 3
Then the output should be:
paragraph 1 textbox 1 paragraph 2 textbox 2 paragraph 3
But HWPF reads such .doc file as:
paragraph 1 paragraph 2 paragraph 3 textbox 1 textbox 2
It seems to be adding textboxes at the end and not at the place where it should
be, i.e. between the paragraphs.
In case of .docx files, XWPF didn't read textboxes at all.
I tried methods getText(), getTextFromPieces(), extractText(),
getParagraphText(), but none of these helped.
--
You are receiving this mail because:
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]