https://issues.apache.org/bugzilla/show_bug.cgi?id=48075
Summary: Broken paragraph to text mapping in some documents
Product: POI
Version: 3.5-dev
Platform: PC
OS/Version: Linux
Status: NEW
Severity: normal
Priority: P2
Component: HWPF
AssignedTo: [email protected]
ReportedBy: [email protected]
WordExtractor.getParagraphText() extracts incomplete and broken text data from
attached document. Hovever, WordExtractor.getTextFromPieces() extracts complete
correct text (the same as in MS Office).
It seems that there is a problem in paragraph to text mapping.
Problem exists on few documents from the same source, text extraction from many
other documents works fine.
POI version poi-3.6-beta1-20091002 (svn trunk)
--
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]