DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT <http://issues.apache.org/bugzilla/show_bug.cgi?id=30235>. ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND INSERTED IN THE BUG DATABASE.
http://issues.apache.org/bugzilla/show_bug.cgi?id=30235 [PATCH] Fixes for unicode/tables etc for real world documents Summary: [PATCH] Fixes for unicode/tables etc for real world documents Product: POI Version: unspecified Platform: Other OS/Version: Other Status: NEW Severity: Critical Priority: Other Component: HDF AssignedTo: [EMAIL PROTECTED] ReportedBy: [EMAIL PROTECTED] A merge of the latest code from Goss Interactive that includes the fixes that we rolled into our latest product release. This has been tested against a series of testcase word documents that were supplied by clients or created specifically in-house. Platforms and formats tested include: Office 97/2k/xp, Mac, OpenOffice. All testcases can now be read correctly. Writing is not totally transparent and still needs some work. From our further testing, previous HWPF write functionality appears to remain unaffected. Files altered: ============== hwpf/model/ListLevel.java ------------------------- - fix for listlevel() - papx and chpx arrays were being copied in the wrong order - added getLevelProperties() hwpf/model/ListTables.java -------------------------- - added getListData() hwpf/model/SectionTable.java ---------------------------- - fixed CPtoFC() to accomodate non-contigous textpieces hwpf/model/TextPiece.java ------------------------- - added CP_start property and accessor used by CPtoFC in model/SectionTable.java hwpf/model/TextPiecetable.java ------------------------------ - fix for constructor to take into account non-contigous textpieces hwpf/sprm/ParagraphSprmUncompressor.java ---------------------------------------- - fix for tabs being read as INTs instead of the SHORTs that they are hwpf/sprm/TableSprmUncompressor ------------------------------- - fix for TC entries not always being present in the Word file hwpf/usermodel/CharacterRun.java -------------------------------- - changed to use updateSprm() rather than addSprm() to match code elsewhere that prevents additional sprms being created if they already exist - added accessors for properties that Word uses when handling embedded objects such as Hyperlinks, Pictures, OleObjects etc hwpf/usermodel/Paragraph.java ----------------------------- - added accessors for Ilfo and Ilvl used by Word for numbered lists hwpf/usermodel/Range.java ------------------------- - fixed text() to correctly return unicode text - fixed findRange() to prevent loop indexes going out of bounds, causing an exception - fixed getTable() to cope with tables that start at the beginning of a section such that they don't get merged with preceeding tables - fixed getTable() to return a table at the correct tablelevel hwpf/usermodel/Table.java ------------------------- - minor code tidy hwpf/usermodel/TableRow.java ---------------------------- - removed constraint requiring levelNum==1 in constructor that doesn't work when dealing with documents that include sections --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
