https://bz.apache.org/bugzilla/show_bug.cgi?id=63813
--- Comment #4 from teresa....@linguamatics.com ---
(In reply to Axel Howind from comment #3)

Thanks for looking into this issue.

> -----
>
> Extracting the file on the command line yields:
>
> axel@xiaolong tmp % unzip ../symbol_test.doc
> Archive:  ../symbol_test.doc
> warning [../symbol_test.doc]:  10574 extra bytes at beginning or within
> zipfile
>   (attempting to process anyway)
>   inflating: [Content_Types].xml
>   inflating: _rels/.rels
>   inflating: theme/theme/themeManager.xml
>   inflating: theme/theme/theme1.xml
>   inflating: theme/theme/_rels/themeManager.xml.rels

I think that is because 'symbol_test' is of type 'doc', whereas 'Test.docx' is
of type 'ooxml docx'.

> Could it be that the file is corrupt? Compare with a simple test document:
>
> axel@xiaolong tmp % unzip ../Test.docx
> Archive:  ../Test.docx
>   inflating: [Content_Types].xml
>   inflating: _rels/.rels
>   inflating: word/_rels/document.xml.rels
>   inflating: word/document.xml
>   inflating: word/theme/theme1.xml
>   inflating: word/settings.xml
>   inflating: docProps/core.xml
>   inflating: word/fontTable.xml
>   inflating: word/webSettings.xml
>   inflating: word/styles.xml
>   inflating: docProps/app.xml
>
> But since Apple pages renders it correctly and you said that you have
> multiple such documents, maybe I am missing something.
>
> Anyway, I'm out of this one.

Yes, I have many such documents, and it is not only the 'greater than or
equal' symbol: other characters are also converted into '('. I need to
identify each of these characters so that I can postprocess them.
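
The 'doc' versus 'ooxml docx' distinction above can be checked without unzip by looking at the first bytes of each file. What follows is only a minimal sketch and not part of POI: the class name WordFormatSniffer and the method sniff are hypothetical. It assumes that a 'doc' file begins with the OLE2 compound-file signature and a 'docx' file begins with a ZIP local-file-header signature, and it needs Java 11 or later for readNBytes.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical helper (not a POI class): guesses the container format
// of a Word file from its leading bytes.
final class WordFormatSniffer {

    // OLE2 compound-file signature, assumed here for 'doc' files.
    private static final byte[] OLE2_MAGIC = {
        (byte) 0xD0, (byte) 0xCF, 0x11, (byte) 0xE0,
        (byte) 0xA1, (byte) 0xB1, 0x1A, (byte) 0xE1
    };

    // ZIP local-file-header signature "PK\3\4", assumed here for 'docx' files.
    private static final byte[] ZIP_MAGIC = { 'P', 'K', 3, 4 };

    // Returns "OLE2", "ZIP" or "UNKNOWN" for the given leading bytes.
    static String sniff(byte[] header) {
        if (startsWith(header, OLE2_MAGIC)) {
            return "OLE2";
        }
        if (startsWith(header, ZIP_MAGIC)) {
            return "ZIP";
        }
        return "UNKNOWN";
    }

    // True if data is at least as long as prefix and begins with it.
    private static boolean startsWith(byte[] data, byte[] prefix) {
        if (data.length < prefix.length) {
            return false;
        }
        for (int i = 0; i < prefix.length; i++) {
            if (data[i] != prefix[i]) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) throws IOException {
        // Each argument is a file path, e.g. symbol_test.doc Test.docx
        for (String name : args) {
            try (InputStream in = Files.newInputStream(Paths.get(name))) {
                byte[] header = in.readNBytes(OLE2_MAGIC.length);
                System.out.println(name + ": " + sniff(header));
            }
        }
    }
}
```

If symbol_test.doc is reported as OLE2 and Test.docx as ZIP, the two unzip listings are not comparable, and the warning about extra bytes would not by itself indicate a corrupt file.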