https://bz.apache.org/bugzilla/show_bug.cgi?id=63813

--- Comment #4 from teresa....@linguamatics.com ---
(In reply to Axel Howind from comment #3)
Thanks for looking into this issue.
> 
> -----
> 
> Extracting the file on the command line yields:
> 
> axel@xiaolong tmp % unzip ../symbol_test.doc 
> Archive:  ../symbol_test.doc
> warning [../symbol_test.doc]:  10574 extra bytes at beginning or within
> zipfile
>   (attempting to process anyway)
>   inflating: [Content_Types].xml     
>   inflating: _rels/.rels             
>   inflating: theme/theme/themeManager.xml  
>   inflating: theme/theme/theme1.xml  
>   inflating: theme/theme/_rels/themeManager.xml.rels  
> 

I think it is since 'symbol_test' is 'doc' type where as 'Test.docx' is 'ooxml
docx' type.

> Could it be that the file is corrupt? Compare with a simple test document:
> 
> axel@xiaolong tmp % unzip ../Test.docx 
> Archive:  ../Test.docx
>   inflating: [Content_Types].xml     
>   inflating: _rels/.rels             
>   inflating: word/_rels/document.xml.rels  
>   inflating: word/document.xml       
>   inflating: word/theme/theme1.xml   
>   inflating: word/settings.xml       
>   inflating: docProps/core.xml       
>   inflating: word/fontTable.xml      
>   inflating: word/webSettings.xml    
>   inflating: word/styles.xml         
>   inflating: docProps/app.xml
> 
> But since Apple pages renders it correctly and you said that you have
> multiple such documents, maybe I am missing something.
> 
> Anyway, I'm out of this one.

Yes I have many documents and besides it is not only 'greater than equal'
symbol but there are other characters that are converetd into '('. 
I am in need of identifying each of this character to postprocess it.

-- 
You are receiving this mail because:
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org
For additional commands, e-mail: dev-h...@poi.apache.org

Reply via email to