DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://issues.apache.org/bugzilla/show_bug.cgi?id=30303>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

Cannot read word documents properties for docs created on Mac

           Summary: Cannot read word documents properties for docs created
                    on Mac
           Product: POI
           Version: unspecified
          Platform: Macintosh
        OS/Version: MacOS X
            Status: NEW
          Severity: Blocker
          Priority: Other
         Component: HPFS
        AssignedTo: [EMAIL PROTECTED]
        ReportedBy: [EMAIL PROTECTED]


Several Word documents (probably all) created on a Mac and opened on Windows to 
read their properties cause an UnsupportedEncodingException: cp10000 at the line:
value = new String(src, (int) first, l, codepageToEncoding(codepage));

I think this is caused by a parsing error when extracting the encoding. 
However, the actual text looked OK, so my workaround was:

value = new String(src, (int) first, l);
if (codepage != -1) {
    try {
        value = new String(src, (int) first, l,
                           codepageToEncoding(codepage));
    } catch (UnsupportedEncodingException ignore) {
        // The previous assignment is acceptable when the
        // encoding is not supported.
        // Want to throw a warning message here, but how?
    }
}
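
A minimal, self-contained sketch of the same fallback idea follows. It is not POI code: the class CodepageFallback and its methods encodingName and decode are hypothetical names made up for illustration. It assumes that the reported name "cp10000" comes from prefixing the codepage number with "cp", and it relies on the fact that codepage 10000 is Mac OS Roman, which Java knows as "MacRoman". A warning is printed to System.err as one possible answer to the open question in the comment above.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of POI: decodes property bytes and falls
// back to the platform default charset when the codepage is not usable.
class CodepageFallback {

    // Assumed mapping, for illustration only. Codepage 10000 is
    // Mac OS Roman; Java calls it "MacRoman", not "cp10000".
    static String encodingName(int codepage) {
        switch (codepage) {
            case 10000: return "MacRoman";
            case 65001: return "UTF-8";
            default:    return "Cp" + codepage;
        }
    }

    static String decode(byte[] src, int first, int len, int codepage) {
        if (codepage != -1) {
            String name = encodingName(codepage);
            try {
                if (Charset.isSupported(name)) {
                    return new String(src, first, len, Charset.forName(name));
                }
            } catch (IllegalArgumentException e) {
                // Illegal charset name: treat it like an unsupported one.
            }
            // One way to surface the problem instead of ignoring it.
            System.err.println("Warning: unsupported codepage " + codepage
                    + " (" + name + "), using the platform default charset");
        }
        // Same fallback as the workaround: platform default charset.
        return new String(src, first, len);
    }

    public static void main(String[] args) {
        byte[] src = "Hello".getBytes(StandardCharsets.US_ASCII);
        System.out.println(decode(src, 0, src.length, 10000));
    }
}
```

Checking Charset.isSupported before decoding avoids relying on the exception for control flow, and the platform-default fallback is only reached when no usable charset name is found.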

Let me know if you need any sample documents to demonstrate the error. I have 
quite a few of them.
