[jira] [Commented] (TIKA-2118) Misleading exception on a password protected XLS

2016-10-20 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15592103#comment-15592103
 ] 

Tim Allison commented on TIKA-2118:
---

Added an EncryptedDocumentException "encryption not supported for old excel 
files" in POI

> Misleading exception on a password protected XLS
> 
>
> Key: TIKA-2118
> URL: https://issues.apache.org/jira/browse/TIKA-2118
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 1.13
> Environment: Windows 7 x64, JVM 1.8.0_101
>Reporter: Seva Alekseyev
> Attachments: BUSJDRVGZF7FKDA6L4PNTNATHQCLRW4O.xls
>
>
> When parsing the following password protected Excel file:
> https://dl.dropboxusercontent.com/u/92341073/Copy%20of%20I-LHD%203E.xls
> Tika emits an IllegalArgumentException with a message "Unsupported codepage 
> requested". The inability to parse has nothing to do with codepage, that 
> error is misleading.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (TIKA-2118) Misleading exception on a password protected XLS

2016-10-20 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15592059#comment-15592059
 ] 

Tim Allison commented on TIKA-2118:
---

https://bz.apache.org/bugzilla/show_bug.cgi?id=60284

Turns out that file is encrypted as part of write-protect.  Encryption for old 
Excel files is not supported.

> Misleading exception on a password protected XLS
> 
>
> Key: TIKA-2118
> URL: https://issues.apache.org/jira/browse/TIKA-2118
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 1.13
> Environment: Windows 7 x64, JVM 1.8.0_101
>Reporter: Seva Alekseyev
> Attachments: BUSJDRVGZF7FKDA6L4PNTNATHQCLRW4O.xls
>
>
> When parsing the following password protected Excel file:
> https://dl.dropboxusercontent.com/u/92341073/Copy%20of%20I-LHD%203E.xls
> Tika emits an IllegalArgumentException with a message "Unsupported codepage 
> requested". The inability to parse has nothing to do with codepage, that 
> error is misleading.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (TIKA-2118) Misleading exception on a password protected XLS

2016-10-19 Thread Seva Alekseyev (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15590401#comment-15590401
 ] 

Seva Alekseyev commented on TIKA-2118:
--

I'll have to do some homework for that.

> Misleading exception on a password protected XLS
> 
>
> Key: TIKA-2118
> URL: https://issues.apache.org/jira/browse/TIKA-2118
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 1.13
> Environment: Windows 7 x64, JVM 1.8.0_101
>Reporter: Seva Alekseyev
>
> When parsing the following password protected Excel file:
> https://dl.dropboxusercontent.com/u/92341073/Copy%20of%20I-LHD%203E.xls
> Tika emits an IllegalArgumentException with a message "Unsupported codepage 
> requested". The inability to parse has nothing to do with codepage, that 
> error is misleading.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (TIKA-2118) Misleading exception on a password protected XLS

2016-10-19 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15590311#comment-15590311
 ] 

Tim Allison commented on TIKA-2118:
---

Does this happen with non-password-protected XLS version 5 files?

> Misleading exception on a password protected XLS
> 
>
> Key: TIKA-2118
> URL: https://issues.apache.org/jira/browse/TIKA-2118
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 1.13
> Environment: Windows 7 x64, JVM 1.8.0_101
>Reporter: Seva Alekseyev
>
> When parsing the following password protected Excel file:
> https://dl.dropboxusercontent.com/u/92341073/Copy%20of%20I-LHD%203E.xls
> Tika emits an IllegalArgumentException with a message "Unsupported codepage 
> requested". The inability to parse has nothing to do with codepage, that 
> error is misleading.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (TIKA-2118) Misleading exception on a password protected XLS

2016-10-14 Thread Seva Alekseyev (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15577172#comment-15577172
 ] 

Seva Alekseyev commented on TIKA-2118:
--

The codepage number in the exception is bogus. In my file library, I saw 
similar exceptions for codepages all over the place. Some part of the file is 
misparsed and it comes out as codepage number, but it's not.

> Misleading exception on a password protected XLS
> 
>
> Key: TIKA-2118
> URL: https://issues.apache.org/jira/browse/TIKA-2118
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 1.13
> Environment: Windows 7 x64, JVM 1.8.0_101
>Reporter: Seva Alekseyev
>
> When parsing the following password protected Excel file:
> https://dl.dropboxusercontent.com/u/92341073/Copy%20of%20I-LHD%203E.xls
> Tika emits an IllegalArgumentException with a message "Unsupported codepage 
> requested". The inability to parse has nothing to do with codepage, that 
> error is misleading.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (TIKA-2118) Misleading exception on a password protected XLS

2016-10-14 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1557#comment-1557
 ] 

Tim Allison commented on TIKA-2118:
---

You may want to check with the POI users list.  Would the desired outcome be an 
EncryptedFileException or similar?

If the file weren't encrypted, would the current behavior be ok?  The parser 
basically doesn't know what to do with cp3197...and I think that's reasonable.

> Misleading exception on a password protected XLS
> 
>
> Key: TIKA-2118
> URL: https://issues.apache.org/jira/browse/TIKA-2118
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 1.13
> Environment: Windows 7 x64, JVM 1.8.0_101
>Reporter: Seva Alekseyev
>
> When parsing the following password protected Excel file:
> https://dl.dropboxusercontent.com/u/92341073/Copy%20of%20I-LHD%203E.xls
> Tika emits an IllegalArgumentException with a message "Unsupported codepage 
> requested". The inability to parse has nothing to do with codepage, that 
> error is misleading.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)