[
https://issues.apache.org/jira/browse/TIKA-1176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14362582#comment-14362582
]
Tyler Palsulich commented on TIKA-1176:
---------------------------------------
That section of the Parser seems to be completely commented out. The attached
file now causes an index out of bounds Exception (wrapped by a TikaException):
{code}
Exception in thread "main" org.apache.tika.exception.TikaException
at
org.apache.tika.parser.chm.core.ChmExtractor.extractChmEntry(ChmExtractor.java:360)
at org.apache.tika.parser.chm.ChmParser.parse(ChmParser.java:79)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:270)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:270)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:153)
at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:450)
at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:123)
{code}
> ChmDirectoryListingSet does not correctly enumerate directory entries
> ---------------------------------------------------------------------
>
> Key: TIKA-1176
> URL: https://issues.apache.org/jira/browse/TIKA-1176
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.4
> Reporter: Doug Martin
> Attachments: HelpStudioSample.chm
>
>
> ChmDirectoryListingSet.enumerateOneSegment method does not correctly
> enumerate directory entries when ChmCommons.indexOf returns -1 for work data
> or user data. Here is the offending code:
> {code}
> int indexWorkData = ChmCommons.indexOf(dir_chunk,
> "::".getBytes());
> int indexUserData = ChmCommons.indexOf(dir_chunk,
> "/".getBytes());
> if (indexUserData < indexWorkData)
> setPlaceHolder(indexUserData);
> else
> setPlaceHolder(indexWorkData);
> if (getPlaceHolder() > 0 ...
> {code}
> If either indexUserData or indexWorkData is -1, that value will be set as the
> placeholder index, resulting in the method returning without processing any
> entries.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)