[ 
https://issues.apache.org/jira/browse/TIKA-1560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14337112#comment-14337112
 ] 

Tim Allison edited comment on TIKA-1560 at 2/25/15 8:22 PM:
------------------------------------------------------------

When I just tested this file with Tika 1.8-SNAPSHOT with -Xmx6g, I got this 
exception:
{noformat}
Exception in thread "main" org.apache.tika.exception.TikaException: Unexpected R
untimeException from org.apache.tika.parser.microsoft.TNEFParser@3e9b1010
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:258
)
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:256
)
        at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:1
20)
        at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:146)
        at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:440)
        at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:116)
Caused by: java.lang.IllegalArgumentException: Unknown type 9914 / 0x26BA - 0x26
ba @ -1
        at org.apache.poi.hmef.attribute.MAPIAttribute.getLength(MAPIAttribute.j
ava:200)
        at org.apache.poi.hmef.attribute.MAPIAttribute.create(MAPIAttribute.java
:166)
        at org.apache.poi.hmef.attribute.TNEFMAPIAttribute.<init>(TNEFMAPIAttrib
ute.java:41)
        at org.apache.poi.hmef.attribute.TNEFAttribute.create(TNEFAttribute.java
:67)
        at org.apache.poi.hmef.HMEFMessage.processMessage(HMEFMessage.java:97)
        at org.apache.poi.hmef.HMEFMessage.process(HMEFMessage.java:79)
        at org.apache.poi.hmef.HMEFMessage.<init>(HMEFMessage.java:64)
        at org.apache.tika.parser.microsoft.TNEFParser.parse(TNEFParser.java:80)

        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:256
)
        ... 5 more
{noformat}

I also got an OOM when I tried with 2g and 4g...


was (Author: [email protected]):
When I just tested this file with Tika 1.8-SNAPSHOT with -Xmx6g, I got this 
error:
{noformat}
Exception in thread "main" org.apache.tika.exception.TikaException: Unexpected R
untimeException from org.apache.tika.parser.microsoft.TNEFParser@3e9b1010
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:258
)
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:256
)
        at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:1
20)
        at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:146)
        at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:440)
        at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:116)
Caused by: java.lang.IllegalArgumentException: Unknown type 9914 / 0x26BA - 0x26
ba @ -1
        at org.apache.poi.hmef.attribute.MAPIAttribute.getLength(MAPIAttribute.j
ava:200)
        at org.apache.poi.hmef.attribute.MAPIAttribute.create(MAPIAttribute.java
:166)
        at org.apache.poi.hmef.attribute.TNEFMAPIAttribute.<init>(TNEFMAPIAttrib
ute.java:41)
        at org.apache.poi.hmef.attribute.TNEFAttribute.create(TNEFAttribute.java
:67)
        at org.apache.poi.hmef.HMEFMessage.processMessage(HMEFMessage.java:97)
        at org.apache.poi.hmef.HMEFMessage.process(HMEFMessage.java:79)
        at org.apache.poi.hmef.HMEFMessage.<init>(HMEFMessage.java:64)
        at org.apache.tika.parser.microsoft.TNEFParser.parse(TNEFParser.java:80)

        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:256
)
        ... 5 more
{noformat}



> OutOfMemoryError analyzinig specific file
> -----------------------------------------
>
>                 Key: TIKA-1560
>                 URL: https://issues.apache.org/jira/browse/TIKA-1560
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.7
>         Environment: OS: Ubuntu Linux 12.04 and 14.04
> JVM: OpenJDK 1.7, Oracle JDK 1.7, Oracle JDK 1.8
>            Reporter: Rubén Bressler
>         Attachments: e3284d17-c814-46c1-b33e-ee774305d987.dat
>
>
> I have a specific file when applying tika-app.jar it tries to process ends 
> with an OOM error. The output is the same no matter what virtual machine or 
> Tika version is used.
> {code}
> $ java -jar tika-app-1.7.jar e3284d17-c814-46c1-b33e-ee774305d987.dat 
> Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
>       at java.lang.StringCoding$StringDecoder.decode(StringCoding.java:149)
>       at java.lang.StringCoding.decode(StringCoding.java:193)
>       at java.lang.String.<init>(String.java:414)
>       at org.apache.poi.util.StringUtil.getFromUnicodeLE(StringUtil.java:77)
>       at 
> org.apache.poi.hmef.attribute.MAPIAttribute.create(MAPIAttribute.java:149)
>       at 
> org.apache.poi.hmef.attribute.TNEFMAPIAttribute.<init>(TNEFMAPIAttribute.java:41)
>       at 
> org.apache.poi.hmef.attribute.TNEFAttribute.create(TNEFAttribute.java:67)
>       at org.apache.poi.hmef.HMEFMessage.processMessage(HMEFMessage.java:97)
>       at org.apache.poi.hmef.HMEFMessage.process(HMEFMessage.java:79)
>       at org.apache.poi.hmef.HMEFMessage.<init>(HMEFMessage.java:64)
>       at org.apache.tika.parser.microsoft.TNEFParser.parse(TNEFParser.java:80)
>       at 
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:256)
>       at 
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:256)
>       at 
> org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
>       at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:146)
>       at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:440)
>       at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:116)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to