[ 
https://issues.apache.org/jira/browse/TIKA-621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Matt Sheppard updated TIKA-621:
-------------------------------

    Description: 
I've run across an RTF documents which tika is failing to convert on 64bit 
platforms (Windows and Linux) using the Java 7 early access version. The same 
document is successfully converted on 32bit Windows and Linux, and using Java 6.

{noformat}
java -jar tika-app-0.9.jar -t full.rtf 
Exception in thread "main" org.apache.tika.exception.TikaException: Unexpected 
RuntimeException from org.apache.tika.parser.rtf.RTFParser@1fa78298
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:199)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
        at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:107)
        at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:302)
        at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:91)
Caused by: java.lang.NullPointerException
        at javax.swing.text.GapContent.compare(Unknown Source)
        at javax.swing.text.GapContent.findSortIndex(Unknown Source)
        at javax.swing.text.GapContent.createPosition(Unknown Source)
        at javax.swing.text.AbstractDocument.createPosition(Unknown Source)
        at javax.swing.text.AbstractDocument$LeafElement.<init>(Unknown Source)
        at javax.swing.text.AbstractDocument.createLeafElement(Unknown Source)
        at 
javax.swing.text.DefaultStyledDocument$ElementBuffer.insertElement(Unknown 
Source)
        at 
javax.swing.text.DefaultStyledDocument$ElementBuffer.insertUpdate(Unknown 
Source)
        at javax.swing.text.DefaultStyledDocument$ElementBuffer.insert(Unknown 
Source)
        at javax.swing.text.DefaultStyledDocument.insertUpdate(Unknown Source)
        at javax.swing.text.AbstractDocument.handleInsertString(Unknown Source)
        at javax.swing.text.AbstractDocument.insertString(Unknown Source)
        at 
org.apache.tika.parser.rtf.RTFParser$CustomStyledDocument.insertString(RTFParser.java:376)
        at 
javax.swing.text.rtf.RTFReader$DocumentDestination.deliverText(Unknown Source)
        at 
javax.swing.text.rtf.RTFReader$TextHandlingDestination.handleText(Unknown 
Source)
        at javax.swing.text.rtf.RTFReader.handleText(Unknown Source)
        at javax.swing.text.rtf.RTFParser.write(Unknown Source)
        at javax.swing.text.rtf.AbstractFilter.write(Unknown Source)
        at javax.swing.text.rtf.AbstractFilter.readFromStream(Unknown Source)
        at javax.swing.text.rtf.RTFEditorKit.read(Unknown Source)
        at org.apache.tika.parser.rtf.RTFParser.parse(RTFParser.java:112)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
        ... 5 more
{noformat}

Will upload the file in question somehwere shortly.

  was:
I've run across an RTF documents which tika is failing to convert on 64bit 
platforms (Windows and Linux) using the Java 7 early access version. The same 
document is successfully converted on 32bit Windows and Linux, and using Java 6.

{noformat}
java -jar tika-app-0.9.jar -t full.rtf 
Exception in thread "main" org.apache.tika.exception.TikaException: Unexpected 
RuntimeException from org.apache.tika.parser.rtf.RTFParser@1fa78298
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:199)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
        at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
        at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:107)
        at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:302)
        at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:91)
Caused by: java.lang.NullPointerException
        at javax.swing.text.GapContent.compare(Unknown Source)
        at javax.swing.text.GapContent.findSortIndex(Unknown Source)
        at javax.swing.text.GapContent.createPosition(Unknown Source)
        at javax.swing.text.AbstractDocument.createPosition(Unknown Source)
        at javax.swing.text.AbstractDocument$LeafElement.<init>(Unknown Source)
        at javax.swing.text.AbstractDocument.createLeafElement(Unknown Source)
        at 
javax.swing.text.DefaultStyledDocument$ElementBuffer.insertElement(Unknown 
Source)
        at 
javax.swing.text.DefaultStyledDocument$ElementBuffer.insertUpdate(Unknown 
Source)
        at javax.swing.text.DefaultStyledDocument$ElementBuffer.insert(Unknown 
Source)
        at javax.swing.text.DefaultStyledDocument.insertUpdate(Unknown Source)
        at javax.swing.text.AbstractDocument.handleInsertString(Unknown Source)
        at javax.swing.text.AbstractDocument.insertString(Unknown Source)
        at 
org.apache.tika.parser.rtf.RTFParser$CustomStyledDocument.insertString(RTFParser.java:376)
        at 
javax.swing.text.rtf.RTFReader$DocumentDestination.deliverText(Unknown Source)
        at 
javax.swing.text.rtf.RTFReader$TextHandlingDestination.handleText(Unknown 
Source)
        at javax.swing.text.rtf.RTFReader.handleText(Unknown Source)
        at javax.swing.text.rtf.RTFParser.write(Unknown Source)
        at javax.swing.text.rtf.AbstractFilter.write(Unknown Source)
        at javax.swing.text.rtf.AbstractFilter.readFromStream(Unknown Source)
        at javax.swing.text.rtf.RTFEditorKit.read(Unknown Source)
        at org.apache.tika.parser.rtf.RTFParser.parse(RTFParser.java:112)
        at 
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
        ... 5 more
{noformat}


> RTF parsing fails with Java 7 early access on 64bit platforms
> -------------------------------------------------------------
>
>                 Key: TIKA-621
>                 URL: https://issues.apache.org/jira/browse/TIKA-621
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 0.8, 0.9
>         Environment: $ java -version
> java version "1.7.0-ea"
> Java(TM) SE Runtime Environment (build 1.7.0-ea-b134)
> Java HotSpot(TM) 64-Bit Server VM (build 21.0-b04, mixed mode)
> (Seen using this version of Java on both Windows 2008 and CentOS 5)
>            Reporter: Matt Sheppard
>
> I've run across an RTF documents which tika is failing to convert on 64bit 
> platforms (Windows and Linux) using the Java 7 early access version. The same 
> document is successfully converted on 32bit Windows and Linux, and using Java 
> 6.
> {noformat}
> java -jar tika-app-0.9.jar -t full.rtf 
> Exception in thread "main" org.apache.tika.exception.TikaException: 
> Unexpected RuntimeException from org.apache.tika.parser.rtf.RTFParser@1fa78298
>       at 
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:199)
>       at 
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
>       at 
> org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
>       at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:107)
>       at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:302)
>       at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:91)
> Caused by: java.lang.NullPointerException
>       at javax.swing.text.GapContent.compare(Unknown Source)
>       at javax.swing.text.GapContent.findSortIndex(Unknown Source)
>       at javax.swing.text.GapContent.createPosition(Unknown Source)
>       at javax.swing.text.AbstractDocument.createPosition(Unknown Source)
>       at javax.swing.text.AbstractDocument$LeafElement.<init>(Unknown Source)
>       at javax.swing.text.AbstractDocument.createLeafElement(Unknown Source)
>       at 
> javax.swing.text.DefaultStyledDocument$ElementBuffer.insertElement(Unknown 
> Source)
>       at 
> javax.swing.text.DefaultStyledDocument$ElementBuffer.insertUpdate(Unknown 
> Source)
>       at javax.swing.text.DefaultStyledDocument$ElementBuffer.insert(Unknown 
> Source)
>       at javax.swing.text.DefaultStyledDocument.insertUpdate(Unknown Source)
>       at javax.swing.text.AbstractDocument.handleInsertString(Unknown Source)
>       at javax.swing.text.AbstractDocument.insertString(Unknown Source)
>       at 
> org.apache.tika.parser.rtf.RTFParser$CustomStyledDocument.insertString(RTFParser.java:376)
>       at 
> javax.swing.text.rtf.RTFReader$DocumentDestination.deliverText(Unknown Source)
>       at 
> javax.swing.text.rtf.RTFReader$TextHandlingDestination.handleText(Unknown 
> Source)
>       at javax.swing.text.rtf.RTFReader.handleText(Unknown Source)
>       at javax.swing.text.rtf.RTFParser.write(Unknown Source)
>       at javax.swing.text.rtf.AbstractFilter.write(Unknown Source)
>       at javax.swing.text.rtf.AbstractFilter.readFromStream(Unknown Source)
>       at javax.swing.text.rtf.RTFEditorKit.read(Unknown Source)
>       at org.apache.tika.parser.rtf.RTFParser.parse(RTFParser.java:112)
>       at 
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
>       ... 5 more
> {noformat}
> Will upload the file in question somehwere shortly.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to