Tanmay Mandal created PDFBOX-1507:
-------------------------------------

             Summary: Getting Issue at text reading 
                 Key: PDFBOX-1507
                 URL: https://issues.apache.org/jira/browse/PDFBOX-1507
             Project: PDFBox
          Issue Type: Bug
          Components: Parsing
    Affects Versions: 1.7.1
         Environment: windows, runing pdfbox in .Net using ikvm-7.2.4630.5 
conversion , we are actually converting pdf into ALTO file
            Reporter: Tanmay Mandal
             Fix For: 1.8.0


<?xml version="1.0" encoding="UTF-8"?><alto xmlns="http://www.loc.gov/standards/
alto/alto-v2.0.xsd"><Description><MeasurementUnit>inch1200</MeasurementUnit></De
scription><Layout>
<Page>
<PrintSpace>
<TextBlock>
<TextLine>
Feb 04, 2013 8:40:03 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
WARNING: java.lang.NullPointerException
java.lang.NullPointerException
        at org.apache.pdfbox.util.PDFTextStripper.processTextPosition(PDFTextStr
ipper.java:954)
        at org.apache.pdfbox.util.PDFStreamEngine.processEncodedText(PDFStreamEn
gine.java:498)
        at org.apache.pdfbox.util.operator.ShowTextGlyph.process(ShowTextGlyph.j
ava:62)
        at org.apache.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngin
e.java:556)
        at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngi
ne.java:271)
        at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngi
ne.java:237)
        at org.apache.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.
java:218)
        at cli.org.apache.pdfbox.examples.util.PrintWordLocations.processDocumen
ts(PrintWordLocation.cs:185)
        at cli.org.apache.pdfbox.examples.util.PrintWordLocations.Main(PrintWord
Location.cs:228)
        at cli.System.AppDomain._nExecuteAssembly(Unknown Source)
        at cli.System.AppDomain.ExecuteAssembly(Unknown Source)
        at cli.Microsoft.VisualStudio.HostingProcess.HostProc.RunUsersAssembly(U
nknown Source)

Feb 04, 2013 8:40:03 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
WARNING: java.lang.NullPointerException
java.lang.NullPointerException
        at org.apache.pdfbox.util.PDFTextStripper.processTextPosition(PDFTextStr
ipper.java:954)
        at org.apache.pdfbox.util.PDFStreamEngine.processEncodedText(PDFStreamEn
gine.java:498)
        at org.apache.pdfbox.util.operator.ShowTextGlyph.process(ShowTextGlyph.j
ava:62)
        at org.apache.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngin
e.java:556)
        at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngi
ne.java:271)
        at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngi
ne.java:237)
        at org.apache.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.
java:218)
        at cli.org.apache.pdfbox.examples.util.PrintWordLocations.processDocumen
ts(PrintWordLocation.cs:185)
        at cli.org.apache.pdfbox.examples.util.PrintWordLocations.Main(PrintWord
Location.cs:228)
        at cli.System.AppDomain._nExecuteAssembly(Unknown Source)
        at cli.System.AppDomain.ExecuteAssembly(Unknown Source)
        at cli.Microsoft.VisualStudio.HostingProcess.HostProc.RunUsersAssembly(U
nknown Source)

Feb 04, 2013 8:40:03 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
WARNING: java.lang.NullPointerException
java.lang.NullPointerException
        at org.apache.pdfbox.util.PDFTextStripper.processTextPosition(PDFTextStr
ipper.java:954)
        at org.apache.pdfbox.util.PDFStreamEngine.processEncodedText(PDFStreamEn
gine.java:498)
        at org.apache.pdfbox.util.operator.ShowTextGlyph.process(ShowTextGlyph.j
ava:62)
        at org.apache.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngin
e.java:556)
        at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngi
ne.java:271)
        at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngi
ne.java:237)
        at org.apache.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.
java:218)
        at cli.org.apache.pdfbox.examples.util.PrintWordLocations.processDocumen
ts(PrintWordLocation.cs:185)
        at cli.org.apache.pdfbox.examples.util.PrintWordLocations.Main(PrintWord
Location.cs:228)
        at cli.System.AppDomain._nExecuteAssembly(Unknown Source)
        at cli.System.AppDomain.ExecuteAssembly(Unknown Source)
        at cli.Microsoft.VisualStudio.HostingProcess.HostProc.RunUsersAssembly(U
nknown Source)

Feb 04, 2013 8:40:03 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
WARNING: java.lang.NullPointerException
java.lang.NullPointerException
        at org.apache.pdfbox.util.PDFTextStripper.processTextPosition(PDFTextStr
ipper.java:954)
        at org.apache.pdfbox.util.PDFStreamEngine.processEncodedText(PDFStreamEn
gine.java:498)
        at org.apache.pdfbox.util.operator.ShowTextGlyph.process(ShowTextGlyph.j
ava:62)
        at org.apache.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngin
e.java:556)
        at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngi
ne.java:271)
        at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngi
ne.java:237)
        at org.apache.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.
java:218)
        at cli.org.apache.pdfbox.examples.util.PrintWordLocations.processDocumen
ts(PrintWordLocation.cs:185)
        at cli.org.apache.pdfbox.examples.util.PrintWordLocations.Main(PrintWord
Location.cs:228)
        at cli.System.AppDomain._nExecuteAssembly(Unknown Source)
        at cli.System.AppDomain.ExecuteAssembly(Unknown Source)
        at cli.Microsoft.VisualStudio.HostingProcess.HostProc.RunUsersAssembly(U
nknown Source)

</TextLine>
</TextBlock>
</PrintSpace>
</Page>


We have converted Java code in C# from https://github.com/cokernel/pdf2alto


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to