So I am trying to do a very simple read of a PDF file. As simple as this:

    public String getSource(String pdfLink) throws IOException {
        log.debug("Getting the sources for pdfLink = {}", pdfLink);
        URL url = new URL(pdfLink);
        URLConnection connection = url.openConnection();
        // 60 seconds connection timeout
        connection.setConnectTimeout(1000 * 60);
        PdfReader reader = new PdfReader(this.getResponseAsBytes(connection.getInputStream()));

        int numberOfPages = reader.getNumberOfPages();
        StringBuilder builder = new StringBuilder();
        for (int i = 1; i <= numberOfPages; ++i) {
            String result = PdfTextExtractor.getTextFromPage(reader, i);
            System.out.println("AAAAAAAAAA = " + result);
            builder.append(result);
        }
        return builder.toString();
    }

    /**
     * read chunks of 1024 bytes from input stream and put them into the output stream
     */
    private byte[] getResponseAsBytes(InputStream inputStream) throws IOException {
        byte[] bytes = new byte[1024];
        ByteArrayOutputStream outputStream = new ByteArrayOutputStream();
        while (inputStream.read(bytes) != -1)
            outputStream.write(bytes);
        return outputStream.toByteArray();
    }

As a result I get:

    Exception in thread "main" java.lang.NullPointerException
        at com.itextpdf.text.pdf.DocumentFont.doType1TT(DocumentFont.java:400)
        at com.itextpdf.text.pdf.DocumentFont.init(DocumentFont.java:128)
        at com.itextpdf.text.pdf.DocumentFont.<init>(DocumentFont.java:113)
        at com.itextpdf.text.pdf.CMapAwareDocumentFont.<init>(CMapAwareDocumentFont.java:99)
        at com.itextpdf.text.pdf.parser.PdfContentStreamProcessor.getFont(PdfContentStreamProcessor.java:157)
        at com.itextpdf.text.pdf.parser.PdfContentStreamProcessor.access$4200(PdfContentStreamProcessor.java:79)
        at com.itextpdf.text.pdf.parser.PdfContentStreamProcessor$SetTextFont.invoke(PdfContentStreamProcessor.java:612)
        at com.itextpdf.text.pdf.parser.PdfContentStreamProcessor.invokeOperator(PdfContentStreamProcessor.java:267)
        at com.itextpdf.text.pdf.parser.PdfContentStreamProcessor.processContent(PdfContentStreamProcessor.java:387)
        at com.itextpdf.text.pdf.parser.PdfReaderContentParser.processContent(PdfReaderContentParser.java:79)
        at com.itextpdf.text.pdf.parser.PdfTextExtractor.getTextFromPage(PdfTextExtractor.java:73)
        at com.itextpdf.text.pdf.parser.PdfTextExtractor.getTextFromPage(PdfTextExtractor.java:88)
        at com.eugen.romina.v2.pdf.source.generator.FromPDFLinkToString.getSource(FromPDFLinkToString.java:43)
        at com.eugen.romina.v2.pdf.source.generator.FromPDFLinkToString.main(FromPDFLinkToString.java:63)

Funny thing: if I downgrade to 2.1.7, everything works fine. This very much sounds like a bug to me.

Cheers,
Eugene.

_______________________________________________
iText-questions mailing list
iText-questions@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/itext-questions

iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a reference to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples: http://itextpdf.com/themes/keywords.php
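
A side note on the getResponseAsBytes helper in the post: InputStream.read(byte[]) returns the number of bytes it actually placed in the buffer, and on a network stream that can be fewer than 1024. Writing the whole buffer on every pass can therefore append stale bytes to the result. Whether that has anything to do with the NullPointerException above is not established here; the following is only a minimal sketch of a read loop that copies exactly what was read. The class name StreamBytes and the method name readFully are hypothetical, chosen for illustration.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;

// Hypothetical helper; the class and method names are illustrative only.
final class StreamBytes {

    private StreamBytes() {
    }

    /** Reads the stream to its end and returns exactly the bytes that were read. */
    static byte[] readFully(InputStream inputStream) throws IOException {
        byte[] buffer = new byte[1024];
        ByteArrayOutputStream outputStream = new ByteArrayOutputStream();
        int bytesRead;
        // read() reports how many bytes it placed in the buffer; this may be
        // fewer than buffer.length, especially on a network stream.
        while ((bytesRead = inputStream.read(buffer)) != -1) {
            // Write only the bytes filled by this call, not the whole buffer.
            outputStream.write(buffer, 0, bytesRead);
        }
        return outputStream.toByteArray();
    }
}
```

With a helper like this, the byte array handed to PdfReader has the same length and content as the downloaded file, which at least rules out a corrupted download when comparing the behavior of different iText versions.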