Re: text extraction

2010-09-06 Thread Andreas Lehmkühler
Hi, Gesendet: Sa, 04. Sep 2010 Von: reinhard schwabreinhard.sch...@aon.at extracted text with PDDocument doc = PDDocument.load(new URL( http://people.ischool.berkeley.edu/~hearst/irbook/print/chap10.pdf;)); PDFTextStripper stripper = new PDFTextStripper();

[jira] Commented: (PDFBOX-808) PDTrueTypeFont.loadTTF() freezes (at TTFDataStream.java:195)

2010-09-06 Thread Martin Hilpert (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12906447#action_12906447 ] Martin Hilpert commented on PDFBOX-808: --- I teste with

[jira] Created: (PDFBOX-817) IllegalArgumentException not catched or declared while creating ICC profile

2010-09-06 Thread Timo Boehme (JIRA)
IllegalArgumentException not catched or declared while creating ICC profile --- Key: PDFBOX-817 URL: https://issues.apache.org/jira/browse/PDFBOX-817 Project: PDFBox

[jira] Updated: (PDFBOX-817) IllegalArgumentException not catched or declared while creating ICC profile

2010-09-06 Thread Timo Boehme (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timo Boehme updated PDFBOX-817: --- Attachment: pdfbox_issue817.patch patch wrapping IllegalArgumentException in IOException and

[jira] Created: (PDFBOX-818) PDFParser fails if object/xref starts at same line as endobj of a stream object

2010-09-06 Thread Timo Boehme (JIRA)
PDFParser fails if object/xref starts at same line as endobj of a stream object --- Key: PDFBOX-818 URL: https://issues.apache.org/jira/browse/PDFBOX-818 Project: PDFBox

[jira] Updated: (PDFBOX-818) PDFParser fails if object/xref starts at same line as endobj of a stream object

2010-09-06 Thread Timo Boehme (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timo Boehme updated PDFBOX-818: --- Attachment: pdfbox_issue818.patch patch solving the described problem by first inserting a space