[jira] [Updated] (PDFBOX-1859) ClassCastException for unknown destination type
[ https://issues.apache.org/jira/browse/PDFBOX-1859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hendrik Lescak updated PDFBOX-1859:
---
    Attachment: Speisepläne.pdf

> ClassCastException for unknown destination type
> ---
>
>                 Key: PDFBOX-1859
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1859
>             Project: PDFBox
>          Issue Type: Bug
>          Components: PDModel
>    Affects Versions: 1.8.3, 2.0.0
>            Reporter: Hendrik Lescak
>         Attachments: Speisepläne.pdf
>
>
> Trying to read the outlines failed for the attached document.
> {code:java}
> import java.io.IOException;
> import org.apache.pdfbox.pdmodel.PDDocument;
> import org.apache.pdfbox.pdmodel.interactive.documentnavigation.destination.PDDestination;
> import org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineItem;
> import org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineNode;
>
> /**
>  * @author <a href="mailto:andre.kisch...@interface-projects.de">André Kischkel</a>
>  * @since 22.01.2014
>  * @version $Revision$
>  */
> public class TestPDDestination {
>     public static void main(String[] args) throws IOException {
>         PDDocument doc = PDDocument.load("Speisepläne.pdf");
>         traverse(doc.getDocumentCatalog().getDocumentOutline());
>         doc.close();
>     }
>
>     static void traverse(PDOutlineNode node) throws IOException {
>         if (node instanceof PDOutlineItem) {
>             PDDestination dst = ((PDOutlineItem) node).getDestination();
>             /**
>              * throws java.lang.ClassCastException: org.apache.pdfbox.cos.COSFloat cannot be cast to org.apache.pdfbox.cos.COSName,
>              * but should be something like a PDPageXYZDestination!
>              */
>             System.out.println(dst);
>         }
>         for (PDOutlineItem child = node.getFirstChild(); child != null; child = child.getNextSibling()) {
>             traverse(child);
>         }
>     }
> }
> {code}

--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
[jira] [Updated] (PDFBOX-1859) ClassCastException for unknown destination type
[ https://issues.apache.org/jira/browse/PDFBOX-1859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hendrik Lescak updated PDFBOX-1859:
---
    Description:
Trying to read the outlines failed for the attached document.
{code}
import java.io.IOException;
import org.apache.pdfbox.pdmodel.PDDocument;
import org.apache.pdfbox.pdmodel.interactive.documentnavigation.destination.PDDestination;
import org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineItem;
import org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineNode;

/**
 * @author <a href="mailto:andre.kisch...@interface-projects.de">André Kischkel</a>
 * @since 22.01.2014
 * @version $Revision$
 */
public class TestPDDestination {
    public static void main(String[] args) throws IOException {
        PDDocument doc = PDDocument.load("Speisepläne.pdf");
        traverse(doc.getDocumentCatalog().getDocumentOutline());
        doc.close();
    }

    static void traverse(PDOutlineNode node) throws IOException {
        if (node instanceof PDOutlineItem) {
            PDDestination dst = ((PDOutlineItem) node).getDestination();
            /**
             * throws java.lang.ClassCastException: org.apache.pdfbox.cos.COSFloat cannot be cast to org.apache.pdfbox.cos.COSName,
             * but should be something like a PDPageXYZDestination!
             */
            System.out.println(dst);
        }
        for (PDOutlineItem child = node.getFirstChild(); child != null; child = child.getNextSibling()) {
            traverse(child);
        }
    }
}
{code}

  was:
Trying to read the outlines failed for the attached document.
{code:java}
import java.io.IOException;
import org.apache.pdfbox.pdmodel.PDDocument;
import org.apache.pdfbox.pdmodel.interactive.documentnavigation.destination.PDDestination;
import org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineItem;
import org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineNode;

/**
 * @author <a href="mailto:andre.kisch...@interface-projects.de">André Kischkel</a>
 * @since 22.01.2014
 * @version $Revision$
 */
public class TestPDDestination {
    public static void main(String[] args) throws IOException {
        PDDocument doc = PDDocument.load("Speisepläne.pdf");
        traverse(doc.getDocumentCatalog().getDocumentOutline());
        doc.close();
    }

    static void traverse(PDOutlineNode node) throws IOException {
        if (node instanceof PDOutlineItem) {
            PDDestination dst = ((PDOutlineItem) node).getDestination();
            /**
             * throws java.lang.ClassCastException: org.apache.pdfbox.cos.COSFloat cannot be cast to org.apache.pdfbox.cos.COSName,
             * but should be something like a PDPageXYZDestination!
             */
            System.out.println(dst);
        }
        for (PDOutlineItem child = node.getFirstChild(); child != null; child = child.getNextSibling()) {
            traverse(child);
        }
    }
}
{code}

> ClassCastException for unknown destination type
> ---
>
>                 Key: PDFBOX-1859
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1859
>             Project: PDFBox
>          Issue Type: Bug
>          Components: PDModel
>    Affects Versions: 1.8.3, 2.0.0
>            Reporter: Hendrik Lescak
>         Attachments: Speisepläne.pdf
>
>
> Trying to read the outlines failed for the attached document.
[jira] [Updated] (PDFBOX-1859) ClassCastException for unknown destination type
[ https://issues.apache.org/jira/browse/PDFBOX-1859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hendrik Lescak updated PDFBOX-1859:
---
    Description:
Trying to read the outlines failed for the attached document.
{code}
import java.io.IOException;
import org.apache.pdfbox.pdmodel.PDDocument;
import org.apache.pdfbox.pdmodel.interactive.documentnavigation.destination.PDDestination;
import org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineItem;
import org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineNode;

public class TestPDDestination {
    public static void main(String[] args) throws IOException {
        PDDocument doc = PDDocument.load("Speisepläne.pdf");
        traverse(doc.getDocumentCatalog().getDocumentOutline());
        doc.close();
    }

    static void traverse(PDOutlineNode node) throws IOException {
        if (node instanceof PDOutlineItem) {
            PDDestination dst = ((PDOutlineItem) node).getDestination();
            /**
             * throws java.lang.ClassCastException: org.apache.pdfbox.cos.COSFloat cannot be cast to org.apache.pdfbox.cos.COSName,
             * but should be something like a PDPageXYZDestination!
             */
            System.out.println(dst);
        }
        for (PDOutlineItem child = node.getFirstChild(); child != null; child = child.getNextSibling()) {
            traverse(child);
        }
    }
}
{code}

  was:
Trying to read the outlines failed for the attached document.
{code}
import java.io.IOException;
import org.apache.pdfbox.pdmodel.PDDocument;
import org.apache.pdfbox.pdmodel.interactive.documentnavigation.destination.PDDestination;
import org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineItem;
import org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineNode;

/**
 * @author <a href="mailto:andre.kisch...@interface-projects.de">André Kischkel</a>
 * @since 22.01.2014
 * @version $Revision$
 */
public class TestPDDestination {
    public static void main(String[] args) throws IOException {
        PDDocument doc = PDDocument.load("Speisepläne.pdf");
        traverse(doc.getDocumentCatalog().getDocumentOutline());
        doc.close();
    }

    static void traverse(PDOutlineNode node) throws IOException {
        if (node instanceof PDOutlineItem) {
            PDDestination dst = ((PDOutlineItem) node).getDestination();
            /**
             * throws java.lang.ClassCastException: org.apache.pdfbox.cos.COSFloat cannot be cast to org.apache.pdfbox.cos.COSName,
             * but should be something like a PDPageXYZDestination!
             */
            System.out.println(dst);
        }
        for (PDOutlineItem child = node.getFirstChild(); child != null; child = child.getNextSibling()) {
            traverse(child);
        }
    }
}
{code}

> ClassCastException for unknown destination type
> ---
>
>                 Key: PDFBOX-1859
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1859
>             Project: PDFBox
>          Issue Type: Bug
>          Components: PDModel
>    Affects Versions: 1.8.3, 2.0.0
>            Reporter: Hendrik Lescak
>         Attachments: Speisepläne.pdf
>
>
> Trying to read the outlines failed for the attached document.
[jira] [Created] (PDFBOX-1859) ClassCastException for unknown destination type
Hendrik Lescak created PDFBOX-1859:
--
             Summary: ClassCastException for unknown destination type
                 Key: PDFBOX-1859
                 URL: https://issues.apache.org/jira/browse/PDFBOX-1859
             Project: PDFBox
          Issue Type: Bug
          Components: PDModel
    Affects Versions: 1.8.3, 2.0.0
            Reporter: Hendrik Lescak


Trying to read the outlines failed for the attached document.
{code:java}
import java.io.IOException;
import org.apache.pdfbox.pdmodel.PDDocument;
import org.apache.pdfbox.pdmodel.interactive.documentnavigation.destination.PDDestination;
import org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineItem;
import org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineNode;

/**
 * @author <a href="mailto:andre.kisch...@interface-projects.de">André Kischkel</a>
 * @since 22.01.2014
 * @version $Revision$
 */
public class TestPDDestination {
    public static void main(String[] args) throws IOException {
        PDDocument doc = PDDocument.load("Speisepläne.pdf");
        traverse(doc.getDocumentCatalog().getDocumentOutline());
        doc.close();
    }

    static void traverse(PDOutlineNode node) throws IOException {
        if (node instanceof PDOutlineItem) {
            PDDestination dst = ((PDOutlineItem) node).getDestination();
            /**
             * throws java.lang.ClassCastException: org.apache.pdfbox.cos.COSFloat cannot be cast to org.apache.pdfbox.cos.COSName,
             * but should be something like a PDPageXYZDestination!
             */
            System.out.println(dst);
        }
        for (PDOutlineItem child = node.getFirstChild(); child != null; child = child.getNextSibling()) {
            traverse(child);
        }
    }
}
{code}
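A note on the exception reported above. In the PDF format, an explicit destination is an array of the form [page /XYZ left top zoom], with the destination type name in the second position. One possible reading of "COSFloat cannot be cast to COSName" is that the array in the attached document holds a number where the type name is expected; this is an assumption and is not confirmed by the report. The sketch below only illustrates checking the element type before casting. It uses plain Java objects in place of the PDFBox COS classes so that it runs without the library, and DestinationTypeCheck and typeNameOf are hypothetical names, not PDFBox API.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for the reported failure: Strings play the role of
// COSName and Floats the role of COSFloat. This is not PDFBox code.
class DestinationTypeCheck {

    /**
     * Returns the destination type name stored at index 1 of a destination
     * array, or null if that element is missing or is not a name.
     */
    static String typeNameOf(List<Object> destinationArray) {
        if (destinationArray.size() < 2) {
            return null; // too short to carry a type name
        }
        Object second = destinationArray.get(1);
        // Check the type first instead of casting unconditionally.
        return (second instanceof String) ? (String) second : null;
    }

    public static void main(String[] args) {
        // Well-formed: [page /XYZ left top zoom]
        List<Object> wellFormed = Arrays.<Object>asList("page", "XYZ", 0f, 792f, 0f);
        // Assumed malformed shape: a number where the type name belongs
        List<Object> malformed = Arrays.<Object>asList("page", 792f, 0f);

        System.out.println(typeNameOf(wellFormed));
        System.out.println(typeNameOf(malformed));
    }
}
```

With a check of this kind a caller can decide how to treat an unrecognized destination instead of failing with a ClassCastException; which fallback is appropriate is left open here.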
[jira] Issue Comment Edited: (PDFBOX-855) Extracted Text of MS Word generated PDFs corrupt
[ https://issues.apache.org/jira/browse/PDFBOX-855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12919311#action_12919311 ]

Hendrik Lescak edited comment on PDFBOX-855 at 10/8/10 11:41 AM:
-

The problem occurred in a Word document containing an embedded "VISIO-Express Drawing-Object". And there the text was not recognized correctly as it was before your patch. Maybe this issue is out of scope.
Unfortunately I cannot post the example file; it is from a customer project.

> Extracted Text of MS Word generated PDFs corrupt
> ---
>
>                 Key: PDFBOX-855
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-855
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 1.3.0
>         Environment: All
>            Reporter: Hendrik Lescak
>
> Since Revision 1003195 (PDFBOX-828: fixed some issues with positioning when
> extracting or rendering text) the text extraction with PDFTextStripper
> behaves differently for PDF documents generated with the MS Office Word 2007
> "Save as PDF" Feature.
> For example: The Term "Fachbereichsleiter" changed to "F a c hb e re ic hsle ite r" after PDFBOX-828.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Created: (PDFBOX-855) Extracted Text of MS Word generated PDFs corrupt
Extracted Text of MS Word generated PDFs corrupt
---

                 Key: PDFBOX-855
                 URL: https://issues.apache.org/jira/browse/PDFBOX-855
             Project: PDFBox
          Issue Type: Bug
          Components: Text extraction
    Affects Versions: 1.3.0
         Environment: All
            Reporter: Hendrik Lescak


Since Revision 1003195 (PDFBOX-828: fixed some issues with positioning when extracting or rendering text) the text extraction with PDFTextStripper behaves differently for PDF documents generated with the MS Office Word 2007 "Save as PDF" Feature.

For example: The Term "Fachbereichsleiter" changed to "F a c hb e re ic hsle ite r" after PDFBOX-828.
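The symptom described above (the same letters, with spaces inserted between them) can be detected mechanically. The following is a minimal sketch, assuming the expected text is known in advance: it compares expected and extracted strings after removing all whitespace. It does not call PDFBox; SpacingRegressionCheck and hasSpuriousSpacing are hypothetical names introduced only for this illustration, and the extracted string is the one quoted in the report.

```java
// Hypothetical helper, not part of PDFBox: flags extracted text that differs
// from the expected text only by whitespace.
class SpacingRegressionCheck {

    /**
     * Returns true if the two strings are equal once all whitespace is
     * removed from both, but differ as given.
     */
    static boolean hasSpuriousSpacing(String expected, String extracted) {
        String expectedCompact = expected.replaceAll("\\s+", "");
        String extractedCompact = extracted.replaceAll("\\s+", "");
        return expectedCompact.equals(extractedCompact) && !expected.equals(extracted);
    }

    public static void main(String[] args) {
        String expected = "Fachbereichsleiter";
        String extracted = "F a c hb e re ic hsle ite r"; // value quoted in the report

        System.out.println(hasSpuriousSpacing(expected, extracted));
    }
}
```

A check like this only reports that spacing changed; it says nothing about why the extraction inserts the spaces.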