[jira] [Updated] (PDFBOX-1859) ClassCastException for unknown destination type

2014-01-22 Thread Hendrik Lescak (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-1859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hendrik Lescak updated PDFBOX-1859:
---

Attachment: Speisepläne.pdf

> ClassCastException for unknown destination type
> ---
>
> Key: PDFBOX-1859
> URL: https://issues.apache.org/jira/browse/PDFBOX-1859
> Project: PDFBox
>  Issue Type: Bug
>  Components: PDModel
>Affects Versions: 1.8.3, 2.0.0
>Reporter: Hendrik Lescak
> Attachments: Speisepläne.pdf
>
>
> Trying to read the outlines failed for the attached document.
> {code:java}
> import java.io.IOException;
> import org.apache.pdfbox.pdmodel.PDDocument;
> import 
> org.apache.pdfbox.pdmodel.interactive.documentnavigation.destination.PDDestination;
> import 
> org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineItem;
> import 
> org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineNode;
> /**
>  * @author mailto:andre.kisch...@interface-projects.de";>André 
> Kischkel
>  * @since 22.01.2014
>  * @version $Revision$
>  */
> public class TestPDDestination {
>   public static void main(String[] args) throws IOException {
>   PDDocument doc = PDDocument.load("Speisepläne.pdf");
>   traverse(doc.getDocumentCatalog().getDocumentOutline());
>   doc.close();
>   }
>   
>   static void traverse(PDOutlineNode node) throws IOException {
>   if (node instanceof PDOutlineItem) {
>   PDDestination dst = ((PDOutlineItem) 
> node).getDestination();
>   /**
>* throws java.lang.ClassCastException: 
> org.apache.pdfbox.cos.COSFloat cannot be cast to 
> org.apache.pdfbox.cos.COSName,
>* but should be something like a PDPageXYZDestination!
>*/
>   System.out.println(dst);
>   }
>   for (PDOutlineItem child = node.getFirstChild(); child != null; 
> child = child.getNextSibling()) {
>   traverse(child);
>   }
>   }
> }
> {code}



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (PDFBOX-1859) ClassCastException for unknown destination type

2014-01-22 Thread Hendrik Lescak (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-1859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hendrik Lescak updated PDFBOX-1859:
---

Description: 
Trying to read the outlines failed for the attached document.

{code}
import java.io.IOException;

import org.apache.pdfbox.pdmodel.PDDocument;
import 
org.apache.pdfbox.pdmodel.interactive.documentnavigation.destination.PDDestination;
import 
org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineItem;
import 
org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineNode;

/**
 * @author mailto:andre.kisch...@interface-projects.de";>André 
Kischkel
 * @since 22.01.2014
 * @version $Revision$
 */
public class TestPDDestination {

public static void main(String[] args) throws IOException {
PDDocument doc = PDDocument.load("Speisepläne.pdf");
traverse(doc.getDocumentCatalog().getDocumentOutline());
doc.close();
}

static void traverse(PDOutlineNode node) throws IOException {
if (node instanceof PDOutlineItem) {
PDDestination dst = ((PDOutlineItem) 
node).getDestination();
/**
 * throws java.lang.ClassCastException: 
org.apache.pdfbox.cos.COSFloat cannot be cast to org.apache.pdfbox.cos.COSName,
 * but should be something like a PDPageXYZDestination!
 */
System.out.println(dst);
}
for (PDOutlineItem child = node.getFirstChild(); child != null; 
child = child.getNextSibling()) {
traverse(child);
}
}
}
{code}

  was:
Trying to read the outlines failed for the attached document.

{code:java}
import java.io.IOException;

import org.apache.pdfbox.pdmodel.PDDocument;
import 
org.apache.pdfbox.pdmodel.interactive.documentnavigation.destination.PDDestination;
import 
org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineItem;
import 
org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineNode;

/**
 * @author mailto:andre.kisch...@interface-projects.de";>André 
Kischkel
 * @since 22.01.2014
 * @version $Revision$
 */
public class TestPDDestination {

public static void main(String[] args) throws IOException {
PDDocument doc = PDDocument.load("Speisepläne.pdf");
traverse(doc.getDocumentCatalog().getDocumentOutline());
doc.close();
}

static void traverse(PDOutlineNode node) throws IOException {
if (node instanceof PDOutlineItem) {
PDDestination dst = ((PDOutlineItem) 
node).getDestination();
/**
 * throws java.lang.ClassCastException: 
org.apache.pdfbox.cos.COSFloat cannot be cast to org.apache.pdfbox.cos.COSName,
 * but should be something like a PDPageXYZDestination!
 */
System.out.println(dst);
}
for (PDOutlineItem child = node.getFirstChild(); child != null; 
child = child.getNextSibling()) {
traverse(child);
}
}
}
{code}


> ClassCastException for unknown destination type
> ---
>
> Key: PDFBOX-1859
> URL: https://issues.apache.org/jira/browse/PDFBOX-1859
> Project: PDFBox
>  Issue Type: Bug
>  Components: PDModel
>Affects Versions: 1.8.3, 2.0.0
>Reporter: Hendrik Lescak
> Attachments: Speisepläne.pdf
>
>
> Trying to read the outlines failed for the attached document.
> {code}
> import java.io.IOException;
> import org.apache.pdfbox.pdmodel.PDDocument;
> import 
> org.apache.pdfbox.pdmodel.interactive.documentnavigation.destination.PDDestination;
> import 
> org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineItem;
> import 
> org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineNode;
> /**
>  * @author mailto:andre.kisch...@interface-projects.de";>André 
> Kischkel
>  * @since 22.01.2014
>  * @version $Revision$
>  */
> public class TestPDDestination {
>   public static void main(String[] args) throws IOException {
>   PDDocument doc = PDDocument.load("Speisepläne.pdf");
>   traverse(doc.getDocumentCatalog().getDocumentOutline());
>   doc.close();
>   }
>   
>   static void traverse(PDOutlineNode node) throws IOException {
>   if (node instanceof PDOutlineItem) {
>   PDDestination dst = ((PDOutlineItem) 
> node).getDestination();
>   /**
>* throws java.lang.ClassCastException: 
> org.apache.pdfbox.cos.C

[jira] [Updated] (PDFBOX-1859) ClassCastException for unknown destination type

2014-01-22 Thread Hendrik Lescak (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-1859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hendrik Lescak updated PDFBOX-1859:
---

Description: 
Trying to read the outlines failed for the attached document.

{code}
import java.io.IOException;

import org.apache.pdfbox.pdmodel.PDDocument;
import 
org.apache.pdfbox.pdmodel.interactive.documentnavigation.destination.PDDestination;
import 
org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineItem;
import 
org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineNode;

public class TestPDDestination {

public static void main(String[] args) throws IOException {
PDDocument doc = PDDocument.load("Speisepläne.pdf");
traverse(doc.getDocumentCatalog().getDocumentOutline());
doc.close();
}

static void traverse(PDOutlineNode node) throws IOException {
if (node instanceof PDOutlineItem) {
PDDestination dst = ((PDOutlineItem) 
node).getDestination();
/**
 * throws java.lang.ClassCastException: 
org.apache.pdfbox.cos.COSFloat cannot be cast to org.apache.pdfbox.cos.COSName,
 * but should be something like a PDPageXYZDestination!
 */
System.out.println(dst);
}
for (PDOutlineItem child = node.getFirstChild(); child != null; 
child = child.getNextSibling()) {
traverse(child);
}
}
}
{code}

  was:
Trying to read the outlines failed for the attached document.

{code}
import java.io.IOException;

import org.apache.pdfbox.pdmodel.PDDocument;
import 
org.apache.pdfbox.pdmodel.interactive.documentnavigation.destination.PDDestination;
import 
org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineItem;
import 
org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineNode;

/**
 * @author mailto:andre.kisch...@interface-projects.de";>André 
Kischkel
 * @since 22.01.2014
 * @version $Revision$
 */
public class TestPDDestination {

public static void main(String[] args) throws IOException {
PDDocument doc = PDDocument.load("Speisepläne.pdf");
traverse(doc.getDocumentCatalog().getDocumentOutline());
doc.close();
}

static void traverse(PDOutlineNode node) throws IOException {
if (node instanceof PDOutlineItem) {
PDDestination dst = ((PDOutlineItem) 
node).getDestination();
/**
 * throws java.lang.ClassCastException: 
org.apache.pdfbox.cos.COSFloat cannot be cast to org.apache.pdfbox.cos.COSName,
 * but should be something like a PDPageXYZDestination!
 */
System.out.println(dst);
}
for (PDOutlineItem child = node.getFirstChild(); child != null; 
child = child.getNextSibling()) {
traverse(child);
}
}
}
{code}


> ClassCastException for unknown destination type
> ---
>
> Key: PDFBOX-1859
> URL: https://issues.apache.org/jira/browse/PDFBOX-1859
> Project: PDFBox
>  Issue Type: Bug
>  Components: PDModel
>Affects Versions: 1.8.3, 2.0.0
>Reporter: Hendrik Lescak
> Attachments: Speisepläne.pdf
>
>
> Trying to read the outlines failed for the attached document.
> {code}
> import java.io.IOException;
> import org.apache.pdfbox.pdmodel.PDDocument;
> import 
> org.apache.pdfbox.pdmodel.interactive.documentnavigation.destination.PDDestination;
> import 
> org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineItem;
> import 
> org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineNode;
> public class TestPDDestination {
>   public static void main(String[] args) throws IOException {
>   PDDocument doc = PDDocument.load("Speisepläne.pdf");
>   traverse(doc.getDocumentCatalog().getDocumentOutline());
>   doc.close();
>   }
>   
>   static void traverse(PDOutlineNode node) throws IOException {
>   if (node instanceof PDOutlineItem) {
>   PDDestination dst = ((PDOutlineItem) 
> node).getDestination();
>   /**
>* throws java.lang.ClassCastException: 
> org.apache.pdfbox.cos.COSFloat cannot be cast to 
> org.apache.pdfbox.cos.COSName,
>* but should be something like a PDPageXYZDestination!
>*/
>   System.out.println(dst);
>   }
>   for (PDOutlineIt

[jira] [Created] (PDFBOX-1859) ClassCastException for unknown destination type

2014-01-22 Thread Hendrik Lescak (JIRA)
Hendrik Lescak created PDFBOX-1859:
--

 Summary: ClassCastException for unknown destination type
 Key: PDFBOX-1859
 URL: https://issues.apache.org/jira/browse/PDFBOX-1859
 Project: PDFBox
  Issue Type: Bug
  Components: PDModel
Affects Versions: 1.8.3, 2.0.0
Reporter: Hendrik Lescak


Trying to read the outlines failed for the attached document.

{code:java}
import java.io.IOException;

import org.apache.pdfbox.pdmodel.PDDocument;
import 
org.apache.pdfbox.pdmodel.interactive.documentnavigation.destination.PDDestination;
import 
org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineItem;
import 
org.apache.pdfbox.pdmodel.interactive.documentnavigation.outline.PDOutlineNode;

/**
 * @author mailto:andre.kisch...@interface-projects.de";>André 
Kischkel
 * @since 22.01.2014
 * @version $Revision$
 */
public class TestPDDestination {

public static void main(String[] args) throws IOException {
PDDocument doc = PDDocument.load("Speisepläne.pdf");
traverse(doc.getDocumentCatalog().getDocumentOutline());
doc.close();
}

static void traverse(PDOutlineNode node) throws IOException {
if (node instanceof PDOutlineItem) {
PDDestination dst = ((PDOutlineItem) 
node).getDestination();
/**
 * throws java.lang.ClassCastException: 
org.apache.pdfbox.cos.COSFloat cannot be cast to org.apache.pdfbox.cos.COSName,
 * but should be something like a PDPageXYZDestination!
 */
System.out.println(dst);
}
for (PDOutlineItem child = node.getFirstChild(); child != null; 
child = child.getNextSibling()) {
traverse(child);
}
}
}
{code}



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] Issue Comment Edited: (PDFBOX-855) Extracted Text of MS Word generated PDFs corrupt

2010-10-08 Thread Hendrik Lescak (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12919311#action_12919311
 ] 

Hendrik Lescak edited comment on PDFBOX-855 at 10/8/10 11:41 AM:
-

The problem occurred in a Word-Document containing an embedded "VISIO-Express 
Drawing-Object". And there the text was not recognized correctly as it was 
before your patch. Maybe this issue is out of scope. Unfortunately I can not 
post the example file, it is from a customer project.

  
> Extracted Text of MS Word generated PDFs corrupt
> 
>
> Key: PDFBOX-855
> URL: https://issues.apache.org/jira/browse/PDFBOX-855
> Project: PDFBox
>  Issue Type: Bug
>  Components: Text extraction
>Affects Versions: 1.3.0
> Environment: All
>Reporter: Hendrik Lescak
>
> Since Revision 1003195 (PDFBOX-828: fixed some issues with positioning when 
> extracting or rendering text) the text extraction with PDFTextStripper 
> behaves differently for PDF documents generated with the MS Office Word 2007 
> "Save as PDF" Feature. 
> For example: The Term "Fachbereichsleiter" changed to "F a c hb e re ic hsle 
> ite r" after PDFBOX-828.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (PDFBOX-855) Extracted Text of MS Word generated PDFs corrupt

2010-10-07 Thread Hendrik Lescak (JIRA)
Extracted Text of MS Word generated PDFs corrupt


 Key: PDFBOX-855
 URL: https://issues.apache.org/jira/browse/PDFBOX-855
 Project: PDFBox
  Issue Type: Bug
  Components: Text extraction
Affects Versions: 1.3.0
 Environment: All
Reporter: Hendrik Lescak


Since Revision 1003195 (PDFBOX-828: fixed some issues with positioning when 
extracting or rendering text) the text extraction with PDFTextStripper behaves 
differently for PDF documents generated with the MS Office Word 2007 "Save as 
PDF" Feature. 

For example: The Term "Fachbereichsleiter" changed to "F a c hb e re ic hsle 
ite r" after PDFBOX-828.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.