[jira] [Updated] (PDFBOX-2069) PDF's with Tc before Tm are getting incorrect spacing in PDFTextArea

2014-05-10 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joel Hirsh updated PDFBOX-2069: --- Attachment: PDFBOX-2609.pdf PDF file that shows the problem PDF's with Tc before Tm are getting

[jira] [Commented] (PDFBOX-2069) PDF's with Tc before Tm are getting incorrect spacing in PDFTextArea

2014-05-13 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13994085#comment-13994085 ] Joel Hirsh commented on PDFBOX-2069: One more comment: Using setSpacingTolerance and

[jira] [Created] (PDFBOX-2069) PDF's with Tc before Tm are getting incorrect spacing in PDFTextArea

2014-05-15 Thread Joel Hirsh (JIRA)
Joel Hirsh created PDFBOX-2069: -- Summary: PDF's with Tc before Tm are getting incorrect spacing in PDFTextArea Key: PDFBOX-2069 URL: https://issues.apache.org/jira/browse/PDFBOX-2069 Project: PDFBox

[jira] [Updated] (PDFBOX-2069) PDF's with Tc before Tm are getting incorrect spacing in PDFTextArea

2014-05-16 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joel Hirsh updated PDFBOX-2069: --- Attachment: PDFBox-2609-patch.zip Patch that addresses this problem PDF's with Tc before Tm are

[jira] [Created] (PDFBOX-2158) ExtractText missing most of text in this PDF file, due to font bounding box with minus infinity

2014-06-22 Thread Joel Hirsh (JIRA)
Joel Hirsh created PDFBOX-2158: -- Summary: ExtractText missing most of text in this PDF file, due to font bounding box with minus infinity Key: PDFBOX-2158 URL: https://issues.apache.org/jira/browse/PDFBOX-2158

[jira] [Updated] (PDFBOX-2158) ExtractText missing most of text in this PDF file, due to font bounding box with minus infinity

2014-06-22 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joel Hirsh updated PDFBOX-2158: --- Attachment: negative.text.box.pdf File that exhibits this problem ExtractText missing most of

[jira] [Updated] (PDFBOX-2023) Text extraction gets nothing / zero font height

2014-07-17 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joel Hirsh updated PDFBOX-2023: --- Attachment: zero_height.pdf Snippet of a PDF with Type 3 text that is getting zero height when

[jira] [Commented] (PDFBOX-2023) Text extraction gets nothing / zero font height

2014-07-17 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064613#comment-14064613 ] Joel Hirsh commented on PDFBOX-2023: Have a similar problem in 1.8.6. It's on a Type 3

[jira] [Created] (PDFBOX-2463) ExtractTextByArea mangling second half of this string - transposed, skipped, etc

2014-10-29 Thread Joel Hirsh (JIRA)
Joel Hirsh created PDFBOX-2463: -- Summary: ExtractTextByArea mangling second half of this string - transposed, skipped, etc Key: PDFBOX-2463 URL: https://issues.apache.org/jira/browse/PDFBOX-2463

[jira] [Updated] (PDFBOX-2463) ExtractTextByArea mangling second half of this string - transposed, skipped, etc

2014-10-29 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joel Hirsh updated PDFBOX-2463: --- Attachment: mangled_text .pdf Snippet that shows problem ExtractTextByArea mangling second half of

[jira] [Updated] (PDFBOX-3063) Appears that getStrokingColor/getNonStrokingColor are broken

2015-10-27 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joel Hirsh updated PDFBOX-3063: --- Attachment: whitetext.pdf > Appears that getStrokingColor/getNonStrokingColor are broken >

[jira] [Created] (PDFBOX-3063) Appears that getStrokingColor/getNonStrokingColor are broken

2015-10-27 Thread Joel Hirsh (JIRA)
Joel Hirsh created PDFBOX-3063: -- Summary: Appears that getStrokingColor/getNonStrokingColor are broken Key: PDFBOX-3063 URL: https://issues.apache.org/jira/browse/PDFBOX-3063 Project: PDFBox

[jira] [Commented] (PDFBOX-3067) Text strings being returned as single characters, regression from version 1.8

2015-10-29 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14980882#comment-14980882 ] Joel Hirsh commented on PDFBOX-3067: Ok, I think we have been confusing two different things. And I

[jira] [Commented] (PDFBOX-3067) Text strings being returned as single characters, regression from version 1.8

2015-10-29 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14980721#comment-14980721 ] Joel Hirsh commented on PDFBOX-3067: I see that 3067

[jira] [Commented] (PDFBOX-3067) Text strings being returned as single characters, regression from version 1.8

2015-10-29 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14980761#comment-14980761 ] Joel Hirsh commented on PDFBOX-3067: Yes. In the latest, what I get for strings is -

[jira] [Commented] (PDFBOX-3067) Text strings being returned as single characters, regression from version 1.8

2015-10-29 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14981289#comment-14981289 ] Joel Hirsh commented on PDFBOX-3067: Well I downloaded the pdfbox source and built with that, and

[jira] [Commented] (PDFBOX-3063) Appears that getStrokingColor/getNonStrokingColor are broken

2015-10-27 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14976881#comment-14976881 ] Joel Hirsh commented on PDFBOX-3063: Adding all the operators got me back to the results that I was

[jira] [Created] (PDFBOX-3066) Text getting garbled in this file, was Ok in 1.8

2015-10-27 Thread Joel Hirsh (JIRA)
Joel Hirsh created PDFBOX-3066: -- Summary: Text getting garbled in this file, was Ok in 1.8 Key: PDFBOX-3066 URL: https://issues.apache.org/jira/browse/PDFBOX-3066 Project: PDFBox Issue Type:

[jira] [Updated] (PDFBOX-3066) Text getting garbled in this file, was Ok in 1.8

2015-10-27 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joel Hirsh updated PDFBOX-3066: --- Attachment: garbled.pdf > Text getting garbled in this file, was Ok in 1.8 >

[jira] [Updated] (PDFBOX-3063) Appears that getStrokingColor/getNonStrokingColor are broken

2015-10-27 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joel Hirsh updated PDFBOX-3063: --- Attachment: whiteonwhitetext.pdf I had two files with the same symptom, looks like different causes.

[jira] [Updated] (PDFBOX-3067) Text strings being returned as single characters, regression from version 1.8

2015-10-27 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joel Hirsh updated PDFBOX-3067: --- Attachment: singlecharacters.pdf > Text strings being returned as single characters, regression from

[jira] [Created] (PDFBOX-3067) Text strings being returned as single characters, regression from version 1.8

2015-10-27 Thread Joel Hirsh (JIRA)
Joel Hirsh created PDFBOX-3067: -- Summary: Text strings being returned as single characters, regression from version 1.8 Key: PDFBOX-3067 URL: https://issues.apache.org/jira/browse/PDFBOX-3067 Project:

[jira] [Reopened] (PDFBOX-3063) Appears that getStrokingColor/getNonStrokingColor are broken

2015-10-27 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joel Hirsh reopened PDFBOX-3063: > Appears that getStrokingColor/getNonStrokingColor are broken >

[jira] [Updated] (PDFBOX-3078) Text height coming in at half size, regression from 1.8

2015-11-01 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joel Hirsh updated PDFBOX-3078: --- Attachment: wrongsize.pdf > Text height coming in at half size, regression from 1.8 >

[jira] [Created] (PDFBOX-3078) Text height coming in at half size, regression from 1.8

2015-11-01 Thread Joel Hirsh (JIRA)
Joel Hirsh created PDFBOX-3078: -- Summary: Text height coming in at half size, regression from 1.8 Key: PDFBOX-3078 URL: https://issues.apache.org/jira/browse/PDFBOX-3078 Project: PDFBox Issue

[jira] [Updated] (PDFBOX-3076) Type3 Font that is getting zero height text, even in latest 2.0

2015-11-01 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joel Hirsh updated PDFBOX-3076: --- Attachment: type3 zero font size.pdf > Type3 Font that is getting zero height text, even in latest

[jira] [Commented] (PDFBOX-3076) Type3 Font that is getting zero height text, even in latest 2.0

2015-11-01 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14984459#comment-14984459 ] Joel Hirsh commented on PDFBOX-3076: I have created a workaround for this based on the width. I can

[jira] [Comment Edited] (PDFBOX-3076) Type3 Font that is getting zero height text, even in latest 2.0

2015-11-01 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14984459#comment-14984459 ] Joel Hirsh edited comment on PDFBOX-3076 at 11/1/15 5:36 PM: - I have created

[jira] [Commented] (PDFBOX-3078) Text height coming in at half size, regression from 1.8

2015-11-01 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14984470#comment-14984470 ] Joel Hirsh commented on PDFBOX-3078: I put a check in my application to correct for this, and then

[jira] [Updated] (PDFBOX-3066) Text getting garbled in this file, was Ok in 1.8

2015-11-01 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joel Hirsh updated PDFBOX-3066: --- Attachment: (was: type3 zero font size.pdf) > Text getting garbled in this file, was Ok in 1.8 >

[jira] [Updated] (PDFBOX-3066) Text getting garbled in this file, was Ok in 1.8

2015-11-01 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joel Hirsh updated PDFBOX-3066: --- Attachment: (was: very big width of space.pdf) > Text getting garbled in this file, was Ok in

[jira] [Updated] (PDFBOX-3066) Text getting garbled in this file, was Ok in 1.8

2015-11-01 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joel Hirsh updated PDFBOX-3066: --- Attachment: (was: big width of space.pdf) > Text getting garbled in this file, was Ok in 1.8 >

[jira] [Updated] (PDFBOX-3066) Text getting garbled in this file, was Ok in 1.8

2015-10-31 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joel Hirsh updated PDFBOX-3066: --- Attachment: very big width of space.pdf big width of space.pdf > Text getting

[jira] [Created] (PDFBOX-3077) Type 3 Fonts getting incorrect values for widthofspace

2015-10-31 Thread Joel Hirsh (JIRA)
Joel Hirsh created PDFBOX-3077: -- Summary: Type 3 Fonts getting incorrect values for widthofspace Key: PDFBOX-3077 URL: https://issues.apache.org/jira/browse/PDFBOX-3077 Project: PDFBox Issue

[jira] [Updated] (PDFBOX-3066) Text getting garbled in this file, was Ok in 1.8

2015-10-31 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joel Hirsh updated PDFBOX-3066: --- Attachment: type3 zero font size.pdf > Text getting garbled in this file, was Ok in 1.8 >

[jira] [Created] (PDFBOX-3076) Type3 Font that is getting zero height text, even in latest 2.0

2015-10-31 Thread Joel Hirsh (JIRA)
Joel Hirsh created PDFBOX-3076: -- Summary: Type3 Font that is getting zero height text, even in latest 2.0 Key: PDFBOX-3076 URL: https://issues.apache.org/jira/browse/PDFBOX-3076 Project: PDFBox

[jira] [Created] (PDFBOX-3662) Regression on this file as a result of PDFBOX-3446 fix

2017-01-25 Thread Joel Hirsh (JIRA)
Joel Hirsh created PDFBOX-3662: -- Summary: Regression on this file as a result of PDFBOX-3446 fix Key: PDFBOX-3662 URL: https://issues.apache.org/jira/browse/PDFBOX-3662 Project: PDFBox Issue

[jira] [Commented] (PDFBOX-3662) Regression on this file as a result of PDFBOX-3446 fix

2017-01-25 Thread Joel Hirsh (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15838664#comment-15838664 ] Joel Hirsh commented on PDFBOX-3662: Great, I will try against the trunk. On Wed, Jan 25, 2017 at