Jenkins build is back to normal : PDFBox-trunk #3313

2017-03-25 Thread Apache Jenkins Server
See 


-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



Jenkins build is back to normal : PDFBox-trunk » Apache FontBox #3313

2017-03-25 Thread Apache Jenkins Server
See 



-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



Build failed in Jenkins: PDFBox-trunk » Apache FontBox #3312

2017-03-25 Thread Apache Jenkins Server
See 


Changes:

[lehmi] 2.10-SNAPSHOT -> 3.0.0-SNAPSHOT

--
[...truncated 19.13 KB...]
[INFO] --- maven-source-plugin:2.3:jar (attach-sources) @ fontbox ---
[INFO] Building jar: 

[WARNING] Artifact org.apache.pdfbox:fontbox:java-source:sources:3.0.0-SNAPSHOT 
already attached to project, ignoring duplicate
[WARNING] Failed to getClass for org.apache.felix.bundleplugin.BundlePlugin
[INFO] 
[INFO] --- maven-bundle-plugin:2.5.4:bundle (default-bundle) @ fontbox ---
[INFO] 
[INFO] --- maven-site-plugin:3.4:attach-descriptor (attach-descriptor) @ 
fontbox ---
[WARNING] Failed to getClass for org.apache.maven.plugin.source.SourceJarMojo
[INFO] 
[INFO] --- maven-source-plugin:2.3:jar (attach-sources) @ fontbox ---
[WARNING] Artifact org.apache.pdfbox:fontbox:java-source:sources:3.0.0-SNAPSHOT 
already attached to project, ignoring duplicate
[INFO] 
[INFO] --- apache-rat-plugin:0.11:check (default) @ fontbox ---
[INFO] 51 implicit excludes (use -debug for more details).
[INFO] Exclude: src/main/resources/org/apache/fontbox/cmap/*
[INFO] 122 resources included (use -debug for more details)
Warning:  org.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser: Property 
'http://www.oracle.com/xml/jaxp/properties/entityExpansionLimit' is not 
recognized.
Compiler warnings:
  WARNING:  'org.apache.xerces.jaxp.SAXParserImpl: Property 
'http://javax.xml.XMLConstants/property/accessExternalDTD' is not recognized.'
Warning:  org.apache.xerces.parsers.SAXParser: Feature 
'http://javax.xml.XMLConstants/feature/secure-processing' is not recognized.
Warning:  org.apache.xerces.parsers.SAXParser: Property 
'http://javax.xml.XMLConstants/property/accessExternalDTD' is not recognized.
Warning:  org.apache.xerces.parsers.SAXParser: Property 
'http://www.oracle.com/xml/jaxp/properties/entityExpansionLimit' is not 
recognized.
[INFO] Rat check: Summary of files. Unapproved: 0 unknown: 0 generated: 0 
approved: 118 licence.
[INFO] 
[INFO] --- maven-install-plugin:2.5.2:install (default-install) @ fontbox ---
[INFO] Installing 

 to 
/home/jenkins/jenkins-slave/maven-repositories/0/org/apache/pdfbox/fontbox/3.0.0-SNAPSHOT/fontbox-3.0.0-SNAPSHOT.jar
[INFO] Installing 

 to 
/home/jenkins/jenkins-slave/maven-repositories/0/org/apache/pdfbox/fontbox/3.0.0-SNAPSHOT/fontbox-3.0.0-SNAPSHOT.pom
[INFO] Installing 

 to 
/home/jenkins/jenkins-slave/maven-repositories/0/org/apache/pdfbox/fontbox/3.0.0-SNAPSHOT/fontbox-3.0.0-SNAPSHOT-sources.jar
[WARNING] Failed to getClass for org.apache.felix.obrplugin.ObrInstall
[INFO] 
[INFO] --- maven-bundle-plugin:2.5.4:install (default-install) @ fontbox ---
[INFO] Installing 
org/apache/pdfbox/fontbox/3.0.0-SNAPSHOT/fontbox-3.0.0-SNAPSHOT.jar
[INFO] Writing OBR metadata
[INFO] 
[INFO] --- maven-deploy-plugin:2.8.2:deploy (default-deploy) @ fontbox ---
Downloading: 
https://repository.apache.org/content/repositories/snapshots/org/apache/pdfbox/fontbox/3.0.0-SNAPSHOT/maven-metadata.xml
Uploading: 
https://repository.apache.org/content/repositories/snapshots/org/apache/pdfbox/fontbox/3.0.0-SNAPSHOT/fontbox-3.0.0-20170325.185954-1.jar
Uploaded: 
https://repository.apache.org/content/repositories/snapshots/org/apache/pdfbox/fontbox/3.0.0-SNAPSHOT/fontbox-3.0.0-20170325.185954-1.jar
 (1445 KB at 961.9 KB/sec)
Uploading: 
https://repository.apache.org/content/repositories/snapshots/org/apache/pdfbox/fontbox/3.0.0-SNAPSHOT/fontbox-3.0.0-20170325.185954-1.pom
Uploaded: 
https://repository.apache.org/content/repositories/snapshots/org/apache/pdfbox/fontbox/3.0.0-SNAPSHOT/fontbox-3.0.0-20170325.185954-1.pom
 (3 KB at 2.9 KB/sec)
Downloading: 
https://repository.apache.org/content/repositories/snapshots/org/apache/pdfbox/fontbox/maven-metadata.xml
Downloaded: 
https://repository.apache.org/content/repositories/snapshots/org/apache/pdfbox/fontbox/maven-metadata.xml
 (425 B at 0.8 KB/sec)
Uploading: 
https://repository.apache.org/content/repositories/snapshots/org/apache/pdfbox/fontbox/3.0.0-SNAPSHOT/maven-metadata.xml
Uploaded: 
https://repository.apache.org/content/repositories/snapshots/org/apache/pdfbox/fontbox/3.0.0-SNAPSHOT/maven-metadata.xml
 (774 B at 0.9 KB/sec)
Uploading: 
https://repository.apache.org/content/repositories/snapshots/org/apache/pdfbox/fontbox/maven-metadata.xml
Uploaded: 
https://repository.apache.org/content/repositories/snapshots/org/apache/pdfbox/fontbox/maven-metadata.xml
 (465 B at 0.6 KB/sec)
Uploading: 

Build failed in Jenkins: PDFBox-trunk #3312

2017-03-25 Thread Apache Jenkins Server
See 


Changes:

[lehmi] PDFBOX-3735: updated dependencies bouncy castle, ant and commons-io

[lehmi] 2.10-SNAPSHOT -> 3.0.0-SNAPSHOT

--
[...truncated 35.45 KB...]
[INFO] 
[INFO] --- maven-remote-resources-plugin:1.5:process (default) @ fontbox ---
[INFO] 
[INFO] --- maven-resources-plugin:2.6:resources (default-resources) @ fontbox 
---
[INFO] Using 'UTF-8' encoding to copy filtered resources.
[INFO] Copying 89 resources
[INFO] Copying 3 resources
[INFO] Copying 3 resources
[INFO] 
[INFO] --- maven-compiler-plugin:3.1:compile (default-compile) @ fontbox ---
[INFO] Nothing to compile - all classes are up to date
[INFO] 
[INFO] --- maven-resources-plugin:2.6:testResources (default-testResources) @ 
fontbox ---
[INFO] Using 'UTF-8' encoding to copy filtered resources.
[INFO] Copying 3 resources
[INFO] Copying 3 resources
[INFO] Copying 3 resources
[INFO] 
[INFO] --- maven-compiler-plugin:3.1:testCompile (default-testCompile) @ 
fontbox ---
[INFO] Nothing to compile - all classes are up to date
[INFO] 
[INFO] --- maven-surefire-plugin:2.17:test (default-test) @ fontbox ---
[INFO] Skipping execution of surefire because it has already been run for this 
configuration
[JENKINS] Recording test results[INFO] 
[INFO] --- animal-sniffer-maven-plugin:1.14:check (check-java-version) @ 
fontbox ---
[INFO] Checking unresolved references to org.codehaus.mojo.signature:java16:1.0

[INFO] 
[INFO] --- maven-remote-resources-plugin:1.5:process (default) @ fontbox ---
[INFO] 
[INFO] --- maven-resources-plugin:2.6:resources (default-resources) @ fontbox 
---
[INFO] Using 'UTF-8' encoding to copy filtered resources.
[INFO] Copying 89 resources
[INFO] Copying 3 resources
[INFO] Copying 3 resources
[INFO] Copying 3 resources
[INFO] 
[INFO] --- maven-compiler-plugin:3.1:compile (default-compile) @ fontbox ---
[INFO] Nothing to compile - all classes are up to date
[INFO] 
[INFO] --- maven-resources-plugin:2.6:testResources (default-testResources) @ 
fontbox ---
[INFO] Using 'UTF-8' encoding to copy filtered resources.
[INFO] Copying 3 resources
[INFO] Copying 3 resources
[INFO] Copying 3 resources
[INFO] Copying 3 resources
[INFO] 
[INFO] --- maven-compiler-plugin:3.1:testCompile (default-testCompile) @ 
fontbox ---
[INFO] Nothing to compile - all classes are up to date
[INFO] 
[INFO] --- maven-surefire-plugin:2.17:test (default-test) @ fontbox ---
[INFO] Skipping execution of surefire because it has already been run for this 
configuration
[JENKINS] Recording test results[INFO] 
[INFO] --- animal-sniffer-maven-plugin:1.14:check (check-java-version) @ 
fontbox ---
[INFO] Checking unresolved references to org.codehaus.mojo.signature:java16:1.0

[INFO] 
[INFO] --- maven-site-plugin:3.4:attach-descriptor (attach-descriptor) @ 
fontbox ---
[WARNING] Failed to getClass for org.apache.maven.plugin.source.SourceJarMojo
[INFO] 
[INFO] --- maven-source-plugin:2.3:jar (attach-sources) @ fontbox ---
[INFO] Building jar: 

[WARNING] Failed to getClass for org.apache.felix.bundleplugin.BundlePlugin
[INFO] 
[INFO] --- maven-bundle-plugin:2.5.4:bundle (default-bundle) @ fontbox ---
[INFO] 
[INFO] --- maven-site-plugin:3.4:attach-descriptor (attach-descriptor) @ 
fontbox ---
[WARNING] Failed to getClass for org.apache.maven.plugin.source.SourceJarMojo
[INFO] 
[INFO] --- maven-source-plugin:2.3:jar (attach-sources) @ fontbox ---
[INFO] 
[INFO] --- apache-rat-plugin:0.11:check (default) @ fontbox ---
[INFO] 51 implicit excludes (use -debug for more details).
[INFO] Exclude: src/main/resources/org/apache/fontbox/cmap/*
[INFO] 122 resources included (use -debug for more details)
Warning:  org.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser: Property 
'http://www.oracle.com/xml/jaxp/properties/entityExpansionLimit' is not 
recognized.
Compiler warnings:
  WARNING:  'org.apache.xerces.jaxp.SAXParserImpl: Property 
'http://javax.xml.XMLConstants/property/accessExternalDTD' is not recognized.'
Warning:  org.apache.xerces.parsers.SAXParser: Feature 
'http://javax.xml.XMLConstants/feature/secure-processing' is not recognized.
Warning:  org.apache.xerces.parsers.SAXParser: Property 
'http://javax.xml.XMLConstants/property/accessExternalDTD' is not recognized.
Warning:  org.apache.xerces.parsers.SAXParser: Property 
'http://www.oracle.com/xml/jaxp/properties/entityExpansionLimit' is not 
recognized.
[INFO] Rat check: Summary of files. Unapproved: 0 unknown: 0 generated: 0 
approved: 118 licence.
[INFO] 
[INFO] --- maven-install-plugin:2.5.2:install (default-install) @ fontbox ---
[INFO] Installing 

 to 
/home/jenkins/jenkins-slave/maven-repositories/0/org/apache/pdfbox/fontbox/3.0.0-SNAPSHOT/fontbox-3.0.0-SNAPSHOT.jar
[INFO] Installing 

[jira] [Resolved] (PDFBOX-3735) Update to Bouncy Castle 1.56

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-3735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler resolved PDFBOX-3735.

Resolution: Fixed

I've update the following dependencies:
- Bouncy Castle 1.55 -> 1.56
- commons-io 2.45 -> 2.5
- ant 1.9.6 -> 1.9.9


> Update to Bouncy Castle 1.56
> 
>
> Key: PDFBOX-3735
> URL: https://issues.apache.org/jira/browse/PDFBOX-3735
> Project: PDFBox
>  Issue Type: Improvement
>Affects Versions: 3.0.0
>Reporter: Andreas Lehmkühler
>Assignee: Andreas Lehmkühler
> Fix For: 3.0.0
>
>
> We should update to Bouncy Castle 1.56



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-3735) Update to Bouncy Castle 1.56

2017-03-25 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15941837#comment-15941837
 ] 

ASF subversion and git services commented on PDFBOX-3735:
-

Commit 1788679 from [~lehmi] in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1788679 ]

PDFBOX-3735: updated dependencies bouncy castle, ant and commons-io

> Update to Bouncy Castle 1.56
> 
>
> Key: PDFBOX-3735
> URL: https://issues.apache.org/jira/browse/PDFBOX-3735
> Project: PDFBox
>  Issue Type: Improvement
>Affects Versions: 3.0.0
>Reporter: Andreas Lehmkühler
>Assignee: Andreas Lehmkühler
> Fix For: 3.0.0
>
>
> We should update to Bouncy Castle 1.56



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Created] (PDFBOX-3735) Update to Bouncy Castle 1.56

2017-03-25 Thread JIRA
Andreas Lehmkühler created PDFBOX-3735:
--

 Summary: Update to Bouncy Castle 1.56
 Key: PDFBOX-3735
 URL: https://issues.apache.org/jira/browse/PDFBOX-3735
 Project: PDFBox
  Issue Type: Improvement
Affects Versions: 3.0.0
Reporter: Andreas Lehmkühler
Assignee: Andreas Lehmkühler
 Fix For: 3.0.0


We should update to Bouncy Castle 1.56



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



How about switching the trunk to java 7?

2017-03-25 Thread Andreas Lehmkuehler


Any objections?

BR
Andreas

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



Re: Branch for new major version?

2017-03-25 Thread Andreas Lehmkuehler

Am 25.03.2017 um 19:01 schrieb Andreas Lehmkuehler:

Am 23.03.2017 um 07:50 schrieb Andreas Lehmkuehler:

Am 21.03.2017 um 19:45 schrieb Maruan Sahyoun:



Am 21.03.2017 um 19:17 schrieb Tilman Hausherr :

Am 21.03.2017 um 17:53 schrieb Andreas Lehmkuehler:

Hi,

I'm thinking about implementing some stuff which would most likely require a
new major version. Obviously there are some similar things which already
came up, e.g. remove some disturbing public constants, switch to java 7 to
use twelve monkeys lib.

The question is, how should we deal with that. I see a handful of possible
ways:

- switch the current trunk to 3.0 and omit 2.1 (for now there are 3 tickets
with "Fix version/s" 2.1 which didn't make it to the 2.0 branch for
different reasons)
- create a 2.1 branch based on the current trunk and switch the current
trunk to 3.0
- create a 3.0 branch and don't change anything else. Merge all changes to
the trunk once 2.1 was released some time in the future
- don't change anything officially, but create a "private" 3.0 branch and
merge those changes to a future 3.0 version


I'm for solution 1 because it's a PITA to care about many versions.



I'd go for 1 too - BR Maruan


Thanks for your input. Let's wait another couple of days maybe until the next
weekend before proceeding with any changes.

Ok, there isn't any other input.

I'm going to rename the trunk version from 2.1-SNAPSHOT to 3.0-SNAPSHOT and I'm
going to do the same with out JIRA tickets.

Done



BR
Andreas



BR
Andreas





Tilman


-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org




-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org




-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org




-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org




-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2532) Text extraction fails due to the usage of the internal font mapping

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2532:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Text extraction fails due to the usage of the internal font mapping
> ---
>
> Key: PDFBOX-2532
> URL: https://issues.apache.org/jira/browse/PDFBOX-2532
> Project: PDFBox
>  Issue Type: Bug
>  Components: Text extraction
>Affects Versions: 2.0.0
>Reporter: Andreas Lehmkühler
> Fix For: 3.0.0
>
> Attachments: PDFBOX2247-701542_cp_acrobat.txt, PDFBOX2247-701542.pdf, 
> PDFBOX2247-701542_sa_acrobat_osx.txt, PDFBOX2247-701542_sa_acrobat.txt, 
> PDFBOX2247-701542_sa_reader_osx.txt, PDFBOX2247-Debugger.png
>
>
> If a pdf doesn't provide any mapping (neither an encoding nor a toUnicode 
> mapping) we have to decide where to get a suitable mapping ourselves. We 
> can't use the internal font mapping of the type1C font as it doesn't work in 
> every case, see PDFBOX-2377 which provides a solution for the 1.8-branch.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2755) Support filling hybrid PDF forms

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2755:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Support filling hybrid PDF forms
> 
>
> Key: PDFBOX-2755
> URL: https://issues.apache.org/jira/browse/PDFBOX-2755
> Project: PDFBox
>  Issue Type: Improvement
>  Components: AcroForm
>Affects Versions: 1.8.9, 1.8.10, 2.0.0
>Reporter: hui xu
>Assignee: Maruan Sahyoun
> Fix For: 3.0.0
>
> Attachments: formtestFailed.pdf, formtestOK.pdf
>
>
> Ran SetField.java in package org.apache.pdfbox.examples.fdf, the field value 
> is changed but can not been saved in pdf file.
> I added 2 more lines code:
> List newList = new ArrayList();
> .
> acroForm.setFields(newList);
> The pdf was saved with the change. But reload the saved pdf file and try to 
> reset some values, it throws NullPointerException, the file can't getFields().



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2258) Speed up SampledImageReader#getStencilImage()

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2258:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Speed up SampledImageReader#getStencilImage()
> -
>
> Key: PDFBOX-2258
> URL: https://issues.apache.org/jira/browse/PDFBOX-2258
> Project: PDFBox
>  Issue Type: Wish
>  Components: Rendering
>Affects Versions: 2.0.0
>Reporter: simon steiner
> Fix For: 3.0.0
>
>
> PDF from PDFBOX-1437
> java -jar ~/pdfbox_th/app/target/pdfbox-app-2.0.0-SNAPSHOT.jar PDFToImage 
> AA.pdf
> org.apache.pdfbox.pdmodel.graphics.color.PDDeviceGray.toRGBImage()
> 31.6991311,472 ms (31.7%)   11,472 ms
> org.apache.pdfbox.pdmodel.graphics.image.SampledImageReader.getStencilImage() 
> 29.439924   10,654 ms (29.4%)   10,654 ms
> is there some alternative to
> for (int y = 0; y < height; y++)
> {
> for (int x = 0; x < width; x++)
> {
> op
> }
> }



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-3492) Remove classic signing and keep external signing only

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-3492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-3492:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Remove classic signing and keep external signing only
> -
>
> Key: PDFBOX-3492
> URL: https://issues.apache.org/jira/browse/PDFBOX-3492
> Project: PDFBox
>  Issue Type: Sub-task
>  Components: Signing
>Affects Versions: 2.1.0
>Reporter: Tilman Hausherr
> Fix For: 3.0.0
>
>
> to be done later, lets first wait for feedback on the external signing.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2661) Implement font fallback for AcroForms

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2661:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Implement font fallback for AcroForms
> -
>
> Key: PDFBOX-2661
> URL: https://issues.apache.org/jira/browse/PDFBOX-2661
> Project: PDFBox
>  Issue Type: Improvement
>  Components: AcroForm
>Affects Versions: 1.8.8, 2.0.0
>Reporter: Maruan Sahyoun
> Fix For: 3.0.0
>
> Attachments: field-filled.png, field.png, Fonts2-Filled.pdf, 
> fonts2-filled.png, Fonts2.pdf, Fonts.pdf, fonttables.txt, FontTest.java, 
> Franklin.png
>
>
> There are forms where the font specified in the fields default appearance is 
> not pointing to the correct fields or forms resources entry. Adobe 
> Reader/Acrobat have a (unspecified) fallback mechanism to resolve such 
> missing fonts.
> We should be ably to come up with a similar solution.
> A sample of such an issue can be found in PDFBOX-1234



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-1912) Optical Character Recognition (OCR)

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-1912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-1912:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Optical Character Recognition (OCR)
> ---
>
> Key: PDFBOX-1912
> URL: https://issues.apache.org/jira/browse/PDFBOX-1912
> Project: PDFBox
>  Issue Type: New Feature
>  Components: Text extraction
>Affects Versions: 2.0.0
> Environment: JDK 6, C/C++
>Reporter: John Hewson
>Assignee: John Hewson
>  Labels: gsoc2014
> Fix For: 3.0.0
>
>
> Brief explanation: The PDFBox library is widely used to extract text from PDF 
> files. However, many PDF files embed text in a malformed manner which renders 
> text extraction useless. There has recently been interest in extracting 
> governmental data from PDF files, the PDF Liberation commons being a notable 
> example, see https://github.com/pdfliberation for more details.
> Many end-users of PDFBox have been making use of OCR tools such as Google's 
> Tesseract https://code.google.com/p/tesseract-ocr/ which are run on the final 
> image generated by PDFBox. We think that by adding a more integrated OCR API 
> to PDFBox it will be possible to do a better job. PDFBox often has access to 
> encoding and positioning information for individual glyphs. Even when their 
> extracted text is meaningless, a character-by-character, or line-by-line OCR 
> could be more accurate. PDFBox also has information such as image orientation 
> which could allow it to better perform OCR on pages such as embedded 
> landscape tables.
> There are existing JNI bindings for Tesseract available at 
> https://code.google.com/p/tesseract-android-tools/
> Expected results: To extend PDF box with an API which allows external OCR 
> tools to be plugged-in, and an implementation of a Tesseract plug-in using 
> either JNI or the command line via Process.exec.
> Knowledge Prerequisite: Java, JNI (C/C++)
> Mentor: John Hewson
> PMC Note: Tesseract  is under the Apache License 2.0
> To learn more about PDFBox, please visit http://pdfbox.apache.org/



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-3512) PDFDebugger Mac App

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-3512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-3512:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> PDFDebugger Mac App
> ---
>
> Key: PDFBOX-3512
> URL: https://issues.apache.org/jira/browse/PDFBOX-3512
> Project: PDFBox
>  Issue Type: New Feature
>  Components: Utilities
> Environment: Mac OS X
>Reporter: John Hewson
>Assignee: John Hewson
>Priority: Minor
> Fix For: 2.0.6, 3.0.0
>
>
> Using the PDFDebugger on the Mac isn't a great experience (see PDFBOX-3507). 
> We should package the jar into a native Mac .app bundle.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2578) Fuzzy matching of base 14 fonts

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2578:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Fuzzy matching of base 14 fonts
> ---
>
> Key: PDFBOX-2578
> URL: https://issues.apache.org/jira/browse/PDFBOX-2578
> Project: PDFBox
>  Issue Type: Improvement
>  Components: Rendering
>Affects Versions: 2.0.0
>Reporter: John Hewson
>Assignee: John Hewson
> Fix For: 3.0.0
>
>
> The file from PDFBOX-2573 has a font "Helvetica~1d3" which Adobe Reader 
> recognises as a "base 14" Helvetica. We should implement a similarly 
> forgiving font-matching mechanism for cases where there is no FontDescriptor.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2019) Annotations produced by Annotation.java example are not rendered correctly

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2019:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Annotations produced by Annotation.java example are not rendered correctly
> --
>
> Key: PDFBOX-2019
> URL: https://issues.apache.org/jira/browse/PDFBOX-2019
> Project: PDFBox
>  Issue Type: Bug
>  Components: Rendering
>Affects Versions: 2.0.0
> Environment: Windows 7
> java version "1.7.0_04"
> Java(TM) SE Runtime Environment (build 1.7.0_04-b22)
> Java HotSpot(TM) 64-Bit Server VM (build 23.0-b21, mixed mode)
>Reporter: Stan Taylor
>  Labels: Annotations
> Fix For: 3.0.0
>
> Attachments: Annotation.pdf, 
> PDFBOX-2019-Annotations-with-Appearances.pdf
>
>
> The sample PDF produced by 
> /examples/src/main/java/org/apache/pdfbox/examples/pdmodel/Annotation.java 
> cannot be properly displayed by the pdfbox-app-2.0.0-SNAPSHOT's PDFReader 
> function -- the circle, square, and line annotations are not visible.
> Unknown if this is a problem with the Annotation.java example, the 
> pdfbox-app, or with the library itself.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2604) Generate a default appearance when creating an interactive form field

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2604:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Generate a default appearance when creating an interactive form field 
> --
>
> Key: PDFBOX-2604
> URL: https://issues.apache.org/jira/browse/PDFBOX-2604
> Project: PDFBox
>  Issue Type: Improvement
>  Components: AcroForm
>Affects Versions: 1.8.8, 2.0.0
>Reporter: Maruan Sahyoun
>Assignee: Maruan Sahyoun
> Fix For: 3.0.0
>
>
> When an interactive form field is generated a default appearance should be 
> generated to get a visual representation of the form field.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2685) COS 'primitive' types are mutable

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2685:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> COS 'primitive' types are mutable
> -
>
> Key: PDFBOX-2685
> URL: https://issues.apache.org/jira/browse/PDFBOX-2685
> Project: PDFBox
>  Issue Type: Bug
>Affects Versions: 1.8.8, 1.8.9, 2.0.0
>Reporter: Tilman Hausherr
>Assignee: Maruan Sahyoun
>Priority: Critical
> Fix For: 3.0.0
>
>
> From [~torakiki]:
> They are cached and they are mutable so this:
> {code}
> COSInteger cosint = COSInteger.get(10);
> cosint.setValue(50);
> //lets move to a totally unrelated part of my software
> System.out.println(COSInteger.get(10));
> {code}
> prints COSInteger(50) so basically I ask for a 10 and get a 50, isn't it 
> weird (and error prone)? ... or the way around, I call cosint.setValue(50) at 
> some point in the software and that makes it a 50 everywhere I used 
> COSInteger.get(10).
> That's of course also true for the other mutable attributes setDirect and 
> setNeedToBeUpdate.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-3683) Unexpected behavior when setting value for radio button with /Opts entry

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-3683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-3683:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Unexpected behavior when setting value for radio button with /Opts entry 
> -
>
> Key: PDFBOX-3683
> URL: https://issues.apache.org/jira/browse/PDFBOX-3683
> Project: PDFBox
>  Issue Type: Bug
>  Components: AcroForm
>Affects Versions: 2.0.4, 2.0.5, 2.1.0
>Reporter: Maruan Sahyoun
>Assignee: Maruan Sahyoun
> Fix For: 3.0.0
>
>
> Setting the value for a radio button the behavior is unexpected as when 
> setting the value the entries in {{/Opts}} need to be used is there is such 
> entry otherwise the names for the {{On}} values in the normal appearances 
> entries need to be used. When getting the value always the set names entry is 
> returned.
> Also see the discussion at PDFBOX-3656.
> The {{/Opts}} entries shall only be considered when exporting or importing 
> the AcroForm fields data.  



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2642) Unsupported: Type1 PFB fonts in CIDFontType0

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2642:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Unsupported: Type1 PFB fonts in CIDFontType0
> 
>
> Key: PDFBOX-2642
> URL: https://issues.apache.org/jira/browse/PDFBOX-2642
> Project: PDFBox
>  Issue Type: Bug
>  Components: PDModel
>Affects Versions: 2.0.0
>Reporter: Tilman Hausherr
> Fix For: 3.0.0
>
> Attachments: PDFBOX-2642-277053-p3.pdf
>
>
> {code}
> java.lang.NullPointerException
>   at 
> org.apache.pdfbox.pdmodel.font.PDCIDFontType0.getFontMatrix(PDCIDFontType0.java:169)
>   at 
> org.apache.pdfbox.pdmodel.font.PDCIDFontType0.(PDCIDFontType0.java:153)
>   at 
> org.apache.pdfbox.pdmodel.font.PDFontFactory.createDescendantFont(PDFontFactory.java:121)
>   at 
> org.apache.pdfbox.pdmodel.font.PDType0Font.(PDType0Font.java:95)
>   at 
> org.apache.pdfbox.pdmodel.font.PDFontFactory.createFont(PDFontFactory.java:83)
> {code}
> UPDATE: The PDF contains a CIDFontType0 which has a FontFile3 containing a 
> Type1 PFB font. This is invalid, but Type1C is permitted and the two are 
> equivalent, so it should be possible to handle this. However, the current 
> design of PD CIDFontType0 does not permit it.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2124) barcore scanner using pdfbox and zxing

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2124:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> barcore scanner using pdfbox and zxing
> --
>
> Key: PDFBOX-2124
> URL: https://issues.apache.org/jira/browse/PDFBOX-2124
> Project: PDFBox
>  Issue Type: New Feature
>  Components: Utilities
>Affects Versions: 2.0.0
> Environment: W8
>Reporter: David KELLER
>Priority: Minor
> Fix For: 3.0.0
>
> Attachments: mail_with_barcode.pdf, PdDocumentBarcodeScanner.java, 
> PdPageBarcodeScanner.java
>
>
> Dear Pdfbox,
> I have just finished a work to integrate pdfbox and zxing to extract 
> barcodes, and I wanted to give this source code to your fundation.
> Program do this :
> - extract all scanned images in a PDF, 
> - apply some homebrew image filters to retrieve areas of interest, 
> - rotate cropped areas and send them to zxing to find any barcode
> - aggregate all results in specific List
> Hope it can be useful for Pdfbox or Lucence.
> Example : 
> {noformat}
>   public static void main(String[] args) throws Exception {
>   long startTime = System.currentTimeMillis();
>   System.out.println("PdfBoxBarcodeScannerTest starts...");
>   String filename = "D:\\test\\mail_with_barcode.pdf";
>   
>   
>   PdDocumentBarcodeScanner scanner = new 
> PdDocumentBarcodeScanner(new File(filename));
>   scanner.scan();
>   long endTime = System.currentTimeMillis();
>   System.out.println("pdf scanned in " + (endTime - startTime) + 
> " ms");
>   
>   scanner.displayResults();
>   }
> {noformat}
> Gives :
> PdfBoxBarcodeScannerTest starts...
> pdf scanned in 3803 ms
> page=0, barcodeFormat=DATA_MATRIX, value=HP14601225523
> Best regards
> David KELLER



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-3519) COSName is not ascii

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-3519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-3519:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> COSName is not ascii
> 
>
> Key: PDFBOX-3519
> URL: https://issues.apache.org/jira/browse/PDFBOX-3519
> Project: PDFBox
>  Issue Type: Bug
>  Components: Parsing
>Affects Versions: 2.0.3
>Reporter: simon steiner
>Priority: Minor
> Fix For: 3.0.0
>
> Attachments: COSNameAcrobat.png
>
>
> Trunk seems ok
> PDF is from PDFBOX-783
> {code}
> public static void main( String[] args ) throws IOException {
> PDDocument doc = PDDocument.load(new File("A02Gj780LZ.pdf"));
> COSDictionary x = doc.getPage(0).getResources().getCOSObject();
> read(x);
> doc.close();
> }
> private static void read(COSBase b) {
> if (b instanceof COSObject) {
> read(((COSObject) b).getObject());
> } else if (b instanceof COSDictionary) {
> for (COSBase x : ((COSDictionary) b).getValues()) {
> read(x);
> }
> } else if (b instanceof COSName) {
> if(((COSName) b).getName().charAt(0) > 256)
> throw new RuntimeException(((COSName) b).getName());
> }
> }
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-3393) Javascript actions on form fields cause data to become hidden

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-3393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-3393:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Javascript actions on form fields cause data to become hidden
> -
>
> Key: PDFBOX-3393
> URL: https://issues.apache.org/jira/browse/PDFBOX-3393
> Project: PDFBox
>  Issue Type: Improvement
>  Components: AcroForm
>Affects Versions: 2.0.2
>Reporter: Scott Coldwell
>Assignee: Maruan Sahyoun
> Fix For: 3.0.0
>
> Attachments: SF3102.pdf
>
>
> We have run across numerous PDFs that contain javascript actions on some of 
> the form fields.  When trying to set a value for the field, the data is there 
> in the field, but only visible if you place the cursor in the field.  And 
> since we flatten forms before saving them, it looks as if the data was never 
> populated.  Some of the actions we've seen are like the following:
> AFDate_KeystrokeEx("mm/dd/")
> AFNumber_Keystroke(0, 0, 0, 0, "", true);
> As a workaround we are remove any actions associated with a field before 
> setting a value by doing the following:
> {code}
> final PDField field = form.getField(fieldName);
> if (field instanceof PDTerminalField)
> {
> // Need to clear any actions (typically javascript) from the fields
> ((PDTerminalField)field).setActions(null);
> }
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2359) Lines show on top of image

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2359:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Lines show on top of image
> --
>
> Key: PDFBOX-2359
> URL: https://issues.apache.org/jira/browse/PDFBOX-2359
> Project: PDFBox
>  Issue Type: Bug
>  Components: Rendering
>Affects Versions: 2.0.0
>Reporter: simon steiner
>Priority: Minor
> Fix For: 3.0.0
>
> Attachments: 3.pdf
>
>
> java -jar ~/pdf-box-svn/app/target/pdfbox-app-2.0.0-SNAPSHOT.jar PDFToImage 
> 3.pdf



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2897) Preflight not flagging bad xml generated by XMPBox for dc:title

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2897:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Preflight not flagging bad xml generated by XMPBox for dc:title
> ---
>
> Key: PDFBOX-2897
> URL: https://issues.apache.org/jira/browse/PDFBOX-2897
> Project: PDFBox
>  Issue Type: Bug
>  Components: Preflight, XmpBox
>Affects Versions: 2.0.0
>Reporter: Tim Allison
>Priority: Minor
> Fix For: 3.0.0
>
> Attachments: PDFBOX-2897-PDFA-BadXMP2.pdf, PDFBOX-2897-PDFA-BadXMP.pdf
>
>
> [~tilman] asked that I open two separate issues for the finding in TIKA-1678 
> that XMPBox is not generating a valid dc:title entry in the XMP.  This issue 
> is meant to track preflight's failure to detect this problem.
> What PDFBox does:
> {code}
>   
> 
>   this is the title
> 
>   
> {code}
> It should be:
> {code}
>   
> 
>   this is the title
> 
>   
> {code}
> Error message from the PDF-Tools validator:
> {quote}
> 'dc:li' is not allowed in arrays. The elements must be rdf:li or rdf:_N, 
> where N is a positive number.
> There is only one RDF resource allowed in XMP.
> {quote}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-3017) Improve document signing

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-3017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-3017:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Improve document signing
> 
>
> Key: PDFBOX-3017
> URL: https://issues.apache.org/jira/browse/PDFBOX-3017
> Project: PDFBox
>  Issue Type: Improvement
>  Components: AcroForm, Signing
>Affects Versions: 2.0.0, 2.1.0
>Reporter: Tilman Hausherr
> Fix For: 3.0.0
>
> Attachments: pdfa_signed_insivible.pdf
>
>
> Improve signing code:
> - incremental save only works for signatures and doesn't respect certificates 
> such as Adobe Extended Usage Rights
> - -{{prepareNonVisualSignature}} clears the AcroForm DR 
> {{acroForm.setDefaultResources(null)}} which is not good if there are other 
> form fields-
> - visual/nonVisualSignature should move into the {{interactive.forms}} 
> package and be handled within the signature field
> - -verify signature (to have tests that go full circle)- done June 2016
> - document or refactor / rewrite visible labyrinthine signature code
> - why is it not possible to pass only the signatureField to addSignature, 
> instead having to create a COSDocument with a page and annotations that has 
> the signature field, and that must be searched for in 
> {{prepareVisibleSignature()}}?
> - support rotated pages (see 
> https://stackoverflow.com/questions/34012293/pdfbox-sign-landscape-file-error/34359956#34359956
>  )
> - -make sure that signed PDF/A files are still PDF/A (see 
> http://www.pdfa.org/wp-content/uploads/2011/08/tn0006_digital_signatures_in_pdfa-1_2008-03-14.pdf
>  ); /ID possibly not OK; /Annots is possibly required ([~tilman] removed this 
> for invisible signatures); test signed files with PDF-Tools and with 
> preflight- tested, they are OK with PDF-Tools and preflight
> - test whether "bad" signatures are detected by preflight (search in old 
> issues)
> - -PDFBOX-3363 - why is the stream cached in a file? Should it be done in 
> memory?- done on July 15, 2016
> - remove {{setVisualSignature(PDVisibleSigProperties 
> visSignatureProperties)}} from SignatureOptions.java, all it does is to call 
> {{visSignatureProperties.getVisibleSignature()}} which returns an 
> {{InputStream}}, and this is already available
> - {{checkSignatureField}} violates the "do one thing" rule
> - decide whether the whole certificate chain should be passed in the sample 
> code, instead of only the first one
> - check certificate chain, revocation lists, etc, only if needed by users, 
> code 
> [here|https://svn.apache.org/repos/asf/cxf/tags/cxf-2.4.1/distribution/src/main/release/samples/sts_issue_operation/src/main/java/demo/sts/provider/cert/]
> - deprecate / remove all PDVisibleSignDesigner constructors except those with 
> a PDDocument object, to avoid a file being opened twice
> - ... your ideas...



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2138) Corrupted words when using PDFTextStripper

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2138:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Corrupted words when using PDFTextStripper
> --
>
> Key: PDFBOX-2138
> URL: https://issues.apache.org/jira/browse/PDFBOX-2138
> Project: PDFBox
>  Issue Type: Bug
>  Components: Text extraction
>Affects Versions: 1.8.5, 1.8.6, 2.0.0
> Environment: Windows 7 / 64 bit
>Reporter: Walter Kehl
> Fix For: 3.0.0
>
> Attachments: banking-banana-skins-2014.pdf, 
> banking-banana-skins-2014.txt, PDFBOX-2138.pdf, PDFBOX-2138.txt
>
>
> >> I am using PDFTextStripper (embedded into another application) to get 
> >> the raw text of PDFs so far with good results but recently a PDF file 
> >> has appeared where the output of the PDFTextStripper was corrupted. I 
> >> got sentences like:
> >>
> >>
> >>
> >> "There is al o con ern that b nkers may be pushed to misprice risk 
> >> (No. 6) by the pres ures of c mpetition and an abunda ce of central b 
> >> nk-provided liquidity."
> > Additionally some portions of text appear 
> > twice in the output: first correctly and then corrupted. I have 
> > attached an output created with PDFBox's command line options.
> > If you compare lines 357- 365 with lines 421-429 you see that it is 
> > the same paragraph, first ok and then with characters missing. In the 
> > original source this paragraph is unique.
> > The same seems to happen for the other instances where text is corrupted.
> I also tried it directly on the command line with the same results: input and 
> output files are attached.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-3359) Drawing to Graphics2D / ScratchFileBuffer not closed

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-3359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-3359:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Drawing to Graphics2D / ScratchFileBuffer not closed
> 
>
> Key: PDFBOX-3359
> URL: https://issues.apache.org/jira/browse/PDFBOX-3359
> Project: PDFBox
>  Issue Type: Bug
>  Components: Rendering
>Affects Versions: 2.0.1
>Reporter: Ivan Ridao Freitas
> Fix For: 3.0.0
>
>
> First, there is a little bug on PDFRenderer.renderPageToGraphics(int 
> pageIndex, Graphics2D graphics, float scale) when using scale != 1 the call 
> to clearRect() fills the original size with white background, but it should 
> fill the scaled size.
> Second, I implemented a JPanel which is painted using that function and on 
> every paint this message goes to the console:
> "DEBUG ScratchFileBuffer:516 - ScratchFileBuffer not closed!". Here is the 
> code to test it, run it and *resize the JFrame*:
> {code:title=PanelTest.java|borderStyle=solid}
> import java.awt.Dimension;
> import java.awt.Graphics;
> import java.awt.Graphics2D;
> import java.io.File;
> import java.io.IOException;
> import javax.swing.JFrame;
> import javax.swing.JPanel;
> import javax.swing.SwingUtilities;
> import javax.swing.WindowConstants;
> import org.apache.pdfbox.pdmodel.PDDocument;
> import org.apache.pdfbox.rendering.PDFRenderer;
> public class PanelTest {
>
> private static JPanel getTestPanel() {
> PDDocument doc = null;
> try {
> doc = PDDocument.load(new File("anyfile.pdf"));
> } catch (IOException e) {
> e.printStackTrace();
> }
> final PDFRenderer renderer = new PDFRenderer(doc);
> JPanel panel = new JPanel() {
> @Override
> protected void paintComponent(Graphics g) {
> try {
> renderer.renderPageToGraphics(0, (Graphics2D) g, 0.5f);
> } catch (IOException e) {
> e.printStackTrace();
> }
> }
> };
> return panel;
> }
> public static void main(String[] args) throws Exception {
> SwingUtilities.invokeLater(new Runnable() {
> @Override
> public void run() {
> JFrame frame = new JFrame();
> frame.setDefaultCloseOperation(WindowConstants.EXIT_ON_CLOSE);
> frame.add(getTestPanel());
> frame.pack();
> frame.setSize(600, 400);
> Dimension paneSize = frame.getSize();
> Dimension screenSize = frame.getToolkit().getScreenSize();
> frame.setLocation((screenSize.width - paneSize.width) / 2, 
> (screenSize.height - paneSize.height) / 2);
> frame.setTitle("Test");
> frame.setVisible(true);
> }
> });
> }
> }
> {code}
> Ivan 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-3469) Improve handling of tab character when setting AcroForm field values

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-3469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-3469:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Improve handling of tab character when setting AcroForm field values
> 
>
> Key: PDFBOX-3469
> URL: https://issues.apache.org/jira/browse/PDFBOX-3469
> Project: PDFBox
>  Issue Type: Improvement
>  Components: AcroForm
>Reporter: Maruan Sahyoun
>Priority: Minor
> Fix For: 3.0.0
>
>
> When a plan text form field is filled with a value containing the TAB {{\t}} 
> character Adobe Acrobat jumps to tab positions. Look at replicating that.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-1792) Different metadata with NonSequentialPDFParser

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-1792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-1792:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Different metadata with NonSequentialPDFParser
> --
>
> Key: PDFBOX-1792
> URL: https://issues.apache.org/jira/browse/PDFBOX-1792
> Project: PDFBox
>  Issue Type: Bug
>  Components: Parsing, XmpBox
>Affects Versions: 1.8.8, 2.0.0
>Reporter: Tim Allison
>Assignee: Andreas Lehmkühler
>Priority: Minor
> Fix For: 3.0.0
>
> Attachments: PDFBOX-1792.tar.gz, testPDF_acroForm2.pdf
>
>
> The traditional parser is able to extract metadata from a test document from 
> TIKA-738.  The NonSequentialPDFParser is not able to extract metadata from 
> that file.  Another file from the Tika test suite has metadata that can be 
> extracted by the NonSequentialPDFParser but not by classic. 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-1532) extra space added to rotated text

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-1532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-1532:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> extra space added to rotated text 
> --
>
> Key: PDFBOX-1532
> URL: https://issues.apache.org/jira/browse/PDFBOX-1532
> Project: PDFBox
>  Issue Type: Bug
>  Components: Text extraction
>Affects Versions: 1.7.1, 2.0.0
>Reporter: Jinder Aujla
> Fix For: 3.0.0
>
> Attachments: 0049-My-squashed-commits.patch, rotated.pdf
>
>
> Extra line break added after first character is read in a document that has 
> rotated text.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2648) Preflight does not check CIDSet contents

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2648:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Preflight does not check CIDSet contents
> 
>
> Key: PDFBOX-2648
> URL: https://issues.apache.org/jira/browse/PDFBOX-2648
> Project: PDFBox
>  Issue Type: Bug
>  Components: Preflight
>Affects Versions: 2.0.0
>Reporter: Tilman Hausherr
> Fix For: 3.0.0
>
>
> As correctly observed by [~zuki_ebetsu]:
> {quote}preflight seems to check only the presence of the stream{quote}
> So we need to check the contents too, somehow. At the very least, that would 
> be the length. Next thing would be to verify that the bits correspond to the 
> CIDs that are present.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-1842) Warn if command-line pdf encryption destroys a pre-existing signature

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-1842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-1842:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Warn if command-line pdf encryption destroys a pre-existing signature
> -
>
> Key: PDFBOX-1842
> URL: https://issues.apache.org/jira/browse/PDFBOX-1842
> Project: PDFBox
>  Issue Type: Bug
>  Components: Utilities
>Affects Versions: 1.8.7, 2.0.0
>Reporter: Ralf Hauser
>Priority: Minor
> Fix For: 3.0.0
>
>
> see also PDFBOX-1594 , PDFBOX-912



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-1133) Refactoring PDFParser.parseHeader() method

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-1133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-1133:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Refactoring PDFParser.parseHeader() method
> --
>
> Key: PDFBOX-1133
> URL: https://issues.apache.org/jira/browse/PDFBOX-1133
> Project: PDFBox
>  Issue Type: Improvement
>  Components: Parsing
>Affects Versions: 1.7.0, 2.0.0
>Reporter: Pierre Huttin
>Assignee: Andreas Lehmkühler
>  Labels: Annotations, Appearance
> Fix For: 3.0.0
>
> Attachments: PDFParser.java.patch, sample.cdata_decoded.cos, 
> sample.xfdf
>
>
> Re-factoring the method parseHeader() method to support an extra header 
> declaration : "%CSO-".
> This header is foundable into some XFDF document, when the stamp appearance 
> stream is define as complete COSDocument. This special document use 
> "%CSO-1.0" as header declaration.
> So I purpose to enhance the PDFReader to be able to parse this kind of 
> document.
> Pierre Huttin



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-3066) Text extraction garbled in this file, was OK in 1.8

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-3066:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Text extraction garbled in this file, was OK in 1.8
> ---
>
> Key: PDFBOX-3066
> URL: https://issues.apache.org/jira/browse/PDFBOX-3066
> Project: PDFBox
>  Issue Type: Bug
>  Components: Text extraction
>Affects Versions: 2.0.0
>Reporter: Joel Hirsh
> Fix For: 3.0.0
>
> Attachments: 936918.pdf, garbled.pdf, PDFBOX-3066-reduced.pdf
>
>
> Attached file, PrintTextLocations shows text garbled, like *,%-))’)) 
> Acrobat copy/paste shows accurate text, and was also fine in 1.8.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2602) Enhance command line tools

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2602:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Enhance command line tools
> --
>
> Key: PDFBOX-2602
> URL: https://issues.apache.org/jira/browse/PDFBOX-2602
> Project: PDFBox
>  Issue Type: Bug
>  Components: Utilities
>Affects Versions: 1.8.8, 2.0.0
>Reporter: Maruan Sahyoun
>Priority: Minor
> Fix For: 3.0.0
>
>
> The command line tools shall be enhanced to have the same behavior across all 
> tools.
> From the discussion on the dev mailing list
> - add an -h option to print the usage
> - print the usage to System.err and use an exit code of 1 if there was an 
> invalid command line parameter
> - print messages on exceptions to System.err
> - rethrow the exception so java can handle it if it will terminate afterwards 
> anyway
> - use an exit code of 1if rethrowing doesn't make sense



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-3138) PDTextField doesn't accept any Hebrew characters as new value

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-3138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-3138:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> PDTextField doesn't accept any Hebrew characters as new value
> -
>
> Key: PDFBOX-3138
> URL: https://issues.apache.org/jira/browse/PDFBOX-3138
> Project: PDFBox
>  Issue Type: Bug
>  Components: AcroForm, FontBox
>Affects Versions: 2.0.0
> Environment: Eclipse 4.2.2, Windows 7 Pro, JRE 1.8.0_05
>Reporter: Gilad Denneboom
> Fix For: 3.0.0
>
> Attachments: SetHebrewFieldValueTest.java, Test-3-filled.pdf, 
> Test.pdf, Test.txt
>
>
> Trying to set a UTF-8 encoded Hebrew string as the value of a PDTextField 
> fails with the following exception:
> {code}
> Exception in thread "main" java.lang.IllegalArgumentException: No glyph for 
> U+05D7 in font AdobeHebrew-Regular
>   at 
> org.apache.pdfbox.pdmodel.font.PDType1CFont.encode(PDType1CFont.java:300)
>   at org.apache.pdfbox.pdmodel.font.PDFont.encode(PDFont.java:283)
>   at 
> org.apache.pdfbox.pdmodel.PDPageContentStream.showText(PDPageContentStream.java:341)
>   at 
> org.apache.pdfbox.pdmodel.interactive.form.PlainTextFormatter.format(PlainTextFormatter.java:213)
>   at 
> org.apache.pdfbox.pdmodel.interactive.form.AppearanceGeneratorHelper.insertGeneratedAppearance(AppearanceGeneratorHelper.java:373)
>   at 
> org.apache.pdfbox.pdmodel.interactive.form.AppearanceGeneratorHelper.setAppearanceContent(AppearanceGeneratorHelper.java:237)
>   at 
> org.apache.pdfbox.pdmodel.interactive.form.AppearanceGeneratorHelper.setAppearanceValue(AppearanceGeneratorHelper.java:144)
>   at 
> org.apache.pdfbox.pdmodel.interactive.form.PDTextField.constructAppearances(PDTextField.java:263)
>   at 
> org.apache.pdfbox.pdmodel.interactive.form.PDTerminalField.applyChange(PDTerminalField.java:221)
>   at 
> org.apache.pdfbox.pdmodel.interactive.form.PDTextField.setValue(PDTextField.java:218)
>   at SetHebrewFieldValueTest.main(SetHebrewFieldValueTest.java:22)
> {code}
> I've tried using multiple fonts for the field, all of which can handle Hebrew 
> characters just fine, and got the same results in all of them.
> See attached files for a demonstration of the issue.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2246) PDFTextStripper should handle colors

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2246:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> PDFTextStripper should handle colors
> 
>
> Key: PDFBOX-2246
> URL: https://issues.apache.org/jira/browse/PDFBOX-2246
> Project: PDFBox
>  Issue Type: Improvement
>  Components: Text extraction
>Affects Versions: 1.8.6, 1.8.7, 2.0.0
>Reporter: Tilman Hausherr
>Priority: Minor
> Fix For: 3.0.0
>
>
> A recent thread in the dev mailing lst (with Aaron H.) dealt with the 
> inability to extract color with PDFTextStripper. The solution was to create a 
>  PDFTextStripper with these entries to the properties file
> {code}
> CS=org.apache.pdfbox.util.operator.SetStrokingColorSpace
> cs=org.apache.pdfbox.util.operator.SetNonStrokingColorSpace
> G=org.apache.pdfbox.util.operator.SetStrokingGrayColor
> g=org.apache.pdfbox.util.operator.SetNonStrokingGrayColor
> K=org.apache.pdfbox.util.operator.SetStrokingCMYKColor
> k=org.apache.pdfbox.util.operator.SetNonStrokingCMYKColor
> RG=org.apache.pdfbox.util.operator.SetStrokingRGBColor
> rg=org.apache.pdfbox.util.operator.SetNonStrokingRGBColor
> SC=org.apache.pdfbox.util.operator.SetStrokingColor
> sc=org.apache.pdfbox.util.operator.SetNonStrokingColor
> SCN=org.apache.pdfbox.util.operator.SetStrokingColor
> scn=org.apache.pdfbox.util.operator.SetNonStrokingColor
> {code}
> I therefore propose (and I'd like to get at least one "+1" before starting 
> because I've never worked on that segment before):
> - replacing the empty entries in the PDFTextStripper property file with the 
> ones above
> - improve the printtextlocations example 
> The problem has come up before: PDFBOX-1736, 
> http://stackoverflow.com/q/10844271/535646 , 
> http://stackoverflow.com/a/9157714/535646 and the solutions presented are 
> rather cumbersome (using a PageDrawer object).



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-3000) Transparency Group issues

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-3000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-3000:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Transparency Group issues
> -
>
> Key: PDFBOX-3000
> URL: https://issues.apache.org/jira/browse/PDFBOX-3000
> Project: PDFBox
>  Issue Type: Bug
>  Components: Rendering
>Affects Versions: 2.0.0
>Reporter: John Hewson
>  Labels: Transparency
> Fix For: 3.0.0
>
> Attachments: 007087-payment-due-p58_reduced2.pdf, blendmodes.pdf, 
> BlendModes-rgb.pdf, circle-simple.pdf, ds-firewall-enterprise-p1_reduced.pdf, 
> gs-bugzilla689309-reduced-bc0.pdf, gs-bugzilla689309-reduced-bc1.pdf, 
> gs-bugzilla689309-reduced.pdf, gs-bugzilla689931_reduced-Multiply.pdf, 
> gs-bugzilla689931_reduced-ScreenBlendPageBackground.pdf, 
> gs-bugzilla689931_reduced-Screen.pdf, gs-bugzilla690022_reduced.pdf, 
> gs-bugzilla690022-reduced-rotations-cropbox.pdf, 
> gs-bugzilla690022-reduced-rotations.pdf, gs-bugzilla691157_mod_unc.pdf, 
> gs-bugzilla691157_mod_unc.png, gs-bugzilla691157.pdf, gs-bugzilla691348.pdf, 
> gs-bugzilla691650-2.pdf, gs-bugzilla692217_reduced.pdf, 
> gs-bugzilla693322_reduced.pdf, gs-bugzilla694556-3.pdf, 
> gs-bugzilla695354.pdf, gs-bugzilla695582-transparency-fill-stroke.pdf, 
> gs-bugzilla695582-transparency-fill-stroke.pdf-1.png, 
> PDFBOX-1697-reduced-rotations.pdf, PDFBOX-2182_mod.pdf, 
> PDFBox3359PanelTestEnhanced.java, PDFBOX-3400-RGB.pdf, 
> PDFBOX-3494_reduced_cropX.pdf, PDFBOX-3494_reduced.pdf, PDFBOX-3564-Mask.pdf, 
> PDFJS-2845-p1.pdf, PDFJS-5526-p13_reduced1.pdf, PDFJS-5526-p13_reduced2.pdf, 
> PDFJS-5526-p13_reduced3-nogroup.pdf, PDFJS-5811-2-p3_reduced4.pdf, 
> PDFJS-5811-2-p3_reduced.pdf, PDFJS-5811-2-p4_reduced-rotations.pdf, 
> PDFJS-5811-2.pdf, PDFJS-5853_reduced.pdf, 
> PDFJS-6967_reduced_outside_softmask.pdf, 
> samsung_galaxy_s_4_um-p1_reduced.pdf, snowman-nose-gradient.pdf, 
> snowman-nose-gradient-rgb.pdf, 
> snowman-nose-gradient-rgb_reduced-0.6-bad2.pdf, 
> snowman-nose-gradient-rgb_reduced-0.7-good2.pdf, 
> snowman-nose-gradient-screenshot-comparison-11.12.2016.jpg, 
> snowmen-opacity-clipping-masks-2.0.3.png, 
> snowmen-opacity-clipping-masks-2.1.0-SNAPSHOT-2016-12-07.png, 
> snowmen-opacity-clipping-masks-2.1-SNAPSHOT-10.12.2016.png, 
> snowmen-opacity-clipping-masks-adobe-illustrator.png, 
> snowmen-opacity-clipping-masks.pdf, SoftMask-Clipped.pdf, SoftMask.pdf, 
> softmask-rewrite-alt1.patch, softmask-rewrite.patch
>
>
> This is a follow-up issue for transparency group issues from PDFBOX-2423. 
> More details to come.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2662) Implement fallback if there are no AcroForm default resources

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2662?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2662:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Implement fallback if there are no AcroForm default resources
> -
>
> Key: PDFBOX-2662
> URL: https://issues.apache.org/jira/browse/PDFBOX-2662
> Project: PDFBox
>  Issue Type: Improvement
>  Components: AcroForm
>Affects Versions: 1.8.8, 2.0.0
>Reporter: Maruan Sahyoun
> Fix For: 3.0.0
>
>
> Some forms might not have a default resource (DR) entry for AcroForms or the 
> entry is incomplete. Adobe Reader can handle such situations using default 
> settings.
> We should be ably to handle such situations too.
> A sample form can be found in PDFBOX-1234, Fonts.pdf



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2482) Support for creating PDF containing OTF fonts

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2482:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Support for creating PDF containing OTF fonts
> -
>
> Key: PDFBOX-2482
> URL: https://issues.apache.org/jira/browse/PDFBOX-2482
> Project: PDFBox
>  Issue Type: New Feature
>  Components: FontBox, Writing
>Affects Versions: 2.0.0
>Reporter: Corinna Kinchin
> Fix For: 3.0.0
>
>
> Good morning!
> I have tried posting this question on the users list and stackOverflow, but 
> no luck ... other than Tilman Hausher's suggestion that I log an enhancement 
> request ... so here it is - it would be beyond wonderful if you might 
> consider supporting OTF fonts in PDF. 
> Here's the background - I've just (last week) downloaded the latest PDFbox 
> source from github and am trying to create a HelloWorldOTF.java, based on the 
> HelloWorldTTF.java example, with the hope of creating a PDF file which uses 
> an OTF font (in this case, Adobe Caslon Pro Regular) to add text to the 
> output PDF.
> Here's what I have so far:
> {code:java}
> doc = new PDDocument();
> PDPage page = new PDPage();
> doc.addPage(page);
> String testOtfFontFile =  "c:/windows/fonts/ACaslonPro-Regular.otf";
> String testTtfFontFile = "c:/windows/fonts/arial.ttf";
> String testPdfFile = "c:/tmp/pdfboxtest.pdf";
> CFFFont font = loadCFFFont(testOtfFontFile);
> PDFont ttfFont = PDTrueTypeFont.loadTTF(doc, new 
> File(testTtfFontFile));
> PDPageContentStream contentStream = new PDPageContentStream(doc,
> page);
> contentStream.beginText();
> // How to set the CFFFont?
> contentStream.setFont(ttfFont, 12);
> contentStream.moveTextPositionByAmount(100, 700);
> contentStream.drawString(text);
> contentStream.endText();
> contentStream.close();
> doc.save(testPdfFile);
> System.out.println(testPdfFile + " created!");
> {code}
> I can load a CFFFont using this code: (loadCFFFont()):
> {code:java}
> CFFFont cff = null;
> input = new FileInputStream(file);
> byte[] bytes = IOUtils.toByteArray(input);
> CFFParser cffParser = new CFFParser();
> cff = cffParser.parse(bytes).get(0);
> {code}
> ... but can't for the life of me figure out how to get from a CFFFont to a 
> PDFont in order to be able to use it to set the font via setFont().
> Any help or pointers would be hugely appreciated ...
> Thanks a million for reading this far ;)



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-1878) Tags are not being displayed in Adobe Acrobat Tags panel when merging pdfs

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-1878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-1878:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Tags are not being displayed in Adobe Acrobat Tags panel when merging pdfs
> --
>
> Key: PDFBOX-1878
> URL: https://issues.apache.org/jira/browse/PDFBOX-1878
> Project: PDFBox
>  Issue Type: Bug
>  Components: Utilities
>Affects Versions: 1.8.3, 1.8.4
> Environment: Windows XP SP3
>Reporter: Tiuser Lassei
>Priority: Minor
> Fix For: 3.0.0
>
> Attachments: pdf1.3.pdf, pdf1.4.pdf
>
>
> The merged PDF output produced by the PDFMergerUtility does not display the 
> tags correctly in the Tags panel of Adobe Acrobat. (Tested in Acrobat Pro XI 
> trial version). Have not tested in another PDF tool that can display tags 
> (not sure if another tool can do this).
> A single blank entry is shown instead of the actual structure tree of the 
> combined source pdfs.
> Though, it seems the reading order (based on the tag structure) is still 
> preserved (based on the testing of adobe reader's read aloud feature).
> Possibly related to fix on tag merging:
> https://issues.apache.org/jira/browse/PDFBOX-1342
> Although the tag merging logic is wrong is 1.8.2 (as only the first page is 
> tagged which was fixed as indicated in PDFBOX-1342), the tags appear 
> correctly in the Tag panel.
> This bug prevents users from modifying the tag structure in Acrobat as the 
> tag entries are missing.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-3092) Format 4 TTF cmap table is parsed incorrectly

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-3092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-3092:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Format 4 TTF cmap table is parsed incorrectly
> -
>
> Key: PDFBOX-3092
> URL: https://issues.apache.org/jira/browse/PDFBOX-3092
> Project: PDFBox
>  Issue Type: Bug
>Affects Versions: 2.0.0
>Reporter: John Hewson
> Fix For: 3.0.0
>
>
> Certain large Format 4 cmap tables aren't being parsed correctly by 
> CmapSubtable#processSubtype4(), for example in the font "ArialUnicodeMS".
> This results in missing glyphs when rendering the file from PDFBOX-2950, when 
> "ArialUnicodeMS" is used as a substitute. You can force this to happen by 
> changing the following line of PDCIDFontType2:
> {code}
> // find font or substitute
> CIDFontMapping mapping = FontMappers.instance()
> .getCIDFont(getBaseFont(), 
> getFontDescriptor(),
> getCIDSystemInfo());
> {code}
> Replace getBaseFont() with "ArialUnicodeMS"



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2538) avoid "-1 : Unknown error" message in preflight

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2538:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> avoid "-1 : Unknown error" message in preflight
> ---
>
> Key: PDFBOX-2538
> URL: https://issues.apache.org/jira/browse/PDFBOX-2538
> Project: PDFBox
>  Issue Type: Improvement
>  Components: Preflight
>Affects Versions: 2.0.0
>Reporter: Ralf Hauser
>Priority: Minor
> Fix For: 3.0.0
>
>
> e.g. error 4 in PDFBOX-2513
> after all, we know what the reason is.
> Ideas:
> 1) create COSStreamException extends IOException in COSStream.java
> 2) in SinglePageValidationProcess.validateContent() examine the root cause 
> exception type to classify smarter than just ERROR_UNKOWN_ERROR
> any other thoughts?



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2562) create ValidationWarnings

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2562:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> create ValidationWarnings
> -
>
> Key: PDFBOX-2562
> URL: https://issues.apache.org/jira/browse/PDFBOX-2562
> Project: PDFBox
>  Issue Type: Sub-task
>  Components: Preflight
>Affects Versions: 2.0.0
>Reporter: Ralf Hauser
>Priority: Minor
> Fix For: 3.0.0
>
>
> as seen in the parent Issue PDFBOX-2513 , pdf/A1 is not required to validate 
> other standards like TIFF completely.
> On the other hand, the information from the CCITTFaxDecoder is valuable and 
> shouldn't be simply discarded.
> Therefore, I suggest that this information is displayed without setting the 
> validation result to false.
> ValidationWarnings could have the same logical structure as ValidationError .



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2895) Enhance test coverage for appearance generation

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2895:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Enhance test coverage for appearance generation
> ---
>
> Key: PDFBOX-2895
> URL: https://issues.apache.org/jira/browse/PDFBOX-2895
> Project: PDFBox
>  Issue Type: Improvement
>  Components: AcroForm
>Affects Versions: 2.0.0
>Reporter: Maruan Sahyoun
>  Labels: Appearance
> Fix For: 3.0.0
>
>
> During the development around PDFBOX-2333 there were several issues 
> discovered where Adobe has some special handling when creating the appearance 
> for AcroForm fields.
> In order to ensure that the current implementation will not be broken while 
> adding missing functionality there shall be test cases (forms and unit tests) 
> securing the current behavior where we know that the results are inline with 
> user expectations.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-3080) TrueTypeFont synchronization

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-3080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-3080:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> TrueTypeFont synchronization
> 
>
> Key: PDFBOX-3080
> URL: https://issues.apache.org/jira/browse/PDFBOX-3080
> Project: PDFBox
>  Issue Type: Improvement
>  Components: FontBox
>Affects Versions: 2.0.0
> Environment: Debian, java version "1.8.0_60"
> Java(TM) SE Runtime Environment (build 1.8.0_60-b27)
> Java HotSpot(TM) 64-Bit Server VM (build 25.60-b23, mixed mode)
>Reporter: ccouturi
>Priority: Minor
>  Labels: Optimization, multithreading
> Fix For: 3.0.0
>
> Attachments: 0001-PDFBOX-3080-excessive-synchronization.patch, 
> PDFBOX-3080-sync-patch.diff, TrueTypeFont.diff
>
>
> The use case is rasterize pdf to png.
> On high load, a lot of threads are blocked in the 
> TrueTypeFont.getHorizontalMetrics() or getHorinzontalHeader() methods.
> Jstack snippet: 
> {code}
> Thread 21999: (state = BLOCKED)
>  - org.apache.fontbox.ttf.TrueTypeFont.getHorizontalMetrics() @bci=6, 
> line=231 (Compiled frame)
>  - org.apache.fontbox.ttf.TrueTypeFont.getAdvanceWidth(int) @bci=1, line=439 
> (Compiled frame)
>  - org.apache.pdfbox.pdmodel.font.PDTrueTypeFont.getWidthFromFont(int) 
> @bci=11, line=324 (Compiled frame)
>  - 
> org.apache.pdfbox.rendering.PageDrawer.drawGlyph2D(org.apache.pdfbox.rendering.Glyph2D,
>  org.apache.pdfbox.pdmodel.font.PDFont, int, org.apache.pdfbox.util.Vector, 
> java.awt.geom.AffineTransform)
>  @bci=39, line=350 (Compiled frame)
>  - 
> org.apache.pdfbox.rendering.PageDrawer.showFontGlyph(org.apache.pdfbox.util.Matrix,
>  org.apache.pdfbox.pdmodel.font.PDFont, int, java.lang.String, 
> org.apache.pdfbox.util.Vector) @bci=34, line=32
> 5 (Compiled frame)
>  - 
> org.apache.pdfbox.contentstream.PDFStreamEngine.showGlyph(org.apache.pdfbox.util.Matrix,
>  org.apache.pdfbox.pdmodel.font.PDFont, int, java.lang.String, 
> org.apache.pdfbox.util.Vector) @bci=32, li
> ne=728 (Compiled frame)
>  - org.apache.pdfbox.contentstream.PDFStreamEngine.showText(byte[]) @bci=240, 
> line=685 (Compiled frame)
>  - org.apache.pdfbox.contentstream.PDFStreamEngine.showTextString(byte[]) 
> @bci=2, line=553 (Compiled frame)
>  - 
> org.apache.pdfbox.contentstream.operator.text.ShowText.process(org.apache.pdfbox.contentstream.operator.Operator,
>  java.util.List) @bci=45, line=50 (Compiled frame)
>  - 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(org.apache.pdfbox.contentstream.operator.Operator,
>  java.util.List) @bci=35, line=799 (Compiled frame)
>  - 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(org.apache.pdfbox.contentstream.PDContentStream)
>  @bci=69, line=461 (Compiled frame)
>  - 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(org.apache.pdfbox.contentstream.PDContentStream)
>  @bci=63, line=438 (Compiled frame)
>  - 
> org.apache.pdfbox.contentstream.PDFStreamEngine.showForm(org.apache.pdfbox.pdmodel.graphics.form.PDFormXObject)
>  @bci=19, line=178 (Interpreted frame)
>  - 
> org.apache.pdfbox.contentstream.operator.graphics.DrawObject.process(org.apache.pdfbox.contentstream.operator.Operator,
>  java.util.List) @bci=162, line=70 (Interpreted frame)
>  - 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(org.apache.pdfbox.contentstream.operator.Operator,
>  java.util.List) @bci=35, line=799 (Compiled frame)
>  - 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(org.apache.pdfbox.contentstream.PDContentStream)
>  @bci=69, line=461 (Compiled frame)
>  - 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(org.apache.pdfbox.contentstream.PDContentStream)
>  @bci=63, line=438 (Compiled frame)
>  - 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(org.apache.pdfbox.pdmodel.PDPage)
>  @bci=19, line=149 (Interpreted frame)
>  - org.apache.pdfbox.rendering.PageDrawer.drawPage(java.awt.Graphics, 
> org.apache.pdfbox.pdmodel.common.PDRectangle) @bci=93, line=180 (Interpreted 
> frame)
>  - 
> org.apache.pdfbox.rendering.PDFRenderer.renderPage(org.apache.pdfbox.pdmodel.PDPage,
>  java.awt.Graphics2D, int, int, float, float) @bci=160, line=208 (Interpreted 
> frame)
>  - org.apache.pdfbox.rendering.PDFRenderer.renderImage(int, float, 
> org.apache.pdfbox.rendering.ImageType) @bci=166, line=139 (Interpreted frame)
>  - org.apache.pdfbox.rendering.PDFRenderer.renderImageWithDPI(int, float, 
> org.apache.pdfbox.rendering.ImageType) @bci=7, line=94 (Interpreted frame)
> [...]
> {code}
> These methods are synchronized. (I think because of initialization block.)
> When I change from:
> {code:title=actual|borderStyle=solid}
> public synchronized HorizontalHeaderTable 

[jira] [Updated] (PDFBOX-1288) Lines will not be printed (correctly) in case they are part of a clipping mask

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-1288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-1288:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Lines will not be printed (correctly) in case they are part of a clipping mask
> --
>
> Key: PDFBOX-1288
> URL: https://issues.apache.org/jira/browse/PDFBOX-1288
> Project: PDFBox
>  Issue Type: Bug
>  Components: Rendering
>Affects Versions: 1.6.0
> Environment: Mac OS X 10.7.2, Java 1.6.0_31, FOP 1.0
>Reporter: Christoph Langheld
>Assignee: John Hewson
> Fix For: 3.0.0
>
> Attachments: 20140226_after_printing_with_pdf_box.pdf, 
> 20140226_after_printing_with_pdf_box.pdf-1.png, 
> 20140226_after_printing_with_pdf_box.pdf-2.png, 20140226_original.pdf, 
> 20140226_original.pdf-1.png, 20140226_original.pdf-2.png, 
> after_printing_with_pdfbox.pdf, better-clipping-2.patch, 
> better-clipping-3.patch, better-clipping-4.patch, better-clipping.patch, 
> bitcoin.pdf, original.pdf, PDFBOX-1288.pdf-1-after.png, 
> PDFBOX-1288.pdf-1-before.png, pdfbox-1288.pdf-1.png, 
> PDFBOX-677.pdf-1-after.png, PDFBOX-677.pdf-1-before.png, 
> screenshot_borders_with_wrong_size.png, test2.pdf
>
>
> In case of a PDF file that was created with FOP 1.0 and was printed with 
> PDFBox, lines disappeared or lines appeared smaller. PDF files that were 
> created with Adobe InDesign (e.g.) will be printed correctly.
> It has something to do with lines which are part of a clipping mask. FOP 
> seems to generate a clipping mask around all kind of borders. Lines of type 
>  appear correctly and are not part of an clipping mask.
> When printing a FOP-PDF with PDFBox, PDFBox moves the clipping mask, so the 
> line is not visible anymore. Please see attached Screenshot.
> The PDF file I created with FOP is named: original.pdf
> The resulting PDF file after printing with PDFBox is named: 
> after_printing_with_pdfbox.pdf
> Regards
> Christoph



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2639) Enhance the AcroForms related API

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2639:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Enhance the AcroForms related API
> -
>
> Key: PDFBOX-2639
> URL: https://issues.apache.org/jira/browse/PDFBOX-2639
> Project: PDFBox
>  Issue Type: Improvement
>  Components: AcroForm
>Reporter: Maruan Sahyoun
>Priority: Minor
> Fix For: 3.0.0
>
>
> This is a general issue to gather input for potential enhancements to use 
> PDFBox for forms creation and filling. Sub tasks to that issue will track 
> individual enhancements which might result from that input.
> Possible enhancements
> - -currently getWidget() only returns a single Widget but a field might have 
> multiple- > use getWidgets()
> - adding a Widget to already existing ones could be simplified
> - working with Widgets within RadioButtons might be enhanced to easier 
> check/uncheck a Radiobutton option
> - make it easier to set the fields value



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-3687) PDFBox doesn't respect different setting of /DA at PDAnnotationWidget level to /DA at PDField level

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-3687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-3687:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> PDFBox doesn't respect different setting of /DA at PDAnnotationWidget level 
> to /DA at PDField level
> ---
>
> Key: PDFBOX-3687
> URL: https://issues.apache.org/jira/browse/PDFBOX-3687
> Project: PDFBox
>  Issue Type: Bug
>  Components: AcroForm
>Affects Versions: 2.0.4
>Reporter: Scott Coldwell
>Assignee: Maruan Sahyoun
> Fix For: 2.0.6, 3.0.0
>
> Attachments: DifferentDALevels.pdf, SF1152.pdf, TestSF1152-LL.pdf
>
>
> See the attached file "SF1152.pdf" and locate the "Bureau" field.  If you 
> open this document in Acrobat the font size shows as Auto and functions as 
> such in Acrobat/Acrobat Reader.  However, PDFBox thinks the font size is set 
> to 9.  I confirmed this in both PDFDebugger and in our rendering code.
> The second attachment "TestSF1152-LL.pdf" shows the behavior from PDFBox not 
> picking up the correct Auto font size.  The value is truncated unless you 
> click inside the field or delete the value and replace it.
> The following code shows the problem:
> {code}
> final PDDocument document = 
> PDDocument.load(ClassLoader.getSystemResourceAsStream("SF1152.pdf"));
> final PDAcroForm form = 
> document.getDocumentCatalog().getAcroForm();
> final PDTextField field = (PDTextField)form.getField("Bureau");
> final COSDictionary dict = field.getCOSObject();
> String defaultAppearance = 
> ((COSString)dict.getDictionaryObject(COSName.DA)).getString();
> System.out.println(defaultAppearance);
> {code}
> results in:
> {code}
> /Helv 9 Tf 0 0 0.5 rg
> {code}
> If I manually set the font size to auto using:
> {code}
> dict.setString(COSName.DA, "/Helv 0 Tf 0 0 0.5 rg");
> {code}
> the PDF renders correctly.
> But we shouldn't have to manipulate font sizes in code when it's set 
> correctly in the PDF.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2472) Switch XFDF export to JAXP

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2472:
---
Fix Version/s: (was: 2.1.0)

> Switch XFDF export to JAXP
> --
>
> Key: PDFBOX-2472
> URL: https://issues.apache.org/jira/browse/PDFBOX-2472
> Project: PDFBox
>  Issue Type: Improvement
>  Components: AcroForm, Utilities
>Affects Versions: 2.0.0
>Reporter: Andreas Lehmkühler
>  Labels: jaxp, xfdf
> Fix For: 3.0.0
>
>
> As proposed in PDFBOX-2419 we should switch the XFDF export to JAXP



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-1848) Time Stamp Document Level Sigature

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-1848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-1848:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Time Stamp Document Level Sigature
> --
>
> Key: PDFBOX-1848
> URL: https://issues.apache.org/jira/browse/PDFBOX-1848
> Project: PDFBox
>  Issue Type: Improvement
>  Components: Signing
>Affects Versions: 2.0.0
>Reporter: vakhtang koroghlishvili
> Fix For: 3.0.0
>
> Attachments: CreateTSASignature.java.patch, 
> TSA-SIG-LOOKS-LIKE-THIS.png
>
>
> We need TSA Document Level signature modulo too!
> At the moment we sign document with our certificate. But... sometimes we need 
> to sign document with TSA too. This is important part of signing. Sometimes 
> this is very very very important- for instance when we will implement PAdES 4 
> profile this module will be essential. without that Document Secure Store 
> will not work :)
> I'm working on this improvement. I'will finish this soon. It's almost done. I 
> only must add some java docs, and might be I change architect design and etc..
> So, please assign this it to me :) I will upload patch as soon as possible :)



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2963) Remove Bouncy Castle Reference

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2963:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Remove Bouncy Castle Reference
> --
>
> Key: PDFBOX-2963
> URL: https://issues.apache.org/jira/browse/PDFBOX-2963
> Project: PDFBox
>  Issue Type: Improvement
>  Components: Crypto, PDModel
>Affects Versions: 1.8.9, 1.8.10, 2.0.0
>Reporter: Johnny Minty
> Fix For: 2.0.6, 3.0.0
>
>
> PDFBox Versions 1.8.X and 2.0.X add Bouncy Castle as a security provider 
> explicitly (Hard coded)
> Referencing bouncy castle explicitly ties PDF box to a specific provider 
> implementation.
> Instead of referencing BouncyCastleProvider explicitly provide an option to 
> select another provider or alternatively allow a way to override the default. 
> Version 1.8.X:
> https://github.com/apache/pdfbox/blob/1.8.10/pdfbox/src/main/java/org/apache/pdfbox/pdmodel/encryption/SecurityHandlersManager.java
> {code}
> public static SecurityHandlersManager getInstance()
> {
> if(instance == null)
> {
> instance = new SecurityHandlersManager();
> Security.addProvider(new BouncyCastleProvider());
> }
> return instance;
> }
> {code}
> Version 2.0.0:
> https://github.com/apache/pdfbox/blob/trunk/pdfbox/src/main/java/org/apache/pdfbox/pdmodel/encryption/SecurityHandlerFactory.java
> {code}
>static
> {
> Security.addProvider(new BouncyCastleProvider());
> }
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-1176) Watermark Annotations

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-1176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-1176:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Watermark Annotations
> -
>
> Key: PDFBOX-1176
> URL: https://issues.apache.org/jira/browse/PDFBOX-1176
> Project: PDFBox
>  Issue Type: Wish
>  Components: Writing
>Affects Versions: 1.8.7, 2.0.0
>Reporter: Rubesh MX
>  Labels: Watermark
> Fix For: 3.0.0
>
>   Original Estimate: 24h
>  Remaining Estimate: 24h
>
> I am checking if watermarks can  be added to a PDF doc and the same way can 
> be removed, so far I could not find any option to do that with PDFBox; It 
> will be better if we have an option to add and remove watermak to a PDF.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2626) Regenerate field appearances if NeedAppearances is set prior to rendering

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2626:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Regenerate field appearances if NeedAppearances is set prior to rendering
> -
>
> Key: PDFBOX-2626
> URL: https://issues.apache.org/jira/browse/PDFBOX-2626
> Project: PDFBox
>  Issue Type: New Feature
>  Components: AcroForm
>Affects Versions: 2.0.0
>Reporter: simon steiner
>  Labels: Appearance
> Fix For: 3.0.0
>
> Attachments: out.pdf
>
>
> java -jar ~/pdf-box-svn/app/target/pdfbox-app-2.0.0-SNAPSHOT.jar PDFToImage 
> out.pdf



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2776) support "Long Term Validation" signature extensions (LTV)

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2776:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> support "Long Term Validation" signature extensions (LTV)
> -
>
> Key: PDFBOX-2776
> URL: https://issues.apache.org/jira/browse/PDFBOX-2776
> Project: PDFBox
>  Issue Type: Improvement
>  Components: Signing
>Affects Versions: 2.0.0
>Reporter: Ralf Hauser
> Fix For: 3.0.0
>
>
> in recent acrobat readers, every signature is commented w.r.t. "LTV"
> ETSI TS 102 778-4 V1.1.2 (2009-12) Technical Specification
> referenced as part 4 in
> http://en.wikipedia.org/wiki/PAdES 
> It would be great if pdf signatures created with PDFBox would assist in 
> creatign those.
> Target test setup: 
> 1) input of an unsigned PDF-1.5 document
> 2) signature with
> a) local key pair
> b) hsm
> c) remote signature service (e.g. via soap)
> 3) add ocsp response for LTV (crls typically are larger)
> ==> Result: signed pdf where acrobat reader claims it to be "LTV enabled"
> see also PDFBOX-1848
> more in 
> http://stackoverflow.com/questions/26090558/ltv-enabled-signature-in-pdf



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-45) Support incremental save

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-45?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-45:
-
Fix Version/s: (was: 2.1.0)
   3.0.0

> Support incremental save
> 
>
> Key: PDFBOX-45
> URL: https://issues.apache.org/jira/browse/PDFBOX-45
> Project: PDFBox
>  Issue Type: New Feature
>  Components: Writing
> Fix For: 3.0.0
>
>
> [imported from SourceForge]
> http://sourceforge.net/tracker/index.php?group_id=78314=552835=1157431
> Originally submitted by purplish_cat on 2005-03-05 12:28.
> After opening a PDF file and changing objects out of it, 
> allow to save the changes incrementally to the same file 
> instead of creating a completely new file.
> [comment on SourceForge]
> Originally sent by benlitchfield.
> Logged In: YES 
> user_id=601708
> See forum thread at
> https://sourceforge.net/forum/message.php?msg_id=3032112



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2575) Add support for collections (portfolio)

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2575:
---
Fix Version/s: (was: 2.1.0)

> Add support for collections (portfolio)
> ---
>
> Key: PDFBOX-2575
> URL: https://issues.apache.org/jira/browse/PDFBOX-2575
> Project: PDFBox
>  Issue Type: Improvement
>  Components: PDModel
>Affects Versions: 2.0.0
>Reporter: Andreas Lehmkühler
>  Labels: Collections, Portfolio
> Fix For: 3.0.0
>
>
> We should add support for collections aka portfolios (see PDF specs 7.11.6 
> Collections items and 12.3.5 Collections)



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-2564) be more precise with "Stream not read" error

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-2564:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> be more precise with "Stream not read" error
> 
>
> Key: PDFBOX-2564
> URL: https://issues.apache.org/jira/browse/PDFBOX-2564
> Project: PDFBox
>  Issue Type: Sub-task
>  Components: Preflight
>Affects Versions: 2.0.0
>Reporter: Ralf Hauser
>Priority: Minor
> Fix For: 3.0.0
>
>
> Currently, the error says:
> "2 : Unknown error, Stream was not read - on page 1"
> So while the validation doesn't really need to bother with the "EOL 
> encountered in white run", the pdf/A apparently contains further attributes 
> such  "/length" that can no longer be confirmed or denied.
> So,
> 1) the error should explain that.
> 2) If there is even not only a total length, but a specific length expected 
> for the COSStream, error should mention that among others



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-1987) Provide a PDF Lexer as a base for PDF parsing

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-1987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-1987:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Provide a PDF Lexer as a base for PDF parsing
> -
>
> Key: PDFBOX-1987
> URL: https://issues.apache.org/jira/browse/PDFBOX-1987
> Project: PDFBox
>  Issue Type: Improvement
>  Components: Parsing
>Affects Versions: 2.0.0
>Reporter: Maruan Sahyoun
>Priority: Minor
> Fix For: 3.0.0
>
> Attachments: src.zip
>
>
> In order to enhance the parsing process and as a foundation for a combination 
> of the different parsers a PDF lexer should be provided.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-1529) Exchange hard-coded values for variables and provide command-line options in TextToPDF component

2017-03-25 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-1529:
---
Fix Version/s: (was: 2.1.0)
   3.0.0

> Exchange hard-coded values for variables and provide command-line options in 
> TextToPDF component
> 
>
> Key: PDFBOX-1529
> URL: https://issues.apache.org/jira/browse/PDFBOX-1529
> Project: PDFBox
>  Issue Type: Improvement
>  Components: Utilities
>Affects Versions: 1.7.1
>Reporter: Dave Powell
>Assignee: Andreas Lehmkühler
>Priority: Minor
>  Labels: features, newbie, patch
> Fix For: 3.0.0
>
> Attachments: 
> patch-pdfbox-src-main-java-org-apache-pdfbox-TextToPDF.java.diff
>
>
> Exchange hard-coded values for variables and provide command-line options in 
> TextToPDF component
> 1) Enable the margins to be individually set from the command-line
> 2) Enable the font size to be represented as a floating-point value, e.g. 
> 10.5 or 11.5
> 3) Allow the line-spacing to be changed from the command-line
> 4) Allow the page size to be changed from the command-line, e.g. A4, A3, 
> US-Letter
> I will provide a patch for review for this added functionality



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



Re: Branch for new major version?

2017-03-25 Thread Andreas Lehmkuehler

Am 23.03.2017 um 07:50 schrieb Andreas Lehmkuehler:

Am 21.03.2017 um 19:45 schrieb Maruan Sahyoun:



Am 21.03.2017 um 19:17 schrieb Tilman Hausherr :

Am 21.03.2017 um 17:53 schrieb Andreas Lehmkuehler:

Hi,

I'm thinking about implementing some stuff which would most likely require a
new major version. Obviously there are some similar things which already
came up, e.g. remove some disturbing public constants, switch to java 7 to
use twelve monkeys lib.

The question is, how should we deal with that. I see a handful of possible
ways:

- switch the current trunk to 3.0 and omit 2.1 (for now there are 3 tickets
with "Fix version/s" 2.1 which didn't make it to the 2.0 branch for
different reasons)
- create a 2.1 branch based on the current trunk and switch the current
trunk to 3.0
- create a 3.0 branch and don't change anything else. Merge all changes to
the trunk once 2.1 was released some time in the future
- don't change anything officially, but create a "private" 3.0 branch and
merge those changes to a future 3.0 version


I'm for solution 1 because it's a PITA to care about many versions.



I'd go for 1 too - BR Maruan


Thanks for your input. Let's wait another couple of days maybe until the next
weekend before proceeding with any changes.

Ok, there isn't any other input.

I'm going to rename the trunk version from 2.1-SNAPSHOT to 3.0-SNAPSHOT and I'm 
going to do the same with out JIRA tickets.


BR
Andreas



BR
Andreas





Tilman


-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org




-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org




-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org




-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Closed] (PDFBOX-3734) out of memory issue when convert scaned pdf to image

2017-03-25 Thread Tilman Hausherr (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-3734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr closed PDFBOX-3734.
---
Resolution: Not A Problem

> out of memory issue when convert scaned pdf to image
> 
>
> Key: PDFBOX-3734
> URL: https://issues.apache.org/jira/browse/PDFBOX-3734
> Project: PDFBox
>  Issue Type: Bug
>  Components: Rendering
>Affects Versions: 2.0.5
> Environment: win7 64bit, jdk 1.7 64bit
>Reporter: Yachun Miao
>  Labels: how-to
>
> i had a scaned pdf file which just 2.8M, when try pdf to image feature, i get 
> OOM with -Xmx200m:
> {color:red}
>   at java.awt.image.DataBufferByte.(DataBufferByte.java:92)
>   at 
> java.awt.image.ComponentSampleModel.createDataBuffer(ComponentSampleModel.java:415)
>   at 
> sun.awt.image.ByteInterleavedRaster.(ByteInterleavedRaster.java:89)
>   at 
> sun.awt.image.ByteInterleavedRaster.createCompatibleWritableRaster(ByteInterleavedRaster.java:1281)
>   at 
> sun.awt.image.ByteInterleavedRaster.createCompatibleWritableRaster(ByteInterleavedRaster.java:1292)
>   at org.apache.pdfbox.filter.DCTFilter.fromBGRtoRGB(DCTFilter.java:246)
>   at org.apache.pdfbox.filter.DCTFilter.decode(DCTFilter.java:171)
>   at org.apache.pdfbox.cos.COSInputStream.create(COSInputStream.java:69)
>   at org.apache.pdfbox.cos.COSStream.createInputStream(COSStream.java:162)
>   at 
> org.apache.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:235)
>   at 
> org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.(PDImageXObject.java:124)
>   at 
> org.apache.pdfbox.pdmodel.graphics.PDXObject.createXObject(PDXObject.java:70)
>   at 
> org.apache.pdfbox.pdmodel.PDResources.getXObject(PDResources.java:409)
>   at 
> org.apache.pdfbox.contentstream.operator.graphics.DrawObject.process(DrawObject.java:53)
>   at 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:838)
>   at 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:495)
>   at 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:469)
>   at 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:150)
>   at org.apache.pdfbox.rendering.PageDrawer.drawPage(PageDrawer.java:206)
>   at 
> org.apache.pdfbox.rendering.PDFRenderer.renderImage(PDFRenderer.java:145)
> {color}
> After i enlarge jvm max heap size to 500M, then it works. 
> I know pdf rendering is very difficulty, but do we have some manner to avoid 
> consumpting so much memory? whatever it is a bit surprized pdfbox use 500M 
> memory to handle one page of scaned pdf (total 2.8M). ratio is around 200 
> times. 
> But as per me, it is ok to decrease some quality of image converted. 
> (actually the quality of original image in pdf not good as well. :)). Tell me 
> if we do have such methods. I will help try. 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Comment Edited] (PDFBOX-3734) out of memory issue when convert scaned pdf to image

2017-03-25 Thread Tilman Hausherr (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15941704#comment-15941704
 ] 

Tilman Hausherr edited comment on PDFBOX-3734 at 3/25/17 12:03 PM:
---

See here:
https://pdfbox.apache.org/2.0/faq.html#outofmemoryrrror

Closing as this is a howto-question and not really a problem (you can run the 
program with more memory; I use -Xmx2g for my tests), please ask such questions 
in the mailing list or on stackoverflow, see
https://pdfbox.apache.org/support.html


was (Author: tilman):
See here:
https://pdfbox.apache.org/2.0/faq.html#outofmemoryrrror

Closing as this is a howto-question, please ask these in the mailing list or on 
stackoverflow, see
https://pdfbox.apache.org/support.html

> out of memory issue when convert scaned pdf to image
> 
>
> Key: PDFBOX-3734
> URL: https://issues.apache.org/jira/browse/PDFBOX-3734
> Project: PDFBox
>  Issue Type: Bug
>  Components: Rendering
>Affects Versions: 2.0.5
> Environment: win7 64bit, jdk 1.7 64bit
>Reporter: Yachun Miao
>  Labels: how-to
>
> i had a scaned pdf file which just 2.8M, when try pdf to image feature, i get 
> OOM with -Xmx200m:
> {color:red}
>   at java.awt.image.DataBufferByte.(DataBufferByte.java:92)
>   at 
> java.awt.image.ComponentSampleModel.createDataBuffer(ComponentSampleModel.java:415)
>   at 
> sun.awt.image.ByteInterleavedRaster.(ByteInterleavedRaster.java:89)
>   at 
> sun.awt.image.ByteInterleavedRaster.createCompatibleWritableRaster(ByteInterleavedRaster.java:1281)
>   at 
> sun.awt.image.ByteInterleavedRaster.createCompatibleWritableRaster(ByteInterleavedRaster.java:1292)
>   at org.apache.pdfbox.filter.DCTFilter.fromBGRtoRGB(DCTFilter.java:246)
>   at org.apache.pdfbox.filter.DCTFilter.decode(DCTFilter.java:171)
>   at org.apache.pdfbox.cos.COSInputStream.create(COSInputStream.java:69)
>   at org.apache.pdfbox.cos.COSStream.createInputStream(COSStream.java:162)
>   at 
> org.apache.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:235)
>   at 
> org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.(PDImageXObject.java:124)
>   at 
> org.apache.pdfbox.pdmodel.graphics.PDXObject.createXObject(PDXObject.java:70)
>   at 
> org.apache.pdfbox.pdmodel.PDResources.getXObject(PDResources.java:409)
>   at 
> org.apache.pdfbox.contentstream.operator.graphics.DrawObject.process(DrawObject.java:53)
>   at 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:838)
>   at 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:495)
>   at 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:469)
>   at 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:150)
>   at org.apache.pdfbox.rendering.PageDrawer.drawPage(PageDrawer.java:206)
>   at 
> org.apache.pdfbox.rendering.PDFRenderer.renderImage(PDFRenderer.java:145)
> {color}
> After i enlarge jvm max heap size to 500M, then it works. 
> I know pdf rendering is very difficulty, but do we have some manner to avoid 
> consumpting so much memory? whatever it is a bit surprized pdfbox use 500M 
> memory to handle one page of scaned pdf (total 2.8M). ratio is around 200 
> times. 
> But as per me, it is ok to decrease some quality of image converted. 
> (actually the quality of original image in pdf not good as well. :)). Tell me 
> if we do have such methods. I will help try. 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-3734) out of memory issue when convert scaned pdf to image

2017-03-25 Thread Tilman Hausherr (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15941704#comment-15941704
 ] 

Tilman Hausherr commented on PDFBOX-3734:
-

See here:
https://pdfbox.apache.org/2.0/faq.html#outofmemoryrrror

Closing as this is a howto-question, please ask these in the mailing list or on 
stackoverflow, see
https://pdfbox.apache.org/support.html

> out of memory issue when convert scaned pdf to image
> 
>
> Key: PDFBOX-3734
> URL: https://issues.apache.org/jira/browse/PDFBOX-3734
> Project: PDFBox
>  Issue Type: Bug
>  Components: Rendering
>Affects Versions: 2.0.5
> Environment: win7 64bit, jdk 1.7 64bit
>Reporter: Yachun Miao
>  Labels: how-to
>
> i had a scaned pdf file which just 2.8M, when try pdf to image feature, i get 
> OOM with -Xmx200m:
> {color:red}
>   at java.awt.image.DataBufferByte.(DataBufferByte.java:92)
>   at 
> java.awt.image.ComponentSampleModel.createDataBuffer(ComponentSampleModel.java:415)
>   at 
> sun.awt.image.ByteInterleavedRaster.(ByteInterleavedRaster.java:89)
>   at 
> sun.awt.image.ByteInterleavedRaster.createCompatibleWritableRaster(ByteInterleavedRaster.java:1281)
>   at 
> sun.awt.image.ByteInterleavedRaster.createCompatibleWritableRaster(ByteInterleavedRaster.java:1292)
>   at org.apache.pdfbox.filter.DCTFilter.fromBGRtoRGB(DCTFilter.java:246)
>   at org.apache.pdfbox.filter.DCTFilter.decode(DCTFilter.java:171)
>   at org.apache.pdfbox.cos.COSInputStream.create(COSInputStream.java:69)
>   at org.apache.pdfbox.cos.COSStream.createInputStream(COSStream.java:162)
>   at 
> org.apache.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:235)
>   at 
> org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.(PDImageXObject.java:124)
>   at 
> org.apache.pdfbox.pdmodel.graphics.PDXObject.createXObject(PDXObject.java:70)
>   at 
> org.apache.pdfbox.pdmodel.PDResources.getXObject(PDResources.java:409)
>   at 
> org.apache.pdfbox.contentstream.operator.graphics.DrawObject.process(DrawObject.java:53)
>   at 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:838)
>   at 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:495)
>   at 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:469)
>   at 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:150)
>   at org.apache.pdfbox.rendering.PageDrawer.drawPage(PageDrawer.java:206)
>   at 
> org.apache.pdfbox.rendering.PDFRenderer.renderImage(PDFRenderer.java:145)
> {color}
> After i enlarge jvm max heap size to 500M, then it works. 
> I know pdf rendering is very difficulty, but do we have some manner to avoid 
> consumpting so much memory? whatever it is a bit surprized pdfbox use 500M 
> memory to handle one page of scaned pdf (total 2.8M). ratio is around 200 
> times. 
> But as per me, it is ok to decrease some quality of image converted. 
> (actually the quality of original image in pdf not good as well. :)). Tell me 
> if we do have such methods. I will help try. 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-3734) out of memory issue when convert scaned pdf to image

2017-03-25 Thread Tilman Hausherr (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-3734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr updated PDFBOX-3734:

Labels: how-to  (was: )

> out of memory issue when convert scaned pdf to image
> 
>
> Key: PDFBOX-3734
> URL: https://issues.apache.org/jira/browse/PDFBOX-3734
> Project: PDFBox
>  Issue Type: Bug
>  Components: Rendering
>Affects Versions: 2.0.5
> Environment: win7 64bit, jdk 1.7 64bit
>Reporter: Yachun Miao
>  Labels: how-to
>
> i had a scaned pdf file which just 2.8M, when try pdf to image feature, i get 
> OOM with -Xmx200m:
> {color:red}
>   at java.awt.image.DataBufferByte.(DataBufferByte.java:92)
>   at 
> java.awt.image.ComponentSampleModel.createDataBuffer(ComponentSampleModel.java:415)
>   at 
> sun.awt.image.ByteInterleavedRaster.(ByteInterleavedRaster.java:89)
>   at 
> sun.awt.image.ByteInterleavedRaster.createCompatibleWritableRaster(ByteInterleavedRaster.java:1281)
>   at 
> sun.awt.image.ByteInterleavedRaster.createCompatibleWritableRaster(ByteInterleavedRaster.java:1292)
>   at org.apache.pdfbox.filter.DCTFilter.fromBGRtoRGB(DCTFilter.java:246)
>   at org.apache.pdfbox.filter.DCTFilter.decode(DCTFilter.java:171)
>   at org.apache.pdfbox.cos.COSInputStream.create(COSInputStream.java:69)
>   at org.apache.pdfbox.cos.COSStream.createInputStream(COSStream.java:162)
>   at 
> org.apache.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:235)
>   at 
> org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.(PDImageXObject.java:124)
>   at 
> org.apache.pdfbox.pdmodel.graphics.PDXObject.createXObject(PDXObject.java:70)
>   at 
> org.apache.pdfbox.pdmodel.PDResources.getXObject(PDResources.java:409)
>   at 
> org.apache.pdfbox.contentstream.operator.graphics.DrawObject.process(DrawObject.java:53)
>   at 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:838)
>   at 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:495)
>   at 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:469)
>   at 
> org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:150)
>   at org.apache.pdfbox.rendering.PageDrawer.drawPage(PageDrawer.java:206)
>   at 
> org.apache.pdfbox.rendering.PDFRenderer.renderImage(PDFRenderer.java:145)
> {color}
> After i enlarge jvm max heap size to 500M, then it works. 
> I know pdf rendering is very difficulty, but do we have some manner to avoid 
> consumpting so much memory? whatever it is a bit surprized pdfbox use 500M 
> memory to handle one page of scaned pdf (total 2.8M). ratio is around 200 
> times. 
> But as per me, it is ok to decrease some quality of image converted. 
> (actually the quality of original image in pdf not good as well. :)). Tell me 
> if we do have such methods. I will help try. 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Created] (PDFBOX-3734) out of memory issue when convert scaned pdf to image

2017-03-25 Thread Yachun Miao (JIRA)
Yachun Miao created PDFBOX-3734:
---

 Summary: out of memory issue when convert scaned pdf to image
 Key: PDFBOX-3734
 URL: https://issues.apache.org/jira/browse/PDFBOX-3734
 Project: PDFBox
  Issue Type: Bug
  Components: Rendering
Affects Versions: 2.0.5
 Environment: win7 64bit, jdk 1.7 64bit
Reporter: Yachun Miao


i had a scaned pdf file which just 2.8M, when try pdf to image feature, i get 
OOM with -Xmx200m:

{color:red}
at java.awt.image.DataBufferByte.(DataBufferByte.java:92)
at 
java.awt.image.ComponentSampleModel.createDataBuffer(ComponentSampleModel.java:415)
at 
sun.awt.image.ByteInterleavedRaster.(ByteInterleavedRaster.java:89)
at 
sun.awt.image.ByteInterleavedRaster.createCompatibleWritableRaster(ByteInterleavedRaster.java:1281)
at 
sun.awt.image.ByteInterleavedRaster.createCompatibleWritableRaster(ByteInterleavedRaster.java:1292)
at org.apache.pdfbox.filter.DCTFilter.fromBGRtoRGB(DCTFilter.java:246)
at org.apache.pdfbox.filter.DCTFilter.decode(DCTFilter.java:171)
at org.apache.pdfbox.cos.COSInputStream.create(COSInputStream.java:69)
at org.apache.pdfbox.cos.COSStream.createInputStream(COSStream.java:162)
at 
org.apache.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:235)
at 
org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.(PDImageXObject.java:124)
at 
org.apache.pdfbox.pdmodel.graphics.PDXObject.createXObject(PDXObject.java:70)
at 
org.apache.pdfbox.pdmodel.PDResources.getXObject(PDResources.java:409)
at 
org.apache.pdfbox.contentstream.operator.graphics.DrawObject.process(DrawObject.java:53)
at 
org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:838)
at 
org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:495)
at 
org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:469)
at 
org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:150)
at org.apache.pdfbox.rendering.PageDrawer.drawPage(PageDrawer.java:206)
at 
org.apache.pdfbox.rendering.PDFRenderer.renderImage(PDFRenderer.java:145)
{color}

After i enlarge jvm max heap size to 500M, then it works. 

I know pdf rendering is very difficulty, but do we have some manner to avoid 
consumpting so much memory? whatever it is a bit surprized pdfbox use 500M 
memory to handle one page of scaned pdf (total 2.8M). ratio is around 200 
times. 

But as per me, it is ok to decrease some quality of image converted. (actually 
the quality of original image in pdf not good as well. :)). Tell me if we do 
have such methods. I will help try. 






--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-3733) image added to pdf by tool AddImageToPDF is turn-overed

2017-03-25 Thread Tilman Hausherr (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15941702#comment-15941702
 ] 

Tilman Hausherr commented on PDFBOX-3733:
-

{quote}
will we set resetContext as true result in other regression issue
{quote}
You didn't open any other issues. Could you please clarify? Or do you mean what 
to do with PDFs you have created in the past?

> image added to pdf by tool AddImageToPDF is turn-overed 
> 
>
> Key: PDFBOX-3733
> URL: https://issues.apache.org/jira/browse/PDFBOX-3733
> Project: PDFBox
>  Issue Type: Bug
>  Components: PDModel
>Affects Versions: 1.8.13, 2.0.5
> Environment: 1. windows 7 64 bit
> 2. jdk 1.7 
> java version "1.7.0_80"
> Java(TM) SE Runtime Environment (build 1.7.0_80-b15)
> Java HotSpot(TM) 64-Bit Server VM (build 24.80-b11, mixed mode)
>Reporter: Yachun Miao
>Assignee: Tilman Hausherr
> Fix For: 1.8.14, 2.0.6, 2.1.0
>
> Attachments: test.pdf
>
>
> I had a pdf file (see attachement test.pdf), i try pdfbox tools AddImageToPDF 
> to add any test image to first page of my test pdf. the image added in pdf 
> always turn-overed. But if i tried other pdf like scanned version, there is 
> no such problem. 
> You can reproduce it by my test.pdf with any image. 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-3733) image added to pdf by tool AddImageToPDF is turn-overed

2017-03-25 Thread Yachun Miao (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15941698#comment-15941698
 ] 

Yachun Miao commented on PDFBOX-3733:
-

Just concern, will we set resetContext as true result in other regression 
issue? because by default the varviable is false in other function like :

public PDPageContentStream(PDDocument document, PDPage sourcePage, 
PDPageContentStream.AppendMode appendContent, boolean compress) throws 
IOException {
this(document, sourcePage, appendContent, compress, false);
}

> image added to pdf by tool AddImageToPDF is turn-overed 
> 
>
> Key: PDFBOX-3733
> URL: https://issues.apache.org/jira/browse/PDFBOX-3733
> Project: PDFBox
>  Issue Type: Bug
>  Components: PDModel
>Affects Versions: 1.8.13, 2.0.5
> Environment: 1. windows 7 64 bit
> 2. jdk 1.7 
> java version "1.7.0_80"
> Java(TM) SE Runtime Environment (build 1.7.0_80-b15)
> Java HotSpot(TM) 64-Bit Server VM (build 24.80-b11, mixed mode)
>Reporter: Yachun Miao
>Assignee: Tilman Hausherr
> Fix For: 1.8.14, 2.0.6, 2.1.0
>
> Attachments: test.pdf
>
>
> I had a pdf file (see attachement test.pdf), i try pdfbox tools AddImageToPDF 
> to add any test image to first page of my test pdf. the image added in pdf 
> always turn-overed. But if i tried other pdf like scanned version, there is 
> no such problem. 
> You can reproduce it by my test.pdf with any image. 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Resolved] (PDFBOX-3733) image added to pdf by tool AddImageToPDF is turn-overed

2017-03-25 Thread Tilman Hausherr (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-3733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr resolved PDFBOX-3733.
-
Resolution: Fixed

Thanks for the feedback!

> image added to pdf by tool AddImageToPDF is turn-overed 
> 
>
> Key: PDFBOX-3733
> URL: https://issues.apache.org/jira/browse/PDFBOX-3733
> Project: PDFBox
>  Issue Type: Bug
>  Components: PDModel
>Affects Versions: 1.8.13, 2.0.5
> Environment: 1. windows 7 64 bit
> 2. jdk 1.7 
> java version "1.7.0_80"
> Java(TM) SE Runtime Environment (build 1.7.0_80-b15)
> Java HotSpot(TM) 64-Bit Server VM (build 24.80-b11, mixed mode)
>Reporter: Yachun Miao
>Assignee: Tilman Hausherr
> Fix For: 1.8.14, 2.0.6, 2.1.0
>
> Attachments: test.pdf
>
>
> I had a pdf file (see attachement test.pdf), i try pdfbox tools AddImageToPDF 
> to add any test image to first page of my test pdf. the image added in pdf 
> always turn-overed. But if i tried other pdf like scanned version, there is 
> no such problem. 
> You can reproduce it by my test.pdf with any image. 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org