Re: Issue with PDFBox 3.0.0 - Unable to Extract and Add Pages

2024-04-07 Thread Andreas Lehmkühler
The issue was fixed and is part of the current 3.0.2 version of PDFBox. Andreas Am 27.02.24 um 10:11 schrieb Tilman Hausherr: Hi, It's like Fabian said. Btw neither the code here nor the different(!) code in https://stackoverflow.com/questions/78065676/ would enable anybody to reproduce

Re: Text extraction from a certain PDF does not seem to terminate

2024-04-06 Thread Andreas Lehmkühler
Hi, Am 03.04.24 um 15:53 schrieb Brangs, Erik: Hi, when attempting text extraction from the PDF at https://d-nb.info/1324982411/34 , either using PDFBox 3.0.0 or PDFBox 4.0.0-SNAPSHOT, the extraction uses about 1,8 GB heap memory and does not seem to terminate. I cancelled the extraction

Re: Lost xref table on two PDF merge

2024-04-04 Thread Andreas Lehmkühler
Hi, which version of PDFBox are you using? Did you save the merged pdf before you try to fix the signature? The resulting pdf should have a valid xref table. Andreas Am 04.04.24 um 15:09 schrieb František Šimon: Hello, I encounter a problem when trying to fix some problem with

Re: Type 0 font - Text extraction X PDF Debugger

2024-03-25 Thread Andreas Lehmkühler
Am 25.03.24 um 10:07 schrieb Tilman Hausherr: On 25.03.2024 07:48, Andreas Lehmkühler wrote: Thanks for the URLs. All of them are working with my change. See https://issues.apache.org/jira/browse/PDFBOX-5790 for further details. @Tilman Please run your tests if possible No regressions

Re: Type 0 font - Text extraction X PDF Debugger

2024-03-25 Thread Andreas Lehmkühler
for my own tests. Tilman On 24.03.2024 16:19, Andreas Lehmkühler wrote: Am 15.03.24 um 05:35 schrieb Tilman Hausherr: You are correct that it's the "fb" parts that are missing. (And some of the other tools you tried also mention this) Just adding true results in text extraction

[ANNOUNCE] Apache PDFBox 2.0.31 released

2024-03-24 Thread Andreas Lehmkühler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 2.0.31 The release is available for download at: https://pdfbox.apache.org/download.html See the full release notes below for details about this release. Release Notes -- Apache PDFBox -- Version 2.0.31

Re: Type 0 font - Text extraction X PDF Debugger

2024-03-24 Thread Andreas Lehmkühler
Am 15.03.24 um 05:35 schrieb Tilman Hausherr: You are correct that it's the "fb" parts that are missing. (And some of the other tools you tried also mention this) Just adding true results in text extraction of several files no longer being correct, 433525-p1.pdf

[ANNOUNCE] Apache PDFBox 3.0.2 released

2024-03-14 Thread Andreas Lehmkühler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 3.0.2. The release is available for download at: https://pdfbox.apache.org/download.html See the full release notes below for details about this release. Release Notes -- Apache PDFBox -- Version 3.0.2

Re: Help with NullPointerException org.apache.io.IOUtils.LOG

2024-03-12 Thread Andreas Lehmkühler
Hi Matthew, this is a known issue with 3.0.1, see [1] for further details. The upcoming version 3.0.2 includes a fix. Unless nothing unforeseen happens, the new version will be available in about 2 days from now. Andreas [1] https://issues.apache.org/jira/browse/PDFBOX-5758 Am 12.03.24 um

Re: Importing landscape format and portrait format oriented pages into the same PDF causes PDF corruption

2024-03-02 Thread Andreas Lehmkühler
Hi, I guess I've fixed https://issues.apache.org/jira/browse/PDFBOX-5752 and the fix works for PDFBOX-5775 as well. @Fabian please give the newest SNAPSHOT build of 3.0.2 a try Andreas Am 23.02.24 um 11:43 schrieb Tilman Hausherr: On 21.02.2024 16:07, Fabian Zünd SI-Solutions Gmbh wrote:

Re: RE%3A Re%3A [External Sender] Re%3A PDFBox 3.0.1 compile dependency on junit-jupiterIn-Reply-To=<9f543108-ef5c-4c7a-bac8-d7c6009d9d5f%40gmail.com>

2024-01-10 Thread Andreas Lehmkühler
look forward to seeing this in the next release! --Dan From: Andreas Lehmkühler Date: Sunday, December 3, 2023 at 1:58 PM To:users@pdfbox.apache.org Subject: [External Sender] Re: PDFBox 3.0.1 compile dependency on junit-jupiter solved, see [1] for further details. Andreas [1]https

Re: Text extraction from a certain PDF uses up multiple GB of memory

2023-12-14 Thread Andreas Lehmkühler
Looks like https://issues.apache.org/jira/browse/PDFBOX-5479 Am 13.12.23 um 14:50 schrieb Tilman Hausherr: On 13.12.2023 11:23, Brangs, Erik wrote: Hi, we ran into problems when doing text extraction from the PDF athttps://d-nb.info/1312454512/34 . We were using PDFBox 3.0.0 to extract the

Re: PDFBox 3.0.1 compile dependency on junit-jupiter

2023-12-03 Thread Andreas Lehmkühler
solved, see [1] for further details. Andreas [1] https://issues.apache.org/jira/browse/PDFBOX-5722 Am 02.12.23 um 09:05 schrieb Andreas Lehmkühler: Hi, Am 01.12.23 um 17:14 schrieb Dan Rabe: It looks like a compile dependency on junit-jupiter snuck into the 3.0.1 release. If I look

Re: PDFBox 3.0.1 compile dependency on junit-jupiter

2023-12-02 Thread Andreas Lehmkühler
Hi, Am 01.12.23 um 17:14 schrieb Dan Rabe: It looks like a compile dependency on junit-jupiter snuck into the 3.0.1 release. If I look at the maven page for 3.0.0 at https://mvnrepository.com/artifact/org.apache.pdfbox/pdfbox/3.0.0, junit-jupiter is listed as a test dependency. If I look at

[ANNOUNCE] Apache PDFBox 3.0.1 released

2023-11-30 Thread Andreas Lehmkühler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 3.0.1. The release is available for download at: https://pdfbox.apache.org/download.html See the full release notes below for details about this release. Release Notes -- Apache PDFBox -- Version 3.0.1

Re: Odd OCG error

2023-11-21 Thread Andreas Lehmkühler
Am 21.11.23 um 21:26 schrieb John Lussmyer: Ugh, formatting mess. For more info, this is the "addOCGs:OCG" log line just before the error message: 10:53:09.765 [etrix SwingWorker[0]] DEBUG ImposedPDFEngine - addOCGs: OCG

[ANNOUNCE] Apache PDFBox 2.0.30 released

2023-11-05 Thread Andreas Lehmkühler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 2.0.30. The release is available for download at: https://pdfbox.apache.org/download.html See the full release notes below for details about this release. Release Notes -- Apache PDFBox -- Version 2.0.30

Re: PII data

2023-10-16 Thread Andreas Lehmkühler
PDFBox doesn't send any information anywhere. Everything is done locally on your machine. Am 16.10.23 um 23:14 schrieb Ward Dixon: Hello, does anyone know if PDF Box sends any information outside of my network from the PDF it is creating? I'm concerned about Personal Identifiable Information

Re: empty/missing pdf content

2023-10-16 Thread Andreas Lehmkühler
Am 16.10.23 um 23:43 schrieb Pados Attila: I fixed the issue with missing input pdf file, and also re-run this test project with the most fresh 3.0.1-SNAPSHOT version. (oct 5th) So far, the character distortion remains, That isn't the most recent version. The ticket was created on Oct 7th

Re: Looking for a Debugger that can show which incremental save an object belongs to

2023-10-07 Thread Andreas Lehmkühler
Am 07.10.23 um 06:43 schrieb John Lussmyer: I doubt there is a way. It's most likely that the signing code makes a MD5 checksum (or similar) of the file when it is signed. If the file is changed, checking the signing will re-calculate the checksum and find that it is different.  There isn't

Re: how to replace MemoryUsageSetting.setupMixed(100mb) ?

2023-10-07 Thread Andreas Lehmkühler
Am 06.10.23 um 00:07 schrieb Pados Attila: I am using something like this: PDDocument a1doc = Loader.loadPDF(new RandomAccessReadBuffer(resourceAsStream), () -> new ScratchFile(MemoryUsageSetting.setupMixed(100))); (I use it with tempFileOnly, but the rest are the same) Be aware that all

Re: RandomAccessReadBuffer performance issues with inputStreams in 3.0

2023-09-17 Thread Andreas Lehmkühler
Am 28.08.23 um 13:30 schrieb bnncdv: When migrating from 2.0 to 3.0 I noticed some operations were very slow, mainly the Splitter tool. With a big-ish file it would take *a lot* more memory/cpu (jdk8). What exactly are you doing? I've tried to reproduce the issue and I've bee succesful with

[ANNOUNCE] Apache PDFBox 1.8.x End-Of-Life (EOL) Announcement

2023-08-19 Thread Andreas Lehmkühler
The Apache PDFBox Team would like to inform you that PDFBox 1.8.17 is the last release of the 1.8 branch, which has reached its end of life and won't be longer officially supported. The current community mainly maintains the 2.0.x branch and the brand new 3.0.x branch. We recommend everyone

Re: [ANNOUNCE] Apache PDFBox 3.0.0 released

2023-08-19 Thread Andreas Lehmkühler
Hi, @Erik thanks for the report but I guess there is a misunderstanding, see inline Am 18.08.23 um 11:32 schrieb Brangs, Erik: Hi, -Ursprüngliche Nachricht- Von: Andreas Lehmkühler [mailto:andr...@lehmi.de.INVALID] Gesendet: Freitag, 18. August 2023 07:42 An: users

[ANNOUNCE] Apache PDFBox 3.0.0 released

2023-08-17 Thread Andreas Lehmkühler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox 3.0.0. It is available for download at: https://pdfbox.apache.org/download.html The Apache PDFBox library is an open source Java tool for working with PDF documents. This is the new major release 3.0.0 of

Re: Border / Box around images and form elements with backgrounds

2023-08-06 Thread Andreas Lehmkühler
Please provide the source pdf you used for rendering as well. Thanks in advance Andreas Am 01.08.23 um 22:30 schrieb JJ Blodgett: It looks like the attachments were stripped out of the email. I'll try to include Google doc links and hope these work: Example of bad behavior:

Re: TextToPDF function removes the first char since 2.0.28

2023-07-27 Thread Andreas Lehmkühler
I've ran your shell script and got the same result, the first char is missing in the pdf. It seems to be related to the way you are calling TextToPDF. You are simply print the text to the console and redirect it to TextToPDF. I've changed that and echoed the text to a file and used that file

Re: TextToPDF function removes the first char since 2.0.28

2023-07-25 Thread Andreas Lehmkühler
Hi, the question is, where does the char got lost, when creating the pdf or when extracting the text? Did you check the created pdf? Does it contain the whole text? Andreas Am 25.07.23 um 07:52 schrieb michael.a...@universa.de: Hi, the TextToPDF function worked without problems from

[ANNOUNCE] Apache PDFBox 3.0.0-beta1 released

2023-07-14 Thread Andreas Lehmkühler
The Apache PDFBox community is pleased to announce the release of the first beta release for Apache PDFBox 3.0.0. It is available for download at: https://pdfbox.apache.org/download.html The Apache PDFBox library is an open source Java tool for working with PDF documents. This is the first

[ANNOUNCE] Apache PDFBox 2.0.29 released

2023-07-01 Thread Andreas Lehmkühler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 2.0.29. The release is available for download at: https://pdfbox.apache.org/download.html See the full release notes below for details about this release. Release Notes -- Apache PDFBox -- Version 2.0.29

Re: When will the next version from the 3.x line be available?

2023-06-27 Thread Andreas Lehmkühler
Hi, Am 27.06.23 um 15:10 schrieb Brangs, Erik: Hi, version 2.0.28 of PDFBox was released recently. Will there also be a new version from the 3.x line in the near future? First of all there will be another 2.0 release, hopefully tomorrow Andreas Lehmkühler mentioned a possible beta1

Fwd: Apache in 2018 - By The Digits

2019-01-01 Thread Andreas Lehmkühler
Hi, Sally prepared some digits for 2018 and I was surprised to see one of our fellow PDFBox committers among the Top 5 committers as we are a small community compared to other ASF projects. Thanks Tilman for your ongoing efforts to improve PDFBox in the last year, the time before that and

Re: Regarding retrieving COSName.getPDFName(PreflightConstants.DICTIONARY_KEY_LINEARIZED

2017-07-25 Thread Andreas Lehmkühler
> karthick g hat am 25. Juli 2017 um 10:34 > geschrieben: > > > Hi team, > > Based on the analysis I have found one thing regarding Linearized PDF in > 2.0 and above versions of PDFBox. > > COSDocument cDoc = pdDoc.getDocument(); > List lObj = cDoc.getObjects(); >

Re: PDFBox JPEG2000 and Tomcat

2017-07-25 Thread Andreas Lehmkühler
> Chris Gamache hat am 25. Juli 2017 um 03:10 geschrieben: > > > I also recall one thread on SO where the developer had kept the scope on the > imageio jars set to `test` as it is in PDFbox's pom. I wish it were a > contributing factor here because it is an easy fix. > >

Re: AW: Splitter.createNewDocument() always uses main memory only - this leads to out of memory when splitting large documents

2017-07-14 Thread Andreas Lehmkühler
You are looking at the wrong place. pdfbox-app is just a meta project to create a convience binary of all relevant subprojects. It doesn't contain any source code. The source code you are looking for is here:

Re: UTF16 encoded string to PDFDocEncoding

2017-07-11 Thread Andreas Lehmkühler
> Andreas Lehmkühler <andr...@lehmi.de> hat am 11. Juli 2017 um 12:17 > geschrieben: > > > > > Andrea Vacondio <andrea.vacon...@gmail.com> hat am 10. Juli 2017 um 19:22 > > geschrieben: > > > > > > Hi, we came across this ca

Re: catch(IOException | COSVisitorException e)

2017-06-26 Thread Andreas Lehmkühler
> Steve Carr hat am 26. Juni 2017 um 11:41 > geschrieben: > > > import java.io.IOException; > import org.apache.pdfbox.exceptions.COSVisitorException; > import org.apache.pdfbox.pdmodel.PDDocument; > import org.apache.pdfbox.pdmodel.PDPage; > > /** > * >

Re: PDPageContentStream#close() vs PDDocument#close()

2017-06-22 Thread Andreas Lehmkühler
> Thad Humphries hat am 21. Juni 2017 um 23:30 > geschrieben: > > > Is it necessary to call PDDocument#close() after calling > PDPageContentStream#close()? Does the answer apply all cases or only > certain cases? If the latter, what certain cases? > > For example,

Re: Help identifying hair-lines in PDFs using PDFBox and tabula

2017-05-23 Thread Andreas Lehmkühler
> Gilad Denneboom hat am 22. Mai 2017 um 22:07 > geschrieben: > > > Hi all, > > So I'm trying to identify hair-lines in my PDFs. I came across tabula, > which seems to be able to do it, but I can't get it to quite work with my > files in the way I need it to, so any

Re: Linearized dictionary

2017-05-22 Thread Andreas Lehmkühler
> karthick g hat am 22. Mai 2017 um 06:17 geschrieben: > > > Hi team, > > Here is the code, I am using COSName.getPDFName("Linearized). The problem > is > > PDDocument pdDoc = PDDocument.load(new File("")); > COSDocument cosDoc = pdDoc.getDocument(); > List lObj =

RE: creating fillable forms, possibly in/from existing PDF file?

2017-05-18 Thread Andreas Lehmkühler
> Gary Grosso hat am 18. Mai 2017 um 05:09 > geschrieben: > > > Thanks for your reply, Tilman. > > I see PDFBox allows for text field/area (single or multi-line), list box, > combo box, check box, push button, and radio button. > > Would it be reasonable to say

Re: OTFParser how to

2017-04-24 Thread Andreas Lehmkühler
> clifford hat am 19. April 2017 um 18:12 > geschrieben: > > > When doing.. > java.io.FileInputStream fis = new java.io.FileInputStream(file1); > OTFParser p = new OTFParser(); > OpenTypeFont otf = p.parse(fis); > > and otf.isPostScript() is true how do I

Re: converting hex to PDColor

2017-03-13 Thread Andreas Lehmkühler
> chitgoks hat am 13. März 2017 um 11:27 geschrieben: > > > hi again > > a little assistance regarding converting hex to PDColor. > > please take this example #ff8000 > > and this is my code > > String colorStr = "#ff8000"; > java.awt.Color rgb = new java.awt.Color( >

[ANNOUNCE] Apache PDFBox 2.0.2 released

2016-06-09 Thread Andreas Lehmkühler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 2.0.2. The release is available for download at: http://pdfbox.apache.org/download.cgi See the full release notes below for details about this release. Release Notes -- Apache PDFBox -- Version 2.0.2

Re: Extracting ".pam" image files

2016-06-07 Thread Andreas Lehmkühler
> "OYEBISI, Daniel" hat am 7. Juni 2016 um 10:41 > geschrieben: > > > Hello, > > I have a PDF document containing images of the format type ".pam". I have > checked the API doc but I haven't seen anything related to ".pam" files. > Please can anyone guide me on how to do

Re: Numbers get reversed sometimes during conversion

2016-06-02 Thread Andreas Lehmkühler
> Shyam Sundar hat am 2. Juni 2016 um 09:18 > geschrieben: > > > Hi, > > Wondering if you got a chance to check this ... First thing to be done in such cases is to do the "Adobe Reader test". It fails, the text can't be extracted using Acrobat Reader, so we are better

Re: Numbers get reversed sometimes during conversion

2016-05-31 Thread Andreas Lehmkühler
Hi, > Shyam Sundar hat am 31. Mai 2016 um 12:00 > geschrieben: > > > Hi, > > I have come across an issue wherein while trying to covert PDFs (mainly of > RTL languages) into TXT, the numbers get reversed. > > Please check the attached file, '2005' in heading has

Re: PDFBox*.tmp files not deleted by PDFParser

2016-05-25 Thread Andreas Lehmkühler
> Damien Butaye hat am 25. Mai 2016 um 10:01 > geschrieben: > > > Hello Tilman, > > Yes I did it. I verified in debug mode and this method (close() on > SignatureOption) is well reached but the close() method of the object > "RandomAccessBufferedFileInputStream" is

New mail archives interface

2016-05-19 Thread Andreas Lehmkühler
Hi, at the recent ApacheCon NA conference, the new mail archives interface was unveiled. See https://lists.apache.org/ BR Andreas - To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org For additional commands, e-mail:

Re: PdfParser giving garbage character

2016-05-13 Thread Andreas Lehmkühler
> Mohit Goyal hat am 13. Mai 2016 um 08:28 geschrieben: > > > Hi, > > I have one pdf which has data in Malyalam(Indian Language). I tried to parse > this data using apache Tika I got garbage character '?' in output. > > > I checked Pdf using pdffont utility seems like

[ANNOUNCE] Apache PDFBox 2.0.1 released

2016-04-26 Thread Andreas Lehmkühler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 2.0.1. The release is available for download at: http://pdfbox.apache.org/download.cgi See the full release notes below for details about this release. Release Notes -- Apache PDFBox -- Version 2.0.1

[ANNOUNCE] Apache PDFBox 1.8.12 released

2016-04-26 Thread Andreas Lehmkühler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 1.8.12. The release is available for download at: http://pdfbox.apache.org/download.cgi See the full release notes below for details about this release. Release Notes -- Apache PDFBox -- Version 1.8.12

Re: Cannot comment on Jira issues anymore

2016-04-22 Thread Andreas Lehmkühler
Hi, > alexander.kriegi...@extern.sdv-it.de hat am 22. April 2016 um 09:50 > geschrieben: > > > Sorry to bother everyone here on the mailing list, but something seems to > be wrong in Jira: I cannot comment on > https://issues.apache.org/jira/browse/PDFBOX-3323 and other issues > anymore, the

Re: pdfbox-android

2016-04-20 Thread Andreas Lehmkühler
> Paul Mitchell hat am 20. April 2016 um 10:47 > geschrieben: > > > Hi > > I’m not sure if I’ve come to the right spot for my question. Hopefully you can > help me or direct me to someone who can help me > > I’m currently using pdfbox-android with android studio >

Fwd: The Apache® Software Foundation announces Apache PDFBox™ v2.0

2016-03-21 Thread Andreas Lehmkühler
rtner Organization of the PDF Association. "PDF is a very popular and easy to use format for document exchange. It is used by millions of people every day, however the format itself is quite complicated and a real challenge to write a piece of software to work with it," said Andreas Lehmkühl

Re: Spaces are ignored when reading a PDF file

2016-03-19 Thread Andreas Lehmkühler
Hi, > Frank van der Hulst hat am 17. März 2016 um 08:34 > geschrieben: > > > Spaces don't exist as characters in PDFs. To identify spaces, you have to > compare the X coordinates of adjacent characters against their widths. That's not correct, spaces exist but in most

Re: Spaces are ignored when reading a PDF file

2016-03-18 Thread Andreas Lehmkühler
> "Hesham G." hat am 17. März 2016 um 11:20 > geschrieben: > > > Andreas, > > That is very helpful. > > I can get the x location of each character using TextPosition.getX(), ex: > W: 102.88399 > i: 114.18165 > t: 117.660614 > h: 121.55801 > d: 133.09477 > u: 140.3994 >

Re: PrintTextLocations 1.8 vs 2.0

2016-03-16 Thread Andreas Lehmkühler
Hi, > Peter Prusinowski hat am 16. März 2016 um 09:52 > geschrieben: > > > Good morning, > > thank you for the hints, now I am overwriting showGlyph() and trying to > get the value with > > PDSimpleFont sf = (PDSimpleFont) font; > String name =

Re: Fields and "]" + Checkboxes

2016-03-09 Thread Andreas Lehmkühler
Hi, > Al Grant hat am 8. März 2016 um 18:57 geschrieben: > > > Morning All, > > I have been writing some Java with PDFBox for a few weeks now. Its been > very good so far. > > My goal is to loop through all the fields in a form, grab the values and > write the value to a

Re: memory consumption PDFBox 2.0.0

2016-03-01 Thread Andreas Lehmkühler
Hi, > Felix Benz-Baldas hat am 1. März 2016 um 12:35 > geschrieben: > > > Hello, > > we plan to use PDFBox 2.0.0 for converting PDFs to JPEG. We want to convert a > very large number of documents (more than one million). > > One question: Is it possible to control

Re: Rotating a new annotation to match the page's rotation

2016-02-24 Thread Andreas Lehmkühler
Hi, > Gilad Denneboom hat am 24. Februar 2016 um 09:34 > geschrieben: > > > No one has any ideas? ... > > On Sun, Feb 21, 2016 at 12:30 AM, Gilad Denneboom > wrote: > > > Hi all, > > > > Hoping someone can help me with this issue... > >

[ANNOUNCE] Apache PDFBox 2.0.0 RC3 released

2016-01-15 Thread Andreas Lehmkühler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 2.0.0 RC3. The release is available for download at: http://pdfbox.apache.org/download.cgi The numerous feedback on our second release candidate helps us to make this release candidate better again, e. g.

Re: Shell Can't Find pdfbox

2015-11-03 Thread Andreas Lehmkühler
Hi, > Jonathan Levi hat am 3. November 2015 um 03:25 geschrieben: > > > I'm finding that shell commands to use pdfbox-app-1.8.10.jar won't work unless > the full path is used. Example: > > drj-air:Desktop jonathan$ ls /usr/local/bin/pdfb* >

Re: Failure to close files on parse error

2015-11-02 Thread Andreas Lehmkühler
Hi, > Jesse Long hat am 2. November 2015 um 12:26 > geschrieben: > > > Hi All, > > The changes to PDDocument in eb83a299bbe39c2e59735aca2b39bca312c1ddc4 > were insufficient, please include attached patch. Please provide a JIRA ticket number or a svn revision as a

Re: Anyone know how to set up a bouncycastle?

2015-09-24 Thread Andreas Lehmkühler
Hi, > Eric Douglas hat am 18. September 2015 um 16:53 > geschrieben: > > > I'm trying to read a PDF using pdfbox, and on one system I get this error: > > cannot create instance of > org.bouncycastle.jcajce.provider.digest.GOST3411$Mappings > :

Re: Something weird with PDFMergerUtility?

2015-08-14 Thread Andreas Lehmkühler
Magnus Evensberget magnus.evensber...@gmail.com hat am 14. August 2015 um 10:21 geschrieben: Rolled back to the commit c343a3f and then it works. We are working with svn and I guess you are referring to the PDFBox github mirror, aren't you? That git commit refers to rev 1693855. BR

Re: Last commit in SVN HEAD broke the PDFWriter

2015-07-27 Thread Andreas Lehmkühler
Andreas Lehmkühler andr...@lehmi.de hat am 27. Juli 2015 um 09:37 geschrieben: Hi Roberto, Roberto Nibali rnib...@gmail.com hat am 27. Juli 2015 um 09:28 geschrieben: Dear developers The last commit 1692730 by lehmi, 18:36, broke the PDFWriter. The result is: The file

Re: Last commit in SVN HEAD broke the PDFWriter

2015-07-27 Thread Andreas Lehmkühler
Hi Roberto, Roberto Nibali rnib...@gmail.com hat am 27. Juli 2015 um 09:28 geschrieben: Dear developers The last commit 1692730 by lehmi, 18:36, broke the PDFWriter. The result is: The file xx.pdf cannot be open; It may be damaged or use a file format that Preview doesn’t recognize.

Re: Last commit in SVN HEAD broke the PDFWriter

2015-07-27 Thread Andreas Lehmkühler
Roberto Nibali rnib...@gmail.com hat am 27. Juli 2015 um 10:29 geschrieben: Hi Andreas Thanks for the quick reply. On Mon, Jul 27, 2015 at 9:55 AM, Andreas Lehmkühler andr...@lehmi.de wrote: Andreas Lehmkühler andr...@lehmi.de hat am 27. Juli 2015 um 09:37 geschrieben

Re: How to configure Maven POM to include latest SNAPSHOT of PDFbox

2015-07-07 Thread Andreas Lehmkühler
Hi, Roberto Nibali rnib...@gmail.com hat am 7. Juli 2015 um 11:43 geschrieben: Hi How do I properly set the dependencies in my Maven POM, so I can use the latest SNAPSHOT of pdfbox? I tried the following (https://pdfbox.apache.org/2.0/getting-started.html), which does not work at

Re: PDFRenderer, PDDocument memory issue

2015-07-02 Thread Andreas Lehmkühler
John Hewson j...@jahewson.com hat am 2. Juli 2015 um 06:10 geschrieben: On 1 Jul 2015, at 07:52, Tilman Hausherr thaush...@t-online.de wrote: Am 01.07.2015 um 10:16 schrieb Alex Sviridov: In my application I have real time memory graphs and they show that memory is very fast

Re: Re[8]: PDFRenderer, PDDocument memory issue

2015-07-01 Thread Andreas Lehmkühler
2015, 13:54 +02:00 от Andreas Lehmkühler andr...@lehmi.de: Alex Sviridov ooo_satu...@mail.ru hat am 1. Juli 2015 um 13:38 geschrieben: The file is here https://yadi.sk/i/Y0fTuvHmhbZiE Ah, that explains a lot. The pdf is a scanned document, every page holds a color image, consuming

Re: Re[6]: PDFRenderer, PDDocument memory issue

2015-07-01 Thread Andreas Lehmkühler
(FlateFilter.java:74)     ... 20 more How to solve these problems? PDFBox isn't supposed to be thread safe. Среда, 1 июля 2015, 13:17 +02:00 от Andreas Lehmkühler andr...@lehmi.de: Alex Sviridov ooo_satu...@mail.ru hat am 1. Juli 2015 um 13:09 geschrieben: I decided to show

Re: Re[10]: PDFRenderer, PDDocument memory issue

2015-07-01 Thread Andreas Lehmkühler
is more or less complex. Среда, 1 июля 2015, 14:08 +02:00 от Andreas Lehmkühler andr...@lehmi.de: Alex Sviridov ooo_satu...@mail.ru hat am 1. Juli 2015 um 13:59 geschrieben: Ok. Thank you very much for explanation. Could you say where this scratch file is located linux/windows

Re: PDFRenderer, PDDocument memory issue

2015-07-01 Thread Andreas Lehmkühler
Alex Sviridov ooo_satu...@mail.ru hat am 1. Juli 2015 um 10:16 geschrieben: I want to display all page thumbnails. However I came across memory size problem with PDFRenderer or PDDocument - I don't know which one. I have the following code:        private PDDocument

Re: Re[4]: PDFRenderer, PDDocument memory issue

2015-07-01 Thread Andreas Lehmkühler
so, simply use another load method, e.g. load(File file, boolean useScratchFiles) Среда, 1 июля 2015, 13:04 +02:00 от Andreas Lehmkühler andr...@lehmi.de: Alex Sviridov ooo_satu...@mail.ru hat am 1. Juli 2015 um 12:58 geschrieben: Thank you for answer. I tried

Re: Re[2]: PDFRenderer, PDDocument memory issue

2015-07-01 Thread Andreas Lehmkühler
, help me. Maybe, but I'm not sure at all. Try to use the scratch file. Среда, 1 июля 2015, 12:48 +02:00 от Andreas Lehmkühler andr...@lehmi.de: Alex Sviridov ooo_satu...@mail.ru hat am 1. Juli 2015 um 10:16 geschrieben: I want to display all page thumbnails. However I came across

Re: Scratch files - too many files open

2015-06-05 Thread Andreas Lehmkühler
Hi, Jesse Long jesse.long...@gmail.com hat am 3. Juni 2015 um 13:20 geschrieben: On 03/06/2015 12:46, Andreas Lehmkühler wrote: Hi, Jesse Long jesse.long...@gmail.com hat am 3. Juni 2015 um 08:45 geschrieben: On 02/06/2015 17:48, Andreas Lehmkuehler wrote: Hi, Am

Re: IllegalArgumentException when using PDType1Font.HELVETICA

2015-05-27 Thread Andreas Lehmkühler
those, as you have to manage line breaks yourself. Thanks :) Johanneke Lamberink BR Andreas Lehmkühler - To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org For additional commands, e-mail: users-h

Re: How to flatedecode and find all acroform fields in a compressed PDF

2015-05-22 Thread Andreas Lehmkühler
time and pointers. Maybe it's worth to try to share the file visually with us. Open both files (compressed and decompressed) with PDFDebugger [1] and post a screenshot of both somehwere (dropbox etc.) and share the link with us. Maybe that could shed some light on your issue. BR Andreas Lehmkühler

Re: java source in PDFBox snapshot jars?

2015-04-22 Thread Andreas Lehmkühler
small compared to other projects, so that infra most likely won't any headache if we put those source jars into the repo as well. I'll try to find out what we have to do to publish the sources as well, see PDFBOX-2770 BR Andreas Lehmkühler BR Thomas Zitat von Andrew Munn and...@nmedia.net

Re: Blank page rendered with wrong xref start objid (batch 1.8)

2015-03-26 Thread Andreas Lehmkühler
are welcome :-) BR Andreas Lehmkühler thank you Jerome - To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org For additional commands, e-mail: users-h...@pdfbox.apache.org

Re: Text removal

2015-03-24 Thread Andreas Lehmkühler
running out of time. I'd so much appreciate any help or any idea on what's going on. Notes: 1. I use use PDFBox 1.7.1 1.7.1 is more than 2 years old (released in july 2012). I strongly recommend to use a more recent version, such as 1.8.8 BR Andreas Lehmkühler 2. This problem does not occur

Re: Problem building the project with Eclipse and m2e

2015-03-17 Thread Andreas Lehmkühler
the alternative? :-{ I don't know any. Best Martin BR Andreas Lehmkühler - To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org For additional commands, e-mail: users-h...@pdfbox.apache.org

Re: Question about PDDocument.setVersion

2015-03-04 Thread Andreas Lehmkühler
(Float.toString(newVersion)); } getDocument().setVersion(newVersion); Am I missing something? You're right there is some room for improvements. I've already reopened the related ticket PDFBOX-2099 Thanks for the pointer BR Andreas Lehmkühler

Re: PDFBox 2.0.0 and UTF8 chars

2015-03-02 Thread Andreas Lehmkühler
is an issue with that specific font. Either a malformed font or a fontbox issue. BR Andreas Lehmkühler Tilman Am 01.03.2015 um 18:53 schrieb Andreas Lehmkuehler: Hi, Am 28.02.2015 um 11:52 schrieb Ivan Klaric: Hello good PDFBox people, I am working on a pet project with PDFBox and I

Re: [PDFBOX-2.0] PDF Size after Signature

2015-02-27 Thread Andreas Lehmkühler
. Saying that, r1659998 could introduce a regression. I had a quick look and maybe the changes made to COSWriter are the root cause. But we have to debug that first to be sure. BR Andreas Lehmkühler BR Maruan Am 27.02.2015 um 08:45 schrieb Andreas Lehmkühler andr...@lehmi.de: Hi, Tilman

Re: [PDFBOX-2.0] PDF Size after Signature

2015-02-26 Thread Andreas Lehmkühler
, all objects are there ?! Correct, something went wrong when appending the changed objects only. It work for me when I fixed the encryption stuff. I seems as if some recent change introduced this regression. @Isaias Which exact version/revision of the trunk are you using? BR Andreas Lehmkühler

Re: https://issues.apache.org/jira/browse/PDFBOX-2523 still present (or variation of it still present)

2015-02-26 Thread Andreas Lehmkühler
comments. Is there any chance to get a hand on a sample pdf? I would be good enough to send it via private mail to me: BR Andreas Lehmkühler Thanks- Steve From: Andreas Lehmkühler andr...@lehmi.de Sent: Tuesday, February 24, 2015 3:30 AM To: users

Re: https://issues.apache.org/jira/browse/PDFBOX-2523 still present (or variation of it still present)

2015-02-23 Thread Andreas Lehmkühler
Hi, I've improved the self repair mechnism of the trunk based on Steves report. @Steve Please give the newest trunk version/SNAPSHOT a try. Does the issue still persist? BR Andreas Lehmkühler Steve Antoch sant...@yuzu.com hat am 17. Februar 2015 um 00:05 geschrieben: Andreas- Thanks

Re: How to attach files to messages sent to users@pdfbox.apache.org?

2015-02-17 Thread Andreas Lehmkühler
Hi Alan, (most kind of) attachments are not allowed. Either attach the file(s) to the related JIRA ticket or provide it using a sharehoster/public webspace/etc. BR Andreas Lehmkühler Alan Masters amast...@nhbc.co.uk hat am 17. Februar 2015 um 11:10 geschrieben: Please could someone help

Re: https://issues.apache.org/jira/browse/PDFBOX-2523 still present (or variation of it still present)

2015-02-16 Thread Andreas Lehmkühler
Andreas Lehmkühler - To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org For additional commands, e-mail: users-h...@pdfbox.apache.org

Re: [PDFBOX-2.0] Signature Issue

2015-02-04 Thread Andreas Lehmkühler
to a public place or send it to me directly. BR Andreas Lehmkühler Best regards On Mon, Feb 2, 2015 at 6:13 PM, Isaias Barroso isaias.barr...@gmail.com wrote: Thank you, After test I'll give a feedback. BR On Mon, Feb 2, 2015 at 6:05 PM, Andreas Lehmkuehler andr...@lehmi.de

Re: [PdfBox 2.0] Page rendered as a blank image

2015-02-03 Thread Andreas Lehmkühler
- To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org For additional commands, e-mail: users-h...@pdfbox.apache.org BR Andreas Lehmkühler - To unsubscribe, e-mail: users

Re: [PDFBOX-2.0] Signature Issue

2015-01-29 Thread Andreas Lehmkühler
at the provided testcase [1], which demonstrates all necessary steps to sign a pdf. Best regards -- Isaías Barroso Belo Horizonte - MG BR Andreas Lehmkühler [1] http://svn.apache.org/viewvc/pdfbox/trunk/examples/src/test/java/org/apache/pdfbox/examples/pdmodel/TestCreateSignature.java?view

Re: Aw: Re: Type1Glyph2D No glyph for 41 (.notdef) in font Helvetica

2015-01-29 Thread Andreas Lehmkühler
, IMO you have to think about your font handling, especially as you're creating the pdfs yourself. It's always a bad idea not to include the used fonts, as the used reader has to map the missing fonts somehow and such a replacement may lead to not that perfect renderings. BR Andreas Lehmkühler

Re: Error on PDDocument.load

2015-01-21 Thread Andreas Lehmkühler
, the issue has to be something else BR Andreas Lehmkühler Thanks for your help and for all your work. Kevin On 21/01/2015 11:54, Maruan Sahyoun wrote: Hi Kevin works for me - what's your Java Version? BR Maruan Am 21.01.2015 um 11:24 schrieb Kevin Morin mo

Re: unsubscribe [SEC=UNOFFICIAL]

2015-01-12 Thread Andreas Lehmkühler
Hi James, to unsubscribe you have to write an email to users-subscr...@pdfbox.apache.org. See [1] for further details. BR Andreas Lehmkühler [1] http://pdfbox.apache.org/mailinglists.html Weatherly, James james.weathe...@humanservices.gov.au hat am 12. Januar 2015 um 00:58 geschrieben

Re: Content of pdf moved around

2015-01-11 Thread Andreas Lehmkühler
Hi Ray, to unsubscribe you have to write an email to users-subscr...@pdfbox.apache.org. See [1] for fruther details. BR Andreas Lehmkühler [1] http://pdfbox.apache.org/mailinglists.html Ray Morris ray.morris.brisb...@bigpond.com hat am 10. Januar 2015 um 22:48 geschrieben: Please

Re: What all is in the standalone JAR?

2015-01-07 Thread Andreas Lehmkühler
dependencies are defined including the version. Thanks, Thib BR Andreas Lehmkühler

  1   2   >