Re: PII data

2023-10-16 Thread Andreas Lehmkühler
PDFBox doesn't send any information anywhere. Everything is done locally 
on your machine.


On 16.10.23 at 23:14, Ward Dixon wrote:

Hello, does anyone know if PDFBox sends any information outside of my network 
from the PDF it is creating? I'm concerned about Personally Identifiable 
Information (PII) being inadvertently sent outside of my organization.




---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: users-h...@pdfbox.apache.org



Re: empty/missing pdf content

2023-10-16 Thread Andreas Lehmkühler




On 16.10.23 at 23:43, Pados Attila wrote:
> I fixed the issue with the missing input PDF file, and also re-ran this test
> project with the latest 3.0.1-SNAPSHOT version (Oct 5th).
>
> So far, the character distortion remains,

That isn't the most recent version. The ticket was created on Oct 7th
and the last change was committed on Oct 13th. Please retry with a more
recent version.





> [image: image.png]
>
> The original text to print is site-1-1. I added this on purpose, as it was
> failing in the real application too.
> Last time I tried to reproduce the "missing content" problems, and that
> didn't work, so I don't have any test code that reproduces it
> (yet).



On Sat, Oct 14, 2023 at 5:04 AM Tilman Hausherr wrote:


Hi,

That has now been fixed as well so you don't need that call if you're
using the snapshot.

Another small thing I noticed, which didn't play a role but is weird and
you should fix it: you didn't close the content stream before
flattening, you did so AFTER. This may or may not bring weird effects.

Tilman
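
A note on the ordering advice above: the writer should be finished before anything else consumes its result. The sketch below is only an analogy built on java.io. It uses no PDFBox classes and makes no claim about how PDPageContentStream buffers internally; the class and method names are made up for illustration. It shows that text written through a buffered writer is not yet in the sink while the writer is still open, and that try-with-resources guarantees the close happens first.

```java
import java.io.BufferedWriter;
import java.io.IOException;
import java.io.StringWriter;
import java.io.UncheckedIOException;

// Hypothetical stand-in, not PDFBox code: a StringWriter plays the role of
// the page, a BufferedWriter plays the role of the content stream.
class CloseBeforeUse {

    // Correct order: try-with-resources closes the writer before the sink is read.
    static String closeThenRead(String text) {
        StringWriter sink = new StringWriter();
        try (BufferedWriter writer = new BufferedWriter(sink)) {
            writer.write(text);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return sink.toString();
    }

    // Wrong order: the sink is read while the writer is still open, so a
    // short text (smaller than the writer's buffer) has not reached it yet.
    static String readThenClose(String text) {
        StringWriter sink = new StringWriter();
        try {
            BufferedWriter writer = new BufferedWriter(sink);
            writer.write(text);
            String seenTooEarly = sink.toString();
            writer.close();
            return seenTooEarly;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

In PDFBox terms, the same shape would be a try-with-resources block around the PDPageContentStream, with the flatten() call placed after that block ends.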

On 12.10.2023 05:03, Tilman Hausherr wrote:
 > Hi,
 >
 > That one has been solved by now but it turns out I had discovered a
 > different bug than the one you have. Yours is similar to
 > PDFBOX-5489.
 > Please call
 >
 > targetDoc.getDocument().setHighestXRefObjectNumber(sourceDoc.getDocument().getHighestXRefObjectNumber());
 >
 > and then it works.
 >
 > @Andreas should the call that is in importPage() also be added to
 > addPage() ?
 >
 > Tilman
 >
 > On 07.10.2023 12:44, Tilman Hausherr wrote:
 >> I was able to reduce your test even further, and created an issue in
 >> JIRA:
 >> https://issues.apache.org/jira/browse/PDFBOX-5696
 >>
 >> Tilman
 >>
 >>
 >> On 07.10.2023 11:24, Tilman Hausherr wrote:
 >>> The file "/pdf/Template.pdf" is missing in both projects.
 >>>
 >>> So it produces only one file. There is a difference, 29 has the
 >>> Poppins_Semibold font embedded and the newer one doesn't.
 >>>
 >>>
 >>> Root/Pages/Kids/[0]/Resources/XObject/Form4/Resources/Font/Poppins-SemiBold
 >>>
 >>>
 >>> Tilman
 >>>
 >>> On 03.10.2023 21:47, Pados Attila wrote:
  Hi, here is the repository with test/reproduce code:
  https://github.com/padisah/pdfboxtests

  Here I am reproducing a character displacement problem: text that
  includes a '-' sign is shifted out of position.
  There will be more cases added, with missing content.

  On Tue, Sep 26, 2023 at 3:04 PM Pados Attila
  <attila.pa...@gmail.com> wrote:
 
 > Hi, so far the team delayed swapping pdfbox version, so I can only
 > work on this on my own.
 >
 > I will make a simple command line application, or a unit test, that
 > would imitate what the webapp does, using pdfbox 3, and first
 > reproduce the error there.
 > But it may take several weeks, as I have little free time left.
 >
 > On Sun, Sep 24, 2023 at 1:33 PM Tilman Hausherr
 > <thaush...@t-online.de> wrote:
 >> Please share the smallest possible code to reproduce the problem,
 >> and additional files if needed. (Please make our life easy and test
 >> whether it can be reproduced without extra files)
 >>
 >> The AB_Manuel_Test.pdf file has the font file missing, this does
 >> look similar to the problem fixed recently.
 >>
 >> Tilman
 >>
 >> On 20.09.2023 20:23, Pados Attila wrote:
 >>>
 >>> https://drive.google.com/file/d/1LD0joGW9OnrXFPaY-HXZkwyKfFoCIe5L/view
 >>>
 >>>
 >>> sorry, I was in a hurry
 >>>
 >>>
 >>> On Wed, Sep 20, 2023 at 5:35 PM sahy...@fileaffairs.de <
 >>> sahy...@fileaffairs.de> wrote:
 >>>
  Dear Attila,
 
  both links point to the same file. The link to the PDFBox
  generated one is missing.

  BR
  Maruan

  On Tuesday, 19.09.2023 at 20:43 +0200, Pados Attila wrote:
 > Template pdf
 >
 >
 >

 > https://drive.google.com/file/d/1mbvN9RDKoesy0tJbj3GCO4VkMPjxYw5c/view?usp=sharing


Re: empty/missing pdf content

2023-10-16 Thread Pados Attila
I fixed the issue with the missing input PDF file, and also re-ran this test
project with the latest 3.0.1-SNAPSHOT version (Oct 5th).
So far, the character distortion remains:


[image: image.png]

The original text to print is site-1-1. I added this on purpose, as it was
failing in the real application too.
Last time I tried to reproduce the "missing content" problems, and that
didn't work, so I don't have any test code that reproduces it
(yet).


On Sat, Oct 14, 2023 at 5:04 AM Tilman Hausherr wrote:

> Hi,
>
> That has now been fixed as well so you don't need that call if you're
> using the snapshot.
>
> Another small thing I noticed, which didn't play a role but is weird and
> you should fix it: you didn't close the content stream before
> flattening, you did so AFTER. This may or may not bring weird effects.
>
> Tilman
>
> On 12.10.2023 05:03, Tilman Hausherr wrote:
> > Hi,
> >
> > That one has been solved by now but it turns out I had discovered a
> > different bug than the one you have. Yours is similar to
> > PDFBOX-5489.
> > Please call
> >
> > targetDoc.getDocument().setHighestXRefObjectNumber(sourceDoc.getDocument().getHighestXRefObjectNumber());
> >
> > and then it works.
> >
> > @Andreas should the call that is in importPage() also be added to
> > addPage() ?
> >
> > Tilman
> >
> > On 07.10.2023 12:44, Tilman Hausherr wrote:
> >> I was able to reduce your test even further, and created an issue in
> >> JIRA:
> >> https://issues.apache.org/jira/browse/PDFBOX-5696
> >>
> >> Tilman
> >>
> >>
> >> On 07.10.2023 11:24, Tilman Hausherr wrote:
> >>> The file "/pdf/Template.pdf" is missing in both projects.
> >>>
> >>> So it produces only one file. There is a difference, 29 has the
> >>> Poppins_Semibold font embedded and the newer one doesn't.
> >>>
> >>>
> >>> Root/Pages/Kids/[0]/Resources/XObject/Form4/Resources/Font/Poppins-SemiBold
> >>>
> >>>
> >>> Tilman
> >>>
> >>> On 03.10.2023 21:47, Pados Attila wrote:
>  Hi, here is the repository with test/reproduce code:
>  https://github.com/padisah/pdfboxtests
> 
>  Here I am reproducing a character displacement problem: text that
>  includes a '-' sign is shifted out of position.
>  There will be more cases added, with missing content.
> 
>  On Tue, Sep 26, 2023 at 3:04 PM Pados Attila wrote:
> 
> > Hi, so far the team delayed swapping pdfbox version, so I can only
> > work on this on my own.
> >
> > I will make a simple command line application, or a unit test, that
> > would imitate what the webapp does, using pdfbox 3, and first
> > reproduce the error there.
> > But it may take several weeks, as I have little free time left.
> >
> > On Sun, Sep 24, 2023 at 1:33 PM Tilman Hausherr wrote:
> >> Please share the smallest possible code to reproduce the problem,
> >> and additional files if needed. (Please make our life easy and test
> >> whether it can be reproduced without extra files)
> >>
> >> The AB_Manuel_Test.pdf file has the font file missing, this does
> >> look similar to the problem fixed recently.
> >>
> >> Tilman
> >>
> >> On 20.09.2023 20:23, Pados Attila wrote:
> >>>
> >>> https://drive.google.com/file/d/1LD0joGW9OnrXFPaY-HXZkwyKfFoCIe5L/view
> >>>
> >>>
> >>> sorry, I was in a hurry
> >>>
> >>>
> >>> On Wed, Sep 20, 2023 at 5:35 PM sahy...@fileaffairs.de <
> >>> sahy...@fileaffairs.de> wrote:
> >>>
>  Dear Attila,
> 
>  both links point to the same file. The link to the PDFBox
>  generated one is missing.
> 
>  BR
>  Maruan
> 
>  On Tuesday, 19.09.2023 at 20:43 +0200, Pados Attila wrote:
> > Template pdf
> >
> >
> >
> https://drive.google.com/file/d/1mbvN9RDKoesy0tJbj3GCO4VkMPjxYw5c/view?usp=sharing
> >
> > Pdf generated with pdfbox 3.0.0 without restricting flatten's
> > input fields
> >
> >
> >
> https://drive.google.com/file/d/1mbvN9RDKoesy0tJbj3GCO4VkMPjxYw5c/view?usp=sharing
> >
> > there should be a text AB Manuel Test
> >
> > --
> > Attila Pados
> > Java developer
> > +36204432457

PII data

2023-10-16 Thread Ward Dixon
Hello, does anyone know if PDFBox sends any information outside of my network 
from the PDF it is creating? I'm concerned about Personally Identifiable 
Information (PII) being inadvertently sent outside of my organization.