Re: [iText-questions] PDF metadata

Matt Benson Tue, 26 Nov 2002 09:50:32 -0800

Sounds complex but interesting...

-Matt


--- "John K. Watson" <[EMAIL PROTECTED]> wrote:
> On our project, we wrote an aspect to do something
> like this.  We 
> intercepted all calls to the itext api and logged
> the value of the 
> parameters.  We're using it for regression testing,
> but you could 
> certainly use the same idea for caching documents.
> 
>       John
> 
> On Tuesday, Nov 26, 2002, at 08:20 US/Pacific, Paulo
> Soares wrote:
> 
> > Taking out the metadata won't help you as there
> are no guaranties that 
> > the
> > layout engine is the same from version to version,
> the text may look 
> > the
> > same but the internal representation is different.
> The best way is to 
> > do a
> > checksum to the text (words only, skipping the
> whitespace) and store 
> > that
> > information in the pdf metadata as a new key. The
> already generated 
> > pdf can
> > have the text extracted, the checksum calculated
> and applied to the 
> > same
> > pdf.
> >
> > Best Regards,
> > Paulo Soares
> >
> >> -----Original Message-----
> >> From:      Matt Benson [SMTP:[EMAIL PROTECTED]]
> >> Sent:      Tuesday, November 26, 2002 15:40
> >> To:        itext-questions
> >> Subject:   [iText-questions] PDF metadata
> >>
> >> We are using iText to convert text files to PDF
> as
> >> outlined in the FAQ.  This works; however I want
> to
> >> take a checksum of the PDF created and use it in
> >> conjunction with some other information to verify
> we
> >> have not created this file before.  What I am
> finding,
> >> however, is that the metadata of the PDF always
> >> differs between iText versions as well as
> creation
> >> date/time, so I cannot create the exact same file
> >> twice and thus cannot rely on a checksum.  I
> could use
> >> the checksum from the input file, except that
> this is
> >> a modification to a production application and we
> no
> >> longer have the input files for the existing
> data.  So
> >> to do this I would have to extract the text to
> get an
> >> approximation of the original file.  If I did
> this,
> >> the checksum would represent slightly different
> things
> >> from the old to the new data.  What I am
> wondering
> >> about is whether these variable pieces of
> metadata are
> >> vital to the PDF structure, and if not, what
> would it
> >> take to remove them?  Alternatively, if anyone
> has a
> >> better idea then those are welcome too.
> >>
> >> Thanks,
> >> Matt
> >>
> >>
> __________________________________________________
> >> Do you Yahoo!?
> >> Yahoo! Mail Plus - Powerful. Affordable. Sign up
> now.
> >> http://mailplus.yahoo.com
> >>
> >>
> >>
>
-------------------------------------------------------
> >> This SF.net email is sponsored by: Get the new
> Palm Tungsten T
> >> handheld. Power & Color in a compact size!
> >>
>
http://ads.sourceforge.net/cgi-bin/redirect.pl?palm0002en
> >> _______________________________________________
> >> iText-questions mailing list
> >> [EMAIL PROTECTED]
> >>
>
https://lists.sourceforge.net/lists/listinfo/itext-questions
> >
> >
> >
>
-------------------------------------------------------
> > This SF.net email is sponsored by: Get the new
> Palm Tungsten T
> > handheld. Power & Color in a compact size!
> >
>
http://ads.sourceforge.net/cgi-bin/redirect.pl?palm0002en
> > _______________________________________________
> > iText-questions mailing list
> > [EMAIL PROTECTED]
> >
>
https://lists.sourceforge.net/lists/listinfo/itext-questions
> 


__________________________________________________
Do you Yahoo!?
Yahoo! Mail Plus - Powerful. Affordable. Sign up now.
http://mailplus.yahoo.com


-------------------------------------------------------
This SF.net email is sponsored by: Get the new Palm Tungsten T 
handheld. Power & Color in a compact size! 
http://ads.sourceforge.net/cgi-bin/redirect.pl?palm0002en
_______________________________________________
iText-questions mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/itext-questions

Re: [iText-questions] PDF metadata

Reply via email to