On our project, we wrote an aspect to do something like this. We intercepted all calls to the itext api and logged the value of the parameters. We're using it for regression testing, but you could certainly use the same idea for caching documents.

John

On Tuesday, Nov 26, 2002, at 08:20 US/Pacific, Paulo Soares wrote:

Taking out the metadata won't help you as there are no guaranties that the
layout engine is the same from version to version, the text may look the
same but the internal representation is different. The best way is to do a
checksum to the text (words only, skipping the whitespace) and store that
information in the pdf metadata as a new key. The already generated pdf can
have the text extracted, the checksum calculated and applied to the same
pdf.

Best Regards,
Paulo Soares

-----Original Message-----
From:	Matt Benson [SMTP:[EMAIL PROTECTED]]
Sent:	Tuesday, November 26, 2002 15:40
To:	itext-questions
Subject:	[iText-questions] PDF metadata

We are using iText to convert text files to PDF as
outlined in the FAQ.  This works; however I want to
take a checksum of the PDF created and use it in
conjunction with some other information to verify we
have not created this file before.  What I am finding,
however, is that the metadata of the PDF always
differs between iText versions as well as creation
date/time, so I cannot create the exact same file
twice and thus cannot rely on a checksum.  I could use
the checksum from the input file, except that this is
a modification to a production application and we no
longer have the input files for the existing data.  So
to do this I would have to extract the text to get an
approximation of the original file.  If I did this,
the checksum would represent slightly different things
from the old to the new data.  What I am wondering
about is whether these variable pieces of metadata are
vital to the PDF structure, and if not, what would it
take to remove them?  Alternatively, if anyone has a
better idea then those are welcome too.

Thanks,
Matt

__________________________________________________
Do you Yahoo!?
Yahoo! Mail Plus - Powerful. Affordable. Sign up now.
http://mailplus.yahoo.com


-------------------------------------------------------
This SF.net email is sponsored by: Get the new Palm Tungsten T
handheld. Power & Color in a compact size!
http://ads.sourceforge.net/cgi-bin/redirect.pl?palm0002en
_______________________________________________
iText-questions mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/itext-questions

-------------------------------------------------------
This SF.net email is sponsored by: Get the new Palm Tungsten T
handheld. Power & Color in a compact size!
http://ads.sourceforge.net/cgi-bin/redirect.pl?palm0002en
_______________________________________________
iText-questions mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/itext-questions


-------------------------------------------------------
This SF.net email is sponsored by: Get the new Palm Tungsten T handheld. Power & Color in a compact size! http://ads.sourceforge.net/cgi-bin/redirect.pl?palm0002en
_______________________________________________
iText-questions mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/itext-questions

Reply via email to