Manulak,

Manulak Dissanayake wrote
> In last few mails, Michael said "It is not strictly impossible to use
> iText to convert arbitrary PDFs to PDF/A as the iText low level API allows
> you to nearly everything allowed in PDF" and as he explained, it is also
> not a small fix to be done.
> I always pointed to that point. I claimed out that it is impossible for
> the moment to change the FOP side and make this fixed. So I expected a
> help from you to do so.

I not merely said that it's not a small fix to be done, I actually pointed
out that it's safe to assume that programming that as a single developer
takes more than a year. And even that likely was an understatement,
especially in case of really arbitrary PDFs.

Therefore, only go this road if the FOP process cannot be changed for some
years to come and your employer is willing to let you spend a large number
of months on this.

Manulak Dissanayake wrote
> As I got many information on "what to read" and "why  it is hard", I
> didn't get any reply saying "How to do it" or "try this".

The reason is that before even starting to "do" or "try" some conversion,
you have to learn about PDF/A. And that means getting to know the current
PDF specification (as it represents what you get), the specification version
1.4 (as the PDF/A spec is based on it), and the PDF/A spec itself really
really well, far better than I do for example. There actually are some other
documents referenced from these specs you should also read and understand.

Only then you can seriously start trying and doing. But by then you very
likely know better than I do how to even start.


One thing may work out to your advantage, though: You actually have to do
with only very special PDFs created by that FOP process. If you really
"only" need to convert them, you should first take some representative
sample PDFs from that process and check manually against both the PDF 1.4
spec (are they at least proper PDF 1.4 files?) and the PDF/A spec (what in
them is not PDF/Aish?). By checking I really mean checking each and every
object in the PDF, not merely running through some third party PDF/A
checker.

If you are lucky, you can safe some months this way.

Regards,   Michael

PS: Of course you should use a more current iText version than 2.1.6 which
you mentioned in your original posts. Quite a number of fixes have been
introduced into iText since that version directly or indirectly improving
the quality of the results.

PPS: I just read again the validation error report from your initial post.
Cleaning the properties should be fairly easy (if it's ok to loose some
information) but the CIDSet related issues indicate that some interesting
tasks (i.e. requiring some intimate knowledge of PDF and typesetting in
general) will come up soon. And don't assume the list of issues in that
report is complete.



--
View this message in context: 
http://itext-general.2136553.n4.nabble.com/FW-SPAM-Re-PDF-Validation-error-with-PDF-A1b-complaince-tp4656183p4656206.html
Sent from the iText - General mailing list archive at Nabble.com.

------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
iText-questions mailing list
iText-questions@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/itext-questions

iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a reference 
to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples: 
http://itextpdf.com/themes/keywords.php

Reply via email to