[ 
https://issues.apache.org/jira/browse/PDFBOX-2562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14283579#comment-14283579
 ] 

Maruan Sahyoun commented on PDFBOX-2562:
----------------------------------------

Let me add some background to what [~jahewson] wrote:

A font embedded in a PDF file may or may not contain all of the information 
from the original font file, as only some parts of the original font file are 
required within a PDF. These requirements are checked by the PDF/A validator 
and reflected in its results.

Validating the font itself would mean validating it against the font 
specification, e.g. TTF. Some of these specifications are ISO standards, some 
are not. For some font formats there are multiple specs, which might introduce 
some ambiguity.

As the font program embedded in the PDF does not have all the information 
required by the font spec, extracting it and running it through a full 
font check would bring up a lot of errors which are valid from a font 
perspective (as the font might not be complete) but not from a PDF 
perspective (as the font doesn’t need to have all the information required in 
the font spec). In addition, validating the font without the context 
information from the PDF might also be incomplete.

The same applies in principle to other embedded resources like images.

An architecture which does a full validation of a PDF and all embedded 
resources would likely:

- take a PDF
- validate the PDF
- extract the resources from the PDF
- validate these against external individual validators
- combine the validation results
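
The steps above can be sketched roughly as follows. This is only an 
illustration of the flow, not PDFBox or Preflight code: the class 
FullValidationSketch, the types Finding and Resource, and the three pluggable 
functions are all hypothetical names made up for this example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

// Hypothetical sketch of the pipeline; none of these names exist in PDFBox.
class FullValidationSketch {

    /** One finding; "source" names the validator that produced it. */
    static final class Finding {
        final String source;
        final String message;

        Finding(String source, String message) {
            this.source = source;
            this.message = message;
        }
    }

    /** A resource extracted from the PDF, e.g. a font program or an image. */
    static final class Resource {
        final String type;
        final byte[] data;

        Resource(String type, byte[] data) {
            this.type = type;
            this.data = data;
        }
    }

    /**
     * Takes a PDF, validates it, extracts its resources, runs each resource
     * through the external validator registered for its type (if any), and
     * returns all findings combined, PDF findings first.
     */
    static List<Finding> validate(
            byte[] pdf,
            Function<byte[], List<Finding>> pdfValidator,
            Function<byte[], List<Resource>> extractor,
            Map<String, Function<Resource, List<Finding>>> externalValidators) {

        // Steps 1 and 2: take the PDF and validate it as a PDF.
        List<Finding> combined = new ArrayList<>(pdfValidator.apply(pdf));

        // Step 3: extract the embedded resources.
        for (Resource resource : extractor.apply(pdf)) {
            // Step 4: validate against an external validator, if one exists.
            Function<Resource, List<Finding>> validator =
                    externalValidators.get(resource.type);
            if (validator != null) {
                // Step 5: combine the results.
                combined.addAll(validator.apply(resource));
            }
        }
        return combined;
    }

    public static void main(String[] args) {
        // Stub validators and extractor, standing in for real tools.
        Function<byte[], List<Finding>> pdfValidator =
                pdf -> List.of(new Finding("pdf", "example PDF finding"));
        Function<byte[], List<Resource>> extractor =
                pdf -> List.of(new Resource("font", new byte[0]));
        Function<Resource, List<Finding>> fontValidator =
                r -> List.of(new Finding("font", "example font finding"));

        List<Finding> findings = validate(new byte[0], pdfValidator, extractor,
                Map.of("font", fontValidator));
        for (Finding f : findings) {
            System.out.println(f.source + ": " + f.message);
        }
    }
}
```

Keeping the external validators behind a lookup by resource type keeps the 
PDF part separate from the font or image checks, which is the split described 
in this comment.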

PDFBox can surely do all the PDF-related tasks. Doing a full validation of the 
PDF and all resources against their individual standards is beyond the scope 
of the PDFBox project and not needed for PDF/A validation.


> create ValidationWarnings
> -------------------------
>
>                 Key: PDFBOX-2562
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2562
>             Project: PDFBox
>          Issue Type: Sub-task
>          Components: Preflight
>    Affects Versions: 2.0.0
>            Reporter: Ralf Hauser
>            Priority: Minor
>             Fix For: 2.1.0
>
>
> as seen in the parent Issue PDFBOX-2513 , pdf/A1 is not required to validate 
> other standards like TIFF completely.
> On the other hand, the information from the CCITTFaxDecoder is valuable and 
> shouldn't be simply discarded.
> Therefore, I suggest that this information is displayed without setting the 
> validation result to false.
> ValidationWarnings could have the same logical structure as ValidationError .



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
