[ 
https://issues.apache.org/jira/browse/PDFBOX-5198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17359359#comment-17359359
 ] 

Matthew Jung commented on PDFBOX-5198:
--------------------------------------

Hi Hausherr
It looks like the issue happens when the PDF file is not tagged correctly as 
PDF UA. If the pdf is tagged correctly it works fine
Matt
    On Monday, June 7, 2021, 10:31:01 PM EDT, Tilman Hausherr (Jira) 
<[email protected]> wrote:  
 
 
    [ 
https://issues.apache.org/jira/browse/PDFBOX-5198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17358997#comment-17358997
 ] 

Tilman Hausherr commented on PDFBOX-5198:
-----------------------------------------

The release build is planned for today, so if you get examples later, please 
create a new issue.




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


> When merging multiple pdf ua documents, Tags become nested
> ----------------------------------------------------------
>
>                 Key: PDFBOX-5198
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5198
>             Project: PDFBox
>          Issue Type: Wish
>          Components: Utilities
>    Affects Versions: 2.0.21, 2.0.23
>            Reporter: Matthew Jung
>            Assignee: Tilman Hausherr
>            Priority: Major
>             Fix For: 2.0.24, 3.0.0 PDFBox
>
>         Attachments: 1622000586495blob.jpg, 1622120149457blob.jpg, 
> 1622120149457blob.jpg, 1622123253165blob.jpg, 1622123790854blob.jpg, 
> 1623105725988blob.jpg, 1623105725988blob.jpg, 1623115281967blob.jpg, 
> Binder1.pdf, PDFA3A-merged-new.pdf, PDFUA-in-a-Nutshell-PDFUA_1.pdf, 
> nested_tags_4documents_merged_using_pdfbox.tif, 
> non_nested_tags_4documents_combined_using+adobe_pro.tif, screenshot-1.png
>
>
> When merging PDF UA documents the tags seen in Adobe reader are nested. If 
> merging 200 documents then the tags are 200 nested deep. It does not appear 
> to affect that JAWS reader can still read the document  but it may slow down 
> performance when loading to a content repository.
> <DOCUMENT>
>           <DOCUMENT>
>                        <DOCUMENT>
> when using Adobe DC to merge multiple documents the tags are flatten
> <DOCUMENT>
>      <DOCUMENT>
>       <DOCUMENT>
>       <DOCUMENT>
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to