I'm attaching 2 pdf files.1.pdf is the one from our test suite. It contains root Document tag which includes Header (H1) and Paragraph tags. This is absolutely correct PDF/UA document.
2.pdf is generated in the same way as 1.pdf with the only exception. I've set role=null to Document. In java code it looks like:
document = new Document(); document.setRole(null);You can see that 2.pdf contains no tags. In this case you can control tagged output manually. No auto generated tags. Is this what you want to achieve? If not, please share your code and sample files. It will help me to understand the problem.
On 10/4/2013 11:21 AM, Jones Tim wrote:
Hi,I think this is more of my misunderstanding. I'm basically writing the tags myself including a <Document> tag to the root. I wish to have full control over what tags to write and what tags not to write. I had problems with the default auto generated tags. As much as I can remember, the logical reading order of the PDF not being quite right and the tagging not passing accessibility checks in Adobe.I'm now left with a PDF with 2 <Document> tags. 1 of which is auto generated by iText and one which is defined by me in code. I do use the Role = null to stop tags being set. But I'm left with a root document tag with a bunch of path:path (etc) tags. Maybe I don't understand how to achieve what I need. Defining the logical reading order would be good because this sometimes doesn't quite work in particular scenarios.Tim ------------------------------------------------------------------------ *From:* iText Info <i...@1t3xt.info>*To:* Jones Tim <timjone...@yahoo.co.uk>; Post all your questions about iText here <itext-questions@lists.sourceforge.net>*Sent:* Wednesday, 2 October 2013, 17:01 *Subject:* Re: [iText-questions] PDF tagging document tag Op 19/09/2013 16:46, Jones Tim schreef: > Please could we have functionality to set the tagging of the document > at the very basic level of setting user properties and creating the > default root structure of the document and NOT to create the > "<Document>" tag and it's other tags. Adding the <Document> tag to the document is really important. It's being added not only as a root to the structure tree, but to the page content of every page as well. Without having <Document> as a root element you can't have a proper PDF/UA document. That's why we have decided to add the <Document> tag as root element by default. It's not trivial to localize the method which adds this tag: - part of the job is done on opening the document, - part is done on opening and closing pages, - the rest is done on closing document. As you can see: it's not as simple as one would think. Unfortunately, it's not possible now to tell iText to "skip writing the root element". You could use a workaround in some cases. For instance, you could redefine the document role. Let's say you could call Document.setRole(PdfName.DIV), then the <Document> tag would be replaced with <Div> tag in both structure tree and page content For the next release, we can add an option which allows to skip writing certain tags. Currently it's possible to call Document.setRole(null). It means that <Document> tag will not be written. But in this case all internal tags will not be written as well. I think we can extend this functionality a bit so that a certain tag is not written, but all internal tags are written. Also I have a question what do you want to achieve with redefining <Document> tag? What extra functionality do you need? Maybe we can advise you something or push this functionality into next release.
1.pdf
Description: Adobe PDF document
2.pdf
Description: Adobe PDF document
------------------------------------------------------------------------------ October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register > http://pubads.g.doubleclick.net/gampad/clk?id=60134071&iu=/4140/ostg.clktrk
_______________________________________________ iText-questions mailing list iText-questions@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/itext-questions iText(R) is a registered trademark of 1T3XT BVBA. Many questions posted to this list can (and will) be answered with a reference to the iText book: http://www.itextpdf.com/book/ Please check the keywords list before you ask for examples: http://itextpdf.com/themes/keywords.php