dayvidpow wrote: > What do you mean by "where each paragrapgh start & end"? Are you referring > to the coordinates position of each paragrapgh?
No, he isn't. If you read chapter 18 of my book, you'll notice that a short sentence like "Hello World" can be stored in the PDF as "Hello World", but it shouldn't surprise you if it's stored as "ld", "Wor", "llo", and "He". To the human being reading the PDF in Adobe Reader, it will look exactly the same on the screen, but inside the PDF "llo" can be the first part of the sentence appearing in the content stream, and "He" the last part. Also how do you know that "Hello World" is a two-word sentence and not just two one-word sentences? That's what Leonard means: the paragraphs are in the PDF and you can read them on your screen when using Adobe Reader, but when your machine looks at the bits and bytes, there are no paragraphs, no sentences, maybe not even recognizable characters. That's what makes it difficult to make a Tagged PDF from a PDF that isn't tagged. >> Is a given string a Header or Footer? Is >> any text contained inside of a table - and if so, what part of the >> table? Once you have a semantic map for your data, you are ready to >> proceed to step two. (This part is, of course, outside iText) > > There are no tables or header or footer , etc. I am simply inserting the > words in the PDF at their specified coordinate position. Tagging content in newly created PDFs is explained in section F.3.2 of the book. You have to use beginMarkedContentSequence and endMarkedContentSequence, and so on... But to make this work for specific pieces of text added to an existing PDF, you'll have to adapt iText. br, Bruno ------------------------------------------------------------------------- This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ _______________________________________________ iText-questions mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/itext-questions Buy the iText book: http://itext.ugent.be/itext-in-action/
