Unless the PDF producer used marked content operators to delineate the chart (as is done by Adobe's applications), then you are correct - there is no easy way to do it. You will need to create your own heuristics.
Leonard -----Original Message----- From: sal salaimani [mailto:[email protected]] Sent: Monday, July 12, 2010 12:54 PM To: [email protected] Subject: Re: [iText-questions] object level document analyis of PDF files Bruno wrote: I think it's feasible to extend the available parser functionality. In the book, there's an example that draws a rectangle that contains all the text on a page. There is no such functionality (yet) to get the bounding rectangle of a shape, but it should be doable with some extra programming work. My reply: I agree with you. My problem here is identifying chart as a whole enity(chart is made of group of path and text objects). Parsing path construction, path paining, and graphics state operators , path object can be identified. Parsing BT and ET(text object operators), text object can be identified. since path objects are drawn based on coordinates of the page, there is no way to identify chart by grouping/parsing path and text objects from page's content stream. Do you agree with me? Can you please give me more clue to identify chart if I am not right? -- View this message in context: http://itext-general.2136553.n4.nabble.com/object-level-document-analyis-of-PDF-files-tp2282895p2286427.html Sent from the iText - General mailing list archive at Nabble.com. ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ iText-questions mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/itext-questions Buy the iText book: http://www.itextpdf.com/book/ Check the site with examples before you ask questions: http://www.1t3xt.info/examples/ You can also search the keywords list: http://1t3xt.info/tutorials/keywords/ ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ iText-questions mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/itext-questions Buy the iText book: http://www.itextpdf.com/book/ Check the site with examples before you ask questions: http://www.1t3xt.info/examples/ You can also search the keywords list: http://1t3xt.info/tutorials/keywords/
