I am looking for implementation logic or algorithm to parse path
object(graphics object) from a PDF file. As we all aware , iText API does
not have the capability to parse path object , I pursued to review PDFbox
and multivalent APIs to grasp the implementation logic to parse path object.
I could not find any implementation for parsing path object in these APIS. I
am stuck now.  All I am looking for is where path object starts and where it
ends as marked content operators BDC...EMC..inline image...BI EI ...image
with Do operators.  The path object has too many operators.  For example, a
chart in a PDF doc has many pathing and graphics operators mix as below. 
Instead of parsing each and every pathing and graphics operators and
identify as rectangle or line and group them as chart, I want to implement
simple logic to find where path obj starts and ends. I really appreciate if
anyone points me to right direction.

TOTAL NO OF PAGES = 1
content stream for page 1 =BT
/GS0 gs
/TT0 1 Tf
0.0009 Tc -0.0021 Tw 12 0 0 12 90 708.96 Tm
[(This is a cha)5(r)-1(t  )]TJ
0 Tc 0 Tw 0 -1.15 TD
( )Tj
T*
( )Tj
ET
q
1 0 0 1 121.8000031 562.6799927 cm
0.5 g
0 0 m
2.4 1.8 l
126 1.8 l
123.6 0 l
0 0 l
f*
Q
q
1 0 0 1 121.8000031 562.6799927 cm
0.754 g
0 0 m
0 96.6 l
2.4 98.4 l
2.4 1.8 l
0 0 l
f*
Q
-- 
View this message in context: 
http://itext-general.2136553.n4.nabble.com/object-level-document-analyis-of-PDF-files-tp2282895p2282895.html
Sent from the iText - General mailing list archive at Nabble.com.

------------------------------------------------------------------------------
This SF.net email is sponsored by Sprint
What will you do first with EVO, the first 4G phone?
Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first
_______________________________________________
iText-questions mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/itext-questions

Buy the iText book: http://www.itextpdf.com/book/
Check the site with examples before you ask questions: 
http://www.1t3xt.info/examples/
You can also search the keywords list: http://1t3xt.info/tutorials/keywords/

Reply via email to