[Podofo-users] Extracting Accessible Text

Mark Rogers Thu, 19 Mar 2009 11:23:35 -0700

Hi


I'm trying to figure out how to extract text from a PDF for use in an
accessibility tool.

 

I've figured out how to walk the tagged structure returned by
GetStructTreeRoot, but I'm stuck on how to get from an integer marked content
identifier (PDF 32000 14.7.2) to the actual text.

 

It looks like I probably need to parse the content stream using
PdfContentsTokenizer and scan for the corresponding marked content.
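
To make the scan concrete, here is a minimal sketch of the idea in plain C++.
It is not PoDoFo code: Tokenize and TextByMcid are hypothetical helpers, and
the input is assumed to be an already decoded content stream that uses only
literal strings and inline property dictionaries. Hex strings, escape
decoding, font encodings and property lists referenced by name through the
Properties resource are all ignored. It tracks BDC/BMC ... EMC nesting and
collects the strings shown by Tj, TJ, ' and " under the innermost open MCID.

```cpp
#include <cctype>
#include <cstdlib>
#include <map>
#include <string>
#include <vector>

// Hypothetical helper: split a decoded content stream into tokens.
// Literal strings become one token with a leading '(' marker and no
// closing parenthesis; "<<", ">>", "[" and "]" are separate tokens.
std::vector<std::string> Tokenize(const std::string& s) {
    std::vector<std::string> out;
    std::size_t i = 0;
    while (i < s.size()) {
        const char c = s[i];
        if (std::isspace(static_cast<unsigned char>(c))) { ++i; continue; }
        if (c == '(') {
            std::string text;
            int depth = 1;
            ++i;
            while (i < s.size()) {
                const char ch = s[i++];
                // Simplification: copy the escaped character as-is.
                if (ch == '\\' && i < s.size()) { text += s[i++]; continue; }
                if (ch == '(') ++depth;
                else if (ch == ')' && --depth == 0) break;
                text += ch;
            }
            out.push_back("(" + text);
            continue;
        }
        if (c == '[' || c == ']') { out.push_back(std::string(1, c)); ++i; continue; }
        if ((c == '<' || c == '>') && i + 1 < s.size() && s[i + 1] == c) {
            out.push_back(s.substr(i, 2));
            i += 2;
            continue;
        }
        // Name, number or operator: read up to whitespace or a delimiter.
        const std::size_t start = i++;
        while (i < s.size() && !std::isspace(static_cast<unsigned char>(s[i])) &&
               std::string("()<>[]/").find(s[i]) == std::string::npos) ++i;
        out.push_back(s.substr(start, i - start));
    }
    return out;
}

// Hypothetical helper: map each MCID to the text shown inside its
// marked-content sequence.
std::map<int, std::string> TextByMcid(const std::string& stream) {
    std::map<int, std::string> result;
    std::vector<int> open;              // one entry per open sequence, -1 = no MCID
    std::vector<std::string> operands;  // operands seen since the last operator
    for (const std::string& tok : Tokenize(stream)) {
        const char c = tok[0];
        const bool isOperator =
            std::isalpha(static_cast<unsigned char>(c)) || c == '\'' || c == '"';
        if (!isOperator) { operands.push_back(tok); continue; }
        if (tok == "BDC" || tok == "BMC") {
            int mcid = -1;
            for (std::size_t k = 0; k + 1 < operands.size(); ++k)
                if (operands[k] == "/MCID") mcid = std::atoi(operands[k + 1].c_str());
            open.push_back(mcid);
        } else if (tok == "EMC") {
            if (!open.empty()) open.pop_back();
        } else if (tok == "Tj" || tok == "TJ" || tok == "'" || tok == "\"") {
            // Attribute the text to the innermost enclosing sequence with an MCID.
            int mcid = -1;
            for (auto it = open.rbegin(); it != open.rend() && mcid < 0; ++it) mcid = *it;
            if (mcid >= 0)
                for (const std::string& op : operands)
                    if (op[0] == '(') result[mcid] += op.substr(1);
        }
        operands.clear();
    }
    return result;
}
```

For a stream such as "/P <</MCID 0>> BDC BT (Hello ) Tj ET EMC", TextByMcid
would give back the text keyed by MCID 0. MCIDs are only unique within one
content stream, so the lookup has to be done per page, using the page that
the structure element refers to. With PoDoFo the same bookkeeping could
presumably be driven by the keywords and operands that PdfContentsTokenizer
yields instead of the hand-rolled Tokenize above.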

 

Is this the right thing to do? Is there an easier way to do this?

 

Thanks

Mark


