Why would working through the PostScript be easier than doing this on  
the original PDF?

        You can get to all the PDF operators just fine.
        Font & text information is more easily referenceable from the PDF
        PostScript also has "XObjects", Patterns, etc. that may contain text.
        etc.

Not understanding the logic :(.

Leonard


On Oct 6, 2007, at 4:53 PM, [EMAIL PROTECTED] wrote:

> Yes; but it is not practicable with iText. You could, however, as  
> long as the PDF is printable, use the following procedure:
>
>      1. Print to a PS file.
>
>      2. Scan the PS file from step1 above, dropping all lines that  
> do not end with Tj or TJ.
>
>      3. Use a regular expression (together with Substitution or  
> Match) to extract the instances of "text fragment" from within  
> multiple instances of "(text fragment)Tj", printing the resulting  
> text fragments to STDOUT.
>
> Bruno has given an excellent example of why you should not expect  
> the resulting output to make sense, i.e., the text fragments may  
> not appear in the order in which you'd like for them to appear.
>
> Cheers,
>
> Bill Segraves
>
> -------------- Original message from krammark  
> <[EMAIL PROTECTED]>: --------------
>
>
> >
> > so , how we read the data from pdf ?
> > i mean , can we read them line by line from the specific pages ?
> >
> > thanks buddy.
> >
> >
> > Bruno Lowagie (iText) wrote:
> > >
> > > krammark wrote:
> > >> hey gusy,
> > >> do u guys have a idea how to read the data from pdf pages  
> using itext ?
> > >> basically, i want to read the data from table and write them  
> into excel
> > >> files.
> > >> is that possible ?
> > >
> > > There is no such thing as 'a table' in plain PDF.
> > > It's just lines and words painted on a canvas,
> > > possible in an arbitrary order.
> > >
> > > Unless your tables cells are form fields, or your
> > ; > PDF contains specific table structures (Tagged PDF),
> > > iText probably won't help you.
> > >
> > > br,
> > > Bruno
> > >
> > >  
> ---------------------------------------------------------------------- 
> ---
> > > This SF.net email is sponsored by: Splunk Inc.
> > > Still grepping through log files to find problems? Stop.
> > > Now Search log events and configuration files using AJAX and a  
> browser.
> > > Download your FREE copy of Splunk now >> http://get.splunk.com/
> > > _______________________________________________
> > > iText-questions mailing list
> > > [email protected]
> > > https://lists.sourceforge.net/lists/listinfo/itext-questions
> > > Buy the iText book: http://itext.ugent.be/itext-in-action/
> > >
> > >
> >
> > --
> > View this message in context:
> > http://www.nabble.com/u-guys-konw -how-t o-read-the-data-from-pdf- 
> using-java-itext
> > ---tf4572506.html#a13067937
> > Sent from the iText - General mailing list archive at Nabble.com.
> >
> >
> >  
> ---------------------------------------------------------------------- 
> ---
> > This SF.net email is sponsored by: Splunk Inc.
> > Still grepping through log files to find problems? Stop.
> > Now Search log events and configuration files using AJAX and a  
> browser.
> > Download your FREE copy of Splunk now >> http://get.splunk.com/
> > _______________________________________________
> > iText-questions mailing list
> > [email protected]
> > https://lists.sourceforge.net/lists/listinfo/itext-questions
> > Buy the iText book: http://itext.ugent.be/itext-in-action/
> ---------------------------------------------------------------------- 
> ---
> This SF.net email is sponsored by: Splunk Inc.
> Still grepping through log files to find problems?  Stop.
> Now Search log events and configuration files using AJAX and a  
> browser.
> Download your FREE copy of Splunk now >> http://get.splunk.com/ 
> _______________________________________________
> iText-questions mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/itext-questions
> Buy the iText book: http://itext.ugent.be/itext-in-action/


-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/
_______________________________________________
iText-questions mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/itext-questions
Buy the iText book: http://itext.ugent.be/itext-in-action/

Reply via email to