Re: [BangPypers] Retrieving images from PDFs

Anand Balachandran Pillai Tue, 29 Dec 2009 23:02:54 -0800

On Tue, Dec 29, 2009 at 5:51 PM, Srinivas Reddy Thatiparthy <
[email protected]> wrote:


> Though I never tried it, Reportlab has a pdf library (If I am not
> wrong).
> Have you tried it??
>
>
> When the limestone of imperative programming is worn away, the granite
> of functional programming will be observed. ---Simon Peyton Jones.
>
>
> -----Original Message-----
> From: bangpypers-bounces+srinivas_thatiparthy=akebonosoft....@python.org
> [mailto:bangpypers-bounces+srinivas_thatiparthy<bangpypers-bounces%2Bsrinivas_thatiparthy>
> [email protected]
> rg] On Behalf Of Shashwat Anand
> Sent: Tuesday, December 29, 2009 5:49 PM
> To: Bangalore Python Users Group - India
> Subject: [BangPypers] Retrieving images from PDFs
>
> How can we retrieve images from PDFs. I need both images and the text
> beneath the image to form a database. I was able to parse text via
> PDFMiner but was crippled when it leads to images.
>

Use pyPdf. I have worked a bit on PDF structure using pyPdf and written a
library
on top of it for PDF accessibility so I can help you to do this. If you want

help contact me off list.


> _______________________________________________
> BangPypers mailing list
> [email protected]
> http://mail.python.org/mailman/listinfo/bangpypers
> _______________________________________________
> BangPypers mailing list
> [email protected]
> http://mail.python.org/mailman/listinfo/bangpypers
>



-- 
--Anand
_______________________________________________
BangPypers mailing list
[email protected]
http://mail.python.org/mailman/listinfo/bangpypers

Re: [BangPypers] Retrieving images from PDFs

Reply via email to