To get the images, use the ExtractImages.java tool; to get the text, use the ExtractText.java tool (or the ExtractTextSimple.java example from the source code download).
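
If you only need a summary (how much text, how many images, and their sizes) rather than the extracted files, something along these lines may be enough. This is a minimal sketch, not the code of the tools above: it assumes the PDFBox 2.0.x API (PDDocument.load), the class name PdfContentSummary and its two helper methods are made up for illustration, and it only looks at images referenced directly from each page's resources, so images nested inside form XObjects or patterns, and inline images, are not counted.

```java
import java.io.File;
import java.io.IOException;

import org.apache.pdfbox.cos.COSName;
import org.apache.pdfbox.pdmodel.PDDocument;
import org.apache.pdfbox.pdmodel.PDPage;
import org.apache.pdfbox.pdmodel.PDResources;
import org.apache.pdfbox.pdmodel.graphics.PDXObject;
import org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject;
import org.apache.pdfbox.text.PDFTextStripper;

// Hypothetical helper class, assuming PDFBox 2.0.x on the classpath.
public class PdfContentSummary {

    /** Length of the text PDFTextStripper extracts, whitespace and line breaks included. */
    static int countTextChars(PDDocument doc) throws IOException {
        return new PDFTextStripper().getText(doc).length();
    }

    /** Counts image XObjects referenced directly from page resources and prints their pixel sizes. */
    static int countPageImages(PDDocument doc) throws IOException {
        int count = 0;
        for (PDPage page : doc.getPages()) {
            PDResources resources = page.getResources();
            if (resources == null) {
                continue; // page has no resources at all
            }
            for (COSName name : resources.getXObjectNames()) {
                PDXObject xobject = resources.getXObject(name);
                if (xobject instanceof PDImageXObject) {
                    PDImageXObject image = (PDImageXObject) xobject;
                    System.out.println(name.getName() + ": "
                            + image.getWidth() + "x" + image.getHeight() + " px");
                    count++;
                }
            }
        }
        return count;
    }

    public static void main(String[] args) throws IOException {
        // args[0] is the path to the PDF to inspect
        try (PDDocument doc = PDDocument.load(new File(args[0]))) {
            System.out.println("Pages: " + doc.getNumberOfPages());
            System.out.println("Extracted text characters: " + countTextChars(doc));
            System.out.println("Images on pages: " + countPageImages(doc));
        }
    }
}
```

An image that is used on several pages is counted once per page here. The ExtractImages.java source shows how to walk the content streams if you need to catch nested or inline images as well.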

Tilman

On 14.05.2021 at 22:13, Subhajit Das wrote:
Hi There,

I am new to PDFBox. I need to find out what a PDF contains: how much text, how
many embedded images, their sizes, etc.
How do I accomplish that?
I found:
https://svn.apache.org/viewvc/pdfbox/trunk/examples/src/main/java/org/apache/pdfbox/examples/pdmodel/ExtractEmbeddedFiles.java?view=markup
But this is generic and handles all embedded files. How can I make it specific to
embedded images?
Thanks and Regards,
Subhajit Das



