Am 05.09.2015 um 09:06 schrieb vahid islami:
hi every one ,
I write program that parse pdf and using pdfbox( I also change source code
to work with rtl language). my problem is how extract heading from pdf?
thank all.


You can't unless you have a tagged PDF. You would have to use heuristics and make a decision based on font sizes.

Tilman

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to