Hello,

I am completely new to the internals of the PDF format and to the PDFBox library. My goal is to extract index data from a PDF book (the book has no proper index, and one is badly needed). That is, I want to detect the fonts and sizes used for headings and sub-headings, extract their text together with the page number, and then write that information to a text file.
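To make the goal concrete, here is a rough sketch of the post-processing step I have in mind. It does not use PDFBox at all: `TextRun`, `buildIndex`, and the size thresholds (18 pt and 14 pt) are hypothetical names and values I made up for illustration. I am assuming that PDFBox can somehow give me each piece of text with its font name, font size, and page number; from a first look at the Javadoc, `TextPosition` (with `getFontSizeInPt()` and `getFont()`) might be the place to get this, but that is only my guess.

```java
import java.util.ArrayList;
import java.util.List;

public class HeadingIndex {

    /** Hypothetical holder for one run of text and the font it was drawn with. */
    public static final class TextRun {
        final String text;
        final String fontName;
        final float fontSize;
        final int page;

        public TextRun(String text, String fontName, float fontSize, int page) {
            this.text = text;
            this.fontName = fontName;
            this.fontSize = fontSize;
            this.page = page;
        }
    }

    /**
     * Keeps only the runs whose font size marks them as a heading or sub-heading.
     * Each result line is "text<TAB>page"; sub-headings are indented by four spaces.
     * The thresholds are assumptions and would have to be tuned per book.
     */
    public static List<String> buildIndex(List<TextRun> runs,
                                          float headingSize,
                                          float subHeadingSize) {
        List<String> lines = new ArrayList<>();
        for (TextRun run : runs) {
            String text = run.text.trim();
            if (text.isEmpty()) {
                continue; // skip whitespace-only runs
            }
            if (run.fontSize >= headingSize) {
                lines.add(text + "\t" + run.page);
            } else if (run.fontSize >= subHeadingSize) {
                lines.add("    " + text + "\t" + run.page);
            }
            // anything smaller is treated as body text and ignored
        }
        return lines;
    }

    public static void main(String[] args) {
        // Made-up sample data standing in for what would come out of the PDF.
        List<TextRun> runs = new ArrayList<>();
        runs.add(new TextRun("1 Introduction", "Helvetica-Bold", 18f, 1));
        runs.add(new TextRun("Some body text.", "Times-Roman", 10f, 1));
        runs.add(new TextRun("1.1 Scope", "Helvetica-Bold", 14f, 2));

        for (String line : buildIndex(runs, 18f, 14f)) {
            System.out.println(line);
        }
    }
}
```

The part I do not know how to do is filling the list of runs from the actual PDF.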
How can I do that?

Juhan

