Hi, On Thu, Jul 22, 2010 at 5:22 PM, Ensor, Neal <[email protected]> wrote: > I know the information is available via underlying library calls (e.g., PDF > box) and > appears it should be available via extended information in the MS Office > parser, > but I don't see it in the metadata of any documents I tried. My question is, > was > there some reason why page counts are omitted?
The only reason is that nobody has yet gotten around to adding that feature to Tika. :-) > I hacked my local copy of PDFParser to provide such via the > PDDocument.getNumberOfPages() call, but was wondering if I missed something > somewhere or there might be a reason to not provide such information. It would be great if you wanted to share your changes by posting them as an improvement request in https://issues.apache.org/jira/browse/TIKA. BR, Jukka Zitting
