Hi,

On Thu, Jul 22, 2010 at 5:22 PM, Ensor, Neal <[email protected]> wrote:
> I know the information is available via underlying library calls (e.g., PDF 
> box) and
> appears it should be available via extended information in the MS Office 
> parser,
> but I don't see it in the metadata of any documents I tried.  My question is, 
> was
> there some reason why page counts are omitted?

The only reason is that nobody has yet gotten around to adding that
feature to Tika. :-)

> I hacked my local copy of PDFParser to provide such via the
> PDDocument.getNumberOfPages() call,  but was wondering if I missed something
> somewhere or there might be a reason to not provide such information.

It would be great if you wanted to share your changes by posting them
as an improvement request in
https://issues.apache.org/jira/browse/TIKA.

BR,

Jukka Zitting

Reply via email to