For such formats I will covert these to PDF and then extract content. Nick, Really thanks for your cooperation,
On Mon, 20 Jul 2015 at 00:40 Nick Burch <[email protected]> wrote: > On Fri, 17 Jul 2015, Nazar Hussain wrote: > > I tested it with different pdf documents and it works 100% perfect. > > Unfortunately it does not work well with docx format. > > That's entirely to be expected. Word .doc and .docx formats are run-based > not page-based, so there's no page information in the file to extract. > (There's section information, but not page) > > Nick >
