On Tuesday 11 October 2011, Alec Taylor wrote: > Good afternoon, > > Do you have some recommends and/or sample code for comparing textual > and geometric layout information across pages? > > Basically I'm trying to realise patterns within documents, e.g., page > numbers, header and footers, title, column information &etc; using the > capabilities of the Poppler PDF library.
Not sure that it will help you much, but you can have a look at DiffPDF which uses poppler to compare two PDF files page by page (both textually and visually): http://www.qtrac.eu/diffpdf.html Best regards, Glad -- Everything that is really great and inspiring is created by the individual who can labor in freedom. -- Albert Einstein, Out of My Later Years (1950) _______________________________________________ poppler mailing list [email protected] http://lists.freedesktop.org/mailman/listinfo/poppler
