Is there any plan to support some basic editing operations, some of which pdftk can do, like rearranging page order, renumbering pages, editing metadata or OCR text inside the PDF? I saw in the Qt4 binding documentation that it's possible to write an open PDF document as a new PDF, and there is a flag to preserve changes or not, but what are the changes that it supports?
I'm scanning a bunch of old magazines that take up too much space in boxes (Radio-Electronics, Popular Science etc.) and was thinking of writing a program to recognize the name and date of each scan (look for the known magazine titles, month names etc. in the margins), and auto-number the pages (look for page numbers in known likely locations). I confirmed that GOCR is good enough to extract page numbers from page images. I could probably just use pdftk to do the renumbering, but also thought of making a better integrated tool rather than just a scripting solution. I actually bought Acrobat to do OCR on the scans, but it does not have such features. _______________________________________________ poppler mailing list [email protected] http://lists.freedesktop.org/mailman/listinfo/poppler
