Primary uses: 1. Text extraction to facilitate full-text indexing of PDFs
2. Constructing PDFs for “art manuscripts”: taking a set of images and associated metadata from a CMS and producing PDFs that show the images and the metadata, one per page. PDF includes a QR code that captures essential CMS details, such as object ID. Use PDF box to read PDFs that are scans of these previously-generated pages (e.g., somebody prints the original PDF, marks on it, scans it back to a new PDF), extract the QR code, and correlate the scanned page image to the original image from which the first PDF was generated. Cheers, Eliot -- Eliot Kimber Senior Solutions Architect "Bringing Strategy, Content, and Technology Together" Main: 512.554.9368 www.reallysi.com www.rsuitecms.com On 1/6/14, 5:31 AM, "Maruan Sahyoun" <[email protected]> wrote: >Dear PDFBox users, > >we’d love to hear from you how you are using PDFBox in your PDF >applications. Do you use it for rendering, merging, creation … - what is >the main application? > >As we are planning for PDFBox 2.0 there are already a lot of ideas what >could be done in that release. Your input will help us to better >understand where we could put our focus. > >Please understand that we will take your input seriously but as this is a >volunteers effort we can not commit to a certain functionality. And if >you’d like to help you’re always welcome to do so. > >Thanks a lot for your feedback! > >Maruan Sahyoun >

