Re: [SURVEY] PDFBox Uses Cases

Eliot Kimber Mon, 06 Jan 2014 06:27:57 -0800

Primary uses:

1. Text extraction to facilitate full-text indexing of PDFs

2. Constructing PDFs for “art manuscripts”: taking a set of images and
associated metadata from a CMS and producing PDFs that show the images and
the metadata, one per page. Each PDF includes a QR code that captures
essential CMS details, such as the object ID. We also use PDFBox to read
PDFs that are scans of these previously generated pages (e.g., somebody
prints the original PDF, marks on it, scans it back to a new PDF), extract
the QR code, and correlate the scanned page image to the original image
from which the first PDF was generated.
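
The correlation step in item 2 can be sketched without any PDF or barcode
library. This is only an illustration under assumptions: the key=value
payload format, the class name ScanCorrelator, and the in-memory map
standing in for the CMS are all hypothetical and not taken from the actual
system. Decoding the QR image itself is left out; the sketch starts from
the string already read from the code.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Optional;

// Hypothetical sketch: none of these names come from the real system.
final class ScanCorrelator {

    // Build the text to encode in the QR code, e.g. "objectId=A-17;page=3".
    // Assumes the object ID itself contains neither ';' nor '='.
    static String encodePayload(String objectId, int page) {
        return "objectId=" + objectId + ";page=" + page;
    }

    // Parse a decoded QR string back into key/value pairs.
    // Parts without a key before '=' are skipped.
    static Map<String, String> parsePayload(String payload) {
        Map<String, String> fields = new LinkedHashMap<>();
        for (String part : payload.split(";")) {
            int eq = part.indexOf('=');
            if (eq > 0) {
                fields.put(part.substring(0, eq).trim(),
                           part.substring(eq + 1).trim());
            }
        }
        return fields;
    }

    // Find the original image for a scanned page via the object ID
    // carried in its QR payload. The map stands in for a CMS lookup.
    static Optional<String> findOriginalImage(
            String payload, Map<String, String> imagesByObjectId) {
        String objectId = parsePayload(payload).get("objectId");
        return Optional.ofNullable(objectId).map(imagesByObjectId::get);
    }

    public static void main(String[] args) {
        Map<String, String> cms = Map.of("A-17", "images/a-17.tif");
        String payload = encodePayload("A-17", 3);
        System.out.println(findOriginalImage(payload, cms).orElse("no match"));
    }
}
```

Keeping the payload to a small, fixed set of fields keeps the QR code
compact, which should help it survive printing, marking up, and rescanning.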

Cheers,

Eliot
-- 
Eliot Kimber
Senior Solutions Architect
"Bringing Strategy, Content, and Technology Together"
Main: 512.554.9368
www.reallysi.com
www.rsuitecms.com

On 1/6/14, 5:31 AM, "Maruan Sahyoun" <[email protected]> wrote:

>Dear PDFBox users,
>
>we’d love to hear from you how you are using PDFBox in your PDF
>applications. Do you use it for rendering, merging, creation … - what is
>the main application?
>
>As we are planning for PDFBox 2.0 there are already a lot of ideas what
>could be done in that release. Your input will help us to better
>understand where we could put our focus.
>
>Please understand that we will take your input seriously but as this is a
>volunteer effort we cannot commit to a certain functionality. And if
>you’d like to help you’re always welcome to do so.
>
>Thanks a lot for your feedback!
>
>Maruan Sahyoun
>
