It might be possible to extract the text you want, but I don't think PDFBox
has a built-in method that will do it. The extraction would have to be based
on the text's location on the page, its context (its relation to other
text), a certain pattern, or something similar. Each of these approaches can
be implemented, but it would require writing a custom tool.
It's very hard to say for sure without seeing the actual file, though.
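
To give an idea of what the pattern-based approach could look like, here is a
rough sketch in plain Java. It does not call PDFBox at all: it assumes you
have already pulled the page text into a String (for example with
PDFTextStripper), and that each table row ends up on its own line as
"Label: value value value" with at least two spaces between columns. Real
extracted text may well not look like that, in which case a location-based
approach (for example PDFTextStripperByArea) is probably the better route.
The class name, method names and sample text are made up for illustration.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of PDFBox.
final class TableLookup {

    // Sample text shaped like the table in the question (assumed layout).
    static final String SAMPLE =
        "Company Name:   Barnes & Noble   Bank Of America   Macy's\n" +
        "Account #:      123xxxxx         345xxxx           679xxxx\n" +
        "Status:         Open             Closed            Open\n" +
        "Balance:        $23.             $0.00             $100\n";

    // Turns each "Label: v1  v2  v3" line into label -> [v1, v2, v3].
    // Columns are split on runs of two or more whitespace characters,
    // so values containing single spaces ("Bank Of America") stay intact.
    static Map<String, List<String>> parseRows(String text) {
        Map<String, List<String>> rows = new LinkedHashMap<>();
        for (String line : text.split("\\R")) {
            int colon = line.indexOf(':');
            if (colon < 0) {
                continue; // not a "Label: ..." line
            }
            String label = line.substring(0, colon).trim();
            String rest = line.substring(colon + 1).trim();
            if (rest.isEmpty()) {
                continue; // label with no values
            }
            rows.put(label, Arrays.asList(rest.split("\\s{2,}")));
        }
        return rows;
    }

    // Finds the column whose value in keyRow equals keyValue and returns
    // the value in the same column of wantedRow, or null if not found.
    static String lookup(Map<String, List<String>> rows,
                         String keyRow, String keyValue, String wantedRow) {
        List<String> keys = rows.get(keyRow);
        List<String> values = rows.get(wantedRow);
        if (keys == null || values == null) {
            return null;
        }
        int col = keys.indexOf(keyValue);
        if (col < 0 || col >= values.size()) {
            return null;
        }
        return values.get(col);
    }

    public static void main(String[] args) {
        Map<String, List<String>> rows = parseRows(SAMPLE);
        System.out.println(lookup(rows, "Company Name", "Bank Of America", "Status"));
        System.out.println(lookup(rows, "Company Name", "Bank Of America", "Balance"));
    }
}
```

Note that this breaks as soon as a cell is empty (the columns shift) or two
columns are separated by a single space, which is why the text's position on
the page is usually the more reliable thing to key on.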

On Mon, Feb 2, 2015 at 4:29 PM, Lorena Leishman <
[email protected]> wrote:

> Hi,
> I have a PDF that has information displayed on tables. Example:
> Company Name:   Barnes & Noble   Bank Of America   Macy's
> Account #:      123xxxxx         345xxxx           679xxxx
> Status:         Open             Closed            Open
> Balance:        $23.             $0.00             $100
> Is there a way with PDFbox to extract a specific value(s) from the table?
> Example: Bank Of America  and $0.00
> And also is there a way to cut the whole table and paste it into a
> different PDF?
> Please let me know, Thanks!
> Lorena
