RE: PDF Region Parsing

2009-09-15 Thread Martinez, Mel
...@gmail.com] Sent: Tuesday, September 15, 2009 11:52 AM To: pdfbox-dev@incubator.apache.org Subject: Re: PDF Region Parsing Andy, An easy way to extract Chapter 3 and Chapter 5 is to extract text page by page using the org.apache.pdfbox.util.PDFTextStripper class and if possible use regular

Re: PDF Region Parsing

2009-09-15 Thread Navendu Garg
Andy, An easy way to extract Chapter 3 and Chapter 5 is to extract text page by page using the org.apache.pdfbox.util.PDFTextStripper class and if possible use regular expressions to determine the starting and ending pages for each chapter. I personally like the idea of extending PDFTextStripper c

Re: PDF Region Parsing

2009-09-15 Thread Adam
ator.apache.org To pdfbox-dev@incubator.apache.org cc Subject PDF Region Parsing Hi all, I am new to PDFBox and want to ask a few questions to make sure that PDFBox is the right choice for what I want to do. Is there a way to use PDFBox so that I am able to extract specific portions of

PDF Region Parsing

2009-09-15 Thread listan...@gmail.com
Hi all, I am new to PDFBox and want to ask a few questions to make sure that PDFBox is the right choice for what I want to do. Is there a way to use PDFBox so that I am able to extract specific portions of a PDF document - for example "Chapter 3" and "Chapter 5" from an e-book? What kind of auxil