Re: Html to Pdf

2014-09-05 Thread Alin Mazilu
Since we are suggesting alternatives, I use iText for converting HTML into PDF. Here is an example: http://www.rgagnon.com/javadetails/java-html-to-pdf-using-itext.html Hope that helps, Alin On Fri, Sep 5, 2014 at 1:50 PM, John Hewson j...@jahewson.com wrote: Rendering HTML is very complex,

Re: Problem with processTextPosition

2014-05-17 Thread Alin Mazilu
What are the x and y coordinates of H and W? Alin Mazilu SKE GlobalTech, LLC 3250 West Market St. Suite 307D Fairlawn, OH 44333 Sent from my Galaxy S3 On May 17, 2014 2:42 AM, DImuthu Upeksha dimuthu.upeks...@gmail.com wrote: Hi all, I was trying to manually feed text position objects

Re: Problem with processTextPosition

2014-05-17 Thread Alin Mazilu
:18 PM, Alin Mazilu impet...@gmail.com wrote: What are the x and y coordinates of H and W? Alin Mazilu SKE GlobalTech, LLC 3250 West Market St. Suite 307D Fairlawn, OH 44333 Sent from my Galaxy S3 On May 17, 2014 2:42 AM, DImuthu Upeksha dimuthu.upeks...@gmail.com wrote: Hi all

Re: PDF file characters x and y coordinates

2014-05-16 Thread Alin Mazilu
I process about 2000 PDF files daily and I have never had an issue with the coordinates. One piece of advice though: write your own TextPositionComparator. ~Alin On Fri, May 16, 2014 at 8:39 AM, Simer P sime...@gmail.com wrote: I just needed to confirm this with you guys. Can the X and Y
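The advice above to write your own comparator can be sketched as follows. This is a minimal, library-free illustration and not PDFBox code: `Glyph` is a hypothetical stand-in for a text position object, `lineHeight` is an assumed typical line spacing in points, and y is assumed to grow downward from the top of the page. Glyphs are bucketed into lines by rounding y, then ordered left to right by x.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

// Hypothetical stand-in for a text position: one glyph and its page coordinates.
final class Glyph {
    final String text;
    final float x;
    final float y; // assumed to grow downward from the top of the page

    Glyph(String text, float x, float y) {
        this.text = text;
        this.x = x;
        this.y = y;
    }
}

// Orders glyphs top to bottom by line bucket, then left to right within a line.
final class ReadingOrderComparator implements Comparator<Glyph> {
    private final float lineHeight; // assumed line spacing, in the same units as y

    ReadingOrderComparator(float lineHeight) {
        this.lineHeight = lineHeight;
    }

    // Maps a y coordinate to an integer line index so that small
    // baseline jitter does not split one visual line into several.
    private int lineIndex(Glyph g) {
        return Math.round(g.y / lineHeight);
    }

    @Override
    public int compare(Glyph a, Glyph b) {
        int byLine = Integer.compare(lineIndex(a), lineIndex(b));
        return byLine != 0 ? byLine : Float.compare(a.x, b.x);
    }
}

final class ReadingOrderDemo {
    // Returns the glyph texts concatenated in reading order.
    static String inReadingOrder(List<Glyph> glyphs, float lineHeight) {
        List<Glyph> sorted = new ArrayList<>(glyphs);
        sorted.sort(new ReadingOrderComparator(lineHeight));
        StringBuilder sb = new StringBuilder();
        for (Glyph g : sorted) {
            sb.append(g.text);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        List<Glyph> glyphs = Arrays.asList(
                new Glyph("W", 10f, 24.2f),
                new Glyph("H", 10f, 12.1f),
                new Glyph("o", 18f, 23.8f),
                new Glyph("i", 16f, 11.9f));
        System.out.println(inReadingOrder(glyphs, 12f)); // prints "HiWo"
    }
}
```

Bucketing by line index, rather than comparing y values within a tolerance, keeps the ordering transitive, which `List.sort` requires. The trade-off is that glyphs sitting close to a bucket boundary can land on different lines, so the bucket size needs tuning for the documents at hand.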

Re: PDFTextPositions

2014-04-02 Thread Alin Mazilu
You have to extend the PDFTextStripper class and override the processTextPosition(...) method. From there the logic depends on you. You can also override the writePage() method to grab the charactersByArticle Vector and then you would look for your words in there by iterating over it. Basically in

Problem With MergeUtility

2014-03-13 Thread Alin Mazilu
Hello guys, Has anyone had any problem with this? Any idea why it happens? What would be a good value for pushBackSize so this does not happen? Thanks! Partial stack trace: org.apache.pdfbox.exceptions.WrappedIOException: Could not push back 72940 bytes in order to reparse stream. Try

Re: Problem With MergeUtility

2014-03-13 Thread Alin Mazilu
, 2014 at 11:27 AM, Maruan Sahyoun sahy...@fileaffairs.de wrote: Hi, not a direct answer to your question but could you try PDDocument.loadNonSeq instead? BR Maruan Sahyoun On 13.03.2014 at 16:16, Alin Mazilu impet...@gmail.com wrote: Hello guys, Has anyone had any problem

Re: Problem With MergeUtility

2014-03-13 Thread Alin Mazilu
null as buffer parameter). Best, Timo On 13.03.2014 16:46, Alin Mazilu wrote: Where? Here's the code that causes that: PDFMergeUtility util = new PDFMergeUtility(); for (File file : set) { try{ if( file.exists() ){ util.addSource(file); } } catch ( Exception e

Re: Problem With MergeUtility

2014-03-13 Thread Alin Mazilu
case for using a protected field, because overriding it is the primary mechanism for custom text extraction. Cheers -- John On 13 Mar 2014, at 10:40, Alin Mazilu impet...@gmail.com wrote: Ok, I will try. In my opinion it would be useful if it had the instance variables protected rather

Re: Need JBIG2 test image

2014-03-12 Thread Alin Mazilu
I have scanned police accident reports that contain people's names, addresses and phone numbers. I had a problem printing these files with pdfbox and I had to improvise by using a command prompt print utility as a Process. I could maybe give you one if you agree not to release it to the

Re: Regarding pdf data extraction

2014-03-03 Thread Alin Mazilu
I don't think that class can help you... All you need is the PDFTextStripper class... On Mon, Mar 3, 2014 at 7:15 PM, Divya Muttineni divyamuttin...@gmail.com wrote: I am trying to convert the tabular data from a PDF file to a text (.txt) file. In one of the articles I came across

Error printing...

2014-01-22 Thread Alin Mazilu
Hello all, I am printing some PDFs and I am getting this: Jan 22, 2014 12:07:47 PM org.apache.pdfbox.filter.JBIG2Filter decode SEVERE: Can't find an ImageIO plugin to decode the JBIG2 encoded datastream. Jan 22, 2014 12:07:47 PM org.apache.pdfbox.pdmodel.graphics.xobject.PDPixelMap getRGBImage

Re: Error printing...

2014-01-22 Thread Alin Mazilu
at 1:48 PM, John Hewson j...@jahewson.com wrote: Yes, there is. Simply Google “JBIG2 plugin” and follow the first link, it will be called jbig2-imageio. -- John On 22 Jan 2014, at 09:16, Alin Mazilu impet...@gmail.com wrote: Hello all, I am printing some PDFs and I am getting

Re: PDFTextStripper's writeLine() must be protected!

2013-11-15 Thread Alin Mazilu
Hello, I would venture to guess that if you need to override that method you probably need to do something more complicated than just finding out where a line starts and where it ends. Because if you just need to get the beginning and end of each line, you can override setLineSeparator() and all

Re: PDF Text Highlight

2013-07-27 Thread Alin Mazilu
standard -- the upper left corner. But that's not a problem as it's fixable with simple arithmetic. Thank you so much for your help. It would have taken me a long time to figure it out on my own, if ever. Alin Mazilu On Fri, Jul 26, 2013 at 6:19 PM, Fred Hansen zweibie...@yahoo.com wrote: Caveat
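The "simple arithmetic" mentioned above can be sketched as below, assuming the mismatch is between a top-left page origin and a bottom-left one. The class and method names are hypothetical, and the page height of 792 points (US Letter) is only an example value.

```java
// Minimal sketch: converting y coordinates between a top-left-origin
// system and a bottom-left-origin system on a page of known height.
final class YAxisFlip {
    // Flips a single y coordinate; the same formula works in both directions.
    static float flipY(float y, float pageHeight) {
        return pageHeight - y;
    }

    // For a rectangle whose top edge is at topY (measured from the top of the
    // page) and whose height is rectHeight, returns the y of its lower-left
    // corner measured from the bottom of the page.
    static float lowerLeftY(float topY, float rectHeight, float pageHeight) {
        return pageHeight - topY - rectHeight;
    }

    public static void main(String[] args) {
        float pageHeight = 792f; // example: US Letter height in points
        System.out.println(flipY(100f, pageHeight));           // prints 692.0
        System.out.println(lowerLeftY(100f, 12f, pageHeight)); // prints 680.0
    }
}
```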

PDF Text Highlight

2013-07-26 Thread Alin Mazilu
appreciate it. Alin Mazilu

Re: [DISCUSS] Switch to java 1.6

2013-04-30 Thread Alin Mazilu
... :)) On Tue, Apr 30, 2013 at 1:03 PM, Thomas Chojecki i...@rayman2200.de wrote: Quoting Alin Mazilu impet...@gmail.com: Hello, Hi, I got one: JavaFX. I use PDFBox in projects that use JavaFX 1.7/1.8. I try to find this JavaFX version to see what Java version it needs, but I can't

Re: [DISCUSS] Switch to java 1.6

2013-04-28 Thread Alin Mazilu
Hello, I got one: JavaFX. I use PDFBox in projects that use JavaFX 1.7/1.8. Alin On Sun, Apr 28, 2013 at 1:35 PM, Andreas Lehmkuehler andr...@lehmi.de wrote: Hi, there was already a discussion about switching to java 1.6. As this is a very important topic I'd like to move the discussion