Re: Detecting CID fonts

2021-05-06 Thread jorgeeflorez
> Tilman > > On 06.05.2021 at 14:16, jorgeeflorez wrote: > > Hi Tilman, > > thank you for your reply. > > > > It's more complicated because form XObjects, patterns, annotations, > >> softmasks (and maybe more) can also have fonts. I also doubt that you

Re: Detecting CID fonts

2021-05-06 Thread jorgeeflorez
One last question: how could I modify the RemoveAllText example to remove all images from the PDF file? Thanks. Jorge On Thu, 6 May 2021 at 1:07, Tilman Hausherr wrote: > On 05.05.2021 at 18:39, jorgeeflorez wrote: > > Hi, > > I would like to know what would be the

Detecting CID fonts

2021-05-05 Thread jorgeeflorez
Hi, I would like to know what would be the best way to detect whether a PDF file has CID fonts. As far as I understand, these fonts are used in Asian texts (Japanese, Chinese, Korean, etc.). I have the following code: PDDocument doc = PDDocument.load(myFile); for (int i = 0; i <

Re: Text extraction example

2018-11-12 Thread jorgeeflorez
Hi. OK, I understand. Never mind :) Thanks. On Mon, 12 Nov 2018 at 11:16 p.m., Tilman Hausherr <thaush...@t-online.de> wrote: > On 12.11.2018 at 19:56, jorgeeflorez wrote: > > Hi all, > > > > first, I want to thank Tilman for his effort getting the text fro

Text extraction example

2018-11-12 Thread jorgeeflorez
Hi all, First, I want to thank Tilman for his effort in getting the text from a page regardless of its rotation (https://issues.apache.org/jira/browse/PDFBOX-4371). Second, I want to share with you a small application I created using C#. It uses the iTextSharp library and a custom text extraction

Re: users Digest 6 Nov 2018 21:30:49 -0000 Issue 1773

2018-11-06 Thread jorgeeflorez
Tilman, I am sorry, I did not read your last message before writing mine :( I am currently in "Digest" mode, and as soon as I sent my message, the digest arrived (odd). Thank you. On Tue, 6 Nov 2018 at 16:30, wrote: > > -- Forwarded message -- > From: Tilman Hausherr >

Re: Extracting page "correctly"

2018-11-06 Thread jorgeeflorez
Thanks a lot, Tilman, for your help. It seems to me that some improvements can be made to text extraction from a page (I used PDFBox 2.0.11). The idea, I think, is that one could just invoke a method and get the text of the page, just as you would get it if you select the text from

Re: Extracting page "correctly"

2018-11-05 Thread jorgeeflorez
Nov 2018 10:35:30 +0100 > Subject: Re: Extracting page "correctly" > On 02.11.2018 at 23:37, jorgeeflorez wrote: > > > > The text I get is better than the first one, but it mixes the text > > from left and right "columns" (please see the bold text). >

Extracting page "correctly"

2018-11-02 Thread jorgeeflorez
Hi all, I want to extract the text from the page of this PDF file. I am using the following code to achieve it: try (PDDocument document = PDDocument.load(new File(fileName))) { PDFTextStripper stripper = new

Rendering large pages

2017-10-04 Thread jorgeeflorez .
Hello all, When I try to render a large PDF page as an image I get: Exception in thread "main" java.lang.NegativeArraySizeException at java.awt.image.DataBufferByte.<init>(DataBufferByte.java:76) at java.awt.image.Raster.createInterleavedRaster(Raster.java:266) at
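
One possible reading of this stack trace, offered as an assumption rather than a confirmed diagnosis: the byte count of the raster (width x height x bands) exceeded Integer.MAX_VALUE, so the int size wrapped around to a negative value. The sketch below uses only the JDK to check this before rendering. The class name, method names, and the example page size are hypothetical; the inputs are the page size in PDF points (1/72 inch) and the intended render DPI.

```java
public class RasterSizeCheck {

    /** Converts a page dimension in PDF points (1/72 inch) to pixels at the given DPI. */
    public static long pixels(double points, double dpi) {
        return (long) Math.ceil(points * dpi / 72.0);
    }

    /** Bytes needed by an interleaved byte raster: one byte per band per pixel. */
    public static long rasterBytes(long widthPx, long heightPx, int bands) {
        // Computed in long arithmetic so the result cannot wrap the way an int would.
        return widthPx * heightPx * bands;
    }

    /** True if the raster could fit in a single int-indexed Java byte array. */
    public static boolean fitsInByteArray(long widthPx, long heightPx, int bands) {
        // Integer.MAX_VALUE is only an upper bound; the available heap may be lower.
        return rasterBytes(widthPx, heightPx, bands) <= Integer.MAX_VALUE;
    }

    public static void main(String[] args) {
        // Hypothetical large page: 200 x 200 inches (14400 x 14400 points), RGB (3 bands).
        double widthPts = 14400;
        double heightPts = 14400;
        int bands = 3;
        for (double dpi : new double[] {72, 150, 300}) {
            long w = pixels(widthPts, dpi);
            long h = pixels(heightPts, dpi);
            System.out.printf("%.0f dpi: %d x %d px, %d bytes, fits=%b%n",
                    dpi, w, h, rasterBytes(w, h, bands), fitsInByteArray(w, h, bands));
        }
    }
}
```

If the check fails, lowering the DPI or rendering the page in tiles are the usual ways to stay under the limit.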

Re: (JAI) Image I/O Tools are not installed

2017-02-28 Thread jorgeeflorez .
ush...@t-online.de>: > On 28.02.2017 at 19:23, jorgeeflorez wrote: > >> Hi Tilman, >> >> this is what I get: >> >> [JPG, jpg, bmp, BMP, gif, GIF, WBMP, png, PNG, wbmp, jpeg, JPEG] >> > > Then it's really not there... JBIG2 is also missing. I'

Re: (JAI) Image I/O Tools are not installed

2017-02-28 Thread jorgeeflorez .
Hi Tilman, this is what I get: [JPG, jpg, bmp, BMP, gif, GIF, WBMP, png, PNG, wbmp, jpeg, JPEG] 2017-02-28 13:04 GMT-05:00 Tilman Hausherr <thaush...@t-online.de>: > On 28.02.2017 at 18:56, jorgeeflorez wrote: > >> Hi Maruan, >> >> thank you for your re
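
The list above looks like the output of ImageIO.getReaderFormatNames(); that is an assumption, since the quoted request is cut off in this preview. A minimal JDK-only sketch of such a check follows. The class name and helper method are hypothetical, and the probed format names are examples only ("jbig2" assumes that is the name a JBIG2 plugin would register).

```java
import java.util.Arrays;
import javax.imageio.ImageIO;

public class ImageIoFormats {

    /** True if at least one registered ImageIO reader claims the given format name. */
    public static boolean hasReader(String formatName) {
        return ImageIO.getImageReadersByFormatName(formatName).hasNext();
    }

    public static void main(String[] args) {
        // Rescan the classpath in case plugin jars were added after ImageIO was first used.
        ImageIO.scanForPlugins();

        // Print every format name for which a reader is currently registered.
        String[] names = ImageIO.getReaderFormatNames();
        Arrays.sort(names);
        System.out.println(Arrays.toString(names));

        // Probe a few example names; the result depends on which plugins are installed.
        for (String name : new String[] {"png", "jpeg", "jbig2"}) {
            System.out.println(name + " reader available: " + hasReader(name));
        }
    }
}
```

A format that is absent from the printed list has no reader on the classpath, which matches the situation described in this thread.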

Re: (JAI) Image I/O Tools are not installed

2017-02-28 Thread jorgeeflorez .
ahy...@fileaffairs.de>: > Hi, > > please take a look at https://pdfbox.apache.org/2.0/dependencies.html > under Optional Components/JAI Image I/O > > Hope that helps. > > Best regards > > Maruan > > > > On 28.02.2017 at 17:19, jorgeeflorez wrote:

Re: Base fonts not found

2016-10-03 Thread jorgeeflorez .
to /usr/share/fonts I have another question... Are these fonts free to copy, or will I need to purchase them one by one? Thank you. Best Regards. 2016-10-03 12:55 GMT-05:00 John Hewson <j...@jahewson.com>: > > > On 3 Oct 2016, at 06:20, jorgeeflorez . <jorgeeduardoflo...@gmail.co