Am 26.08.2014 15:32, schrieb Tilman Hausherr:
We have problems recognizing YCCK / CMYK jpeg files.

I can't find the issue quickly, it was a few weeks ago from a french person and about an image about a Porsche event.

https://issues.apache.org/jira/browse/PDFBOX-2128

Tilman




Anyway, what worked for me (most of the time) is to use Apache Imaging to detect the image type.

I wrote "most of the time" because it is not perfect, although better than java.

https://issues.apache.org/jira/browse/IMAGING-136 <https://issues.apache.org/jira/browse/IMAGING-136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel>

I'm not sure if Apache Imaging is still an active project.

Tilman

Am 26.08.2014 14:34, schrieb Timo Boehme:
Hi,

checking the rendering capabilities of PDFBOX 1.8 vs. current trunk I came across a journal which showed severe problems in both - but different. Problems of 1.8 are gone, new ones showed up.

While the journal (Chemical&Engineering News, C&EN) does not provide free PDF editions a sample edition can be downloaded via 'View a sample issue' at http://cen.acs.org/static/about/digital.html (or directly via http://www.cendigital.org/cendigital/sample/). I'm referring to volume 92, nr 27 from 2014-07-07 which I downloaded yesterday but the same problems also showed up in other journal issues.

The problems (all on Linux, Java 1.6):
- PDFBOX 1.8 (svn 1620380)
  - first letters of words in headlines are sometimes missing, e.g. on
    page 2 "Getting ..." reads " et ing ...", "Overview" -> " verview"
  - bad character spacing because of substituted font

- PDFBOX trunk (svn 1620415)
  - no missing letters but heavily distorted and displaced
    letters in headlines (e.g. page 2)
  - compared to 1.8 correct font is used
  - picture colors are completely wrong;
    logged warning: org.apache.pdfbox.filter.DCTFilter decode
                    WARNUNG: Inconsistent metadata read from JPEG stream
  - transparent background instead of white

- PDFBOX trunk, no-awt svn 1620487
  - font rendering ok
  - picture/background problems as in trunk

Since these are multiples problems on different versions and the PDF is not freely distributable I did not create a JIRA issue. Nevertheless it is a widely distributed journal and a good test case for the rendering quality. At least the JPEG rendering problem of the current trunk should be solved.


Best,
Timo




Reply via email to