Quoth Gert Schmeltz Pedersen on Mon, Aug 24, 2009 at 05:49:01AM -0400:
> If you can build a PDFBox.jar from the tip of the trunk, then you could
> simply try to substitute it for the old jar. I would like to know, if that
> would solve the problem. 

Thanks for your reply!

Unfortunately, no. PDFBox have recently moved their Java packages from
org.pdfbox to org.apache.pdfbox, so I can’t just grab a PDFBox.jar from the
tip of trunk and use it. It may be possible to build a trunk PDFBox.jar and
then modify the GSearch imports to use the new packages, but I haven’t
tested that yet: I’m trying to contain my changes as much as possible for
now. 
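
A minimal sketch of the import change described above, for anyone who wants to try it. It rewrites only `import` lines from org.pdfbox to org.apache.pdfbox. The class name `PdfBoxImportRewriter` is hypothetical and is not part of GSearch or PDFBox. The sketch assumes the class names below the package prefix are unchanged in trunk, which has not been verified here.

```java
import java.util.regex.Pattern;

// Hypothetical helper for illustration; not part of GSearch or PDFBox.
final class PdfBoxImportRewriter {
    // Matches "import org.pdfbox." (optionally "import static") at the start of a line.
    private static final Pattern OLD_IMPORT =
        Pattern.compile("(?m)^([ \\t]*import[ \\t]+(?:static[ \\t]+)?)org\\.pdfbox\\.");

    // Returns the source text with old-style PDFBox imports pointed at the new package.
    // Assumption: only the package prefix changed, not the class names beneath it.
    static String rewrite(String source) {
        return OLD_IMPORT.matcher(source).replaceAll("$1org.apache.pdfbox.");
    }
}
```

Fully qualified references inside method bodies are left alone, so a compile against the new jar would still be needed to catch anything this misses.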

> There is no exact plan for a next release of GSearch. The current one also
> works with Fedora 3.2.1. The intention is to go for the next release, when
> Lucene and Solr release again, then including the newest PDFBox and
> probably other new ones together with a few minor improvements.

Thanks for clarifying!

For anyone else who may have a similar issue, I’ve confirmed that my
problems were caused by this PDFBox bug:
https://issues.apache.org/jira/browse/PDFBOX-361
This is a bug in the PDFBox parser that causes it to fail to parse
cross-reference streams. Cross-reference streams are a feature introduced in
PDF 1.5. Adobe Acrobat 9 was the first release of that program to produce
cross-reference streams.
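
To check whether a given PDF is likely to hit this bug, one rough approach is to follow the final `startxref` offset and see what it points at: the `xref` keyword means a classic table, while an indirect object header ("N G obj") suggests a cross-reference stream. The sketch below does not use PDFBox; the class name `XrefProbe` is hypothetical, and it reads the whole file into memory for simplicity. Hybrid files, which carry a classic table plus an `/XRefStm` entry in the trailer, are reported as TABLE.

```java
import java.nio.charset.StandardCharsets;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical diagnostic helper; not part of PDFBox or GSearch.
final class XrefProbe {
    enum Kind { TABLE, STREAM, UNKNOWN }

    // Header of an indirect object: object number, generation number, "obj".
    private static final Pattern OBJ_HEADER = Pattern.compile("\\d+\\s+\\d+\\s+obj");

    static Kind probe(byte[] pdf) {
        // ISO-8859-1 maps each byte to one char, so string indices equal byte offsets.
        String text = new String(pdf, StandardCharsets.ISO_8859_1);
        int marker = text.lastIndexOf("startxref");
        if (marker < 0) return Kind.UNKNOWN;

        // Skip whitespace, then read the decimal byte offset that follows the keyword.
        int i = marker + "startxref".length();
        while (i < text.length() && Character.isWhitespace(text.charAt(i))) i++;
        int start = i;
        while (i < text.length() && text.charAt(i) >= '0' && text.charAt(i) <= '9') i++;
        if (i == start) return Kind.UNKNOWN;

        long offset;
        try {
            offset = Long.parseLong(text.substring(start, i));
        } catch (NumberFormatException e) {
            return Kind.UNKNOWN;
        }
        if (offset >= text.length()) return Kind.UNKNOWN;
        int off = (int) offset;

        // Classic cross-reference section starts with the "xref" keyword.
        if (text.startsWith("xref", off)) return Kind.TABLE;

        // Otherwise expect an indirect object holding the cross-reference stream.
        Matcher m = OBJ_HEADER.matcher(text);
        m.region(off, text.length());
        return m.lookingAt() ? Kind.STREAM : Kind.UNKNOWN;
    }
}
```

For a real file, something like `XrefProbe.probe(java.nio.file.Files.readAllBytes(path))` would do; a STREAM result only indicates that the file uses the feature the bug concerns, not that parsing will necessarily fail.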

The bug description contains a patch. PDFBox fixed the bug by applying the
patch to their repository. I’ve modified the patch to apply to PDFBox 0.7.2
(the version supplied with GSearch 2.2) and attached it below.

PDFBox 0.7.2 source is available from:
http://sourceforge.net/projects/pdfbox/files/

-- 
Ben Ranker <[email protected]>
Emory University Libraries

Attachment: PDFBox-0.7.2.xref.patch.gz
Description: Binary data


_______________________________________________
Fedora-commons-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/fedora-commons-users
