#197: Errors in text extraction without running 'make install-pdfa-helper-files'
-----------------------+----------------------------------------------------
Reporter: bthiell | Owner:
Type: defect | Status: new
Priority: major | Milestone:
Component: *general* | Version:
Keywords: |
-----------------------+----------------------------------------------------
For a user following the standard installation procedure,
'inveniocfg --load-demo-records' produces many exceptions (72 in my case).
The exceptions are:
* "ERROR: it's impossible to correctly extract words from
http://localhost/record/8/files/9812226.fig5.ps.gz"
The following problem occurred on <http://localhost> (CDS Invenio
0.99.90.20100628)
>> 2010-07-02 11:44:13 -> InvenioWebSubmitFileError: Unexpected error when
converting from /opt/cds-invenio/var/data/files/g0/11/9812226.fig5.ps.gz;1
to .txt (<type 'instance'>): [Errno 2] No such file or directory:
'/opt/cds-invenio/etc/websubmit/file_converter_templates/ISOCoatedsb.icc'"
* "'Error in extracting text from bibdoc 11, version 1'
The following problem occurred on <http://localhost> (CDS Invenio
0.99.90.20100628)
>> 2010-07-02 11:44:11 -> InvenioWebSubmitFileConverterError: Unexpected
error when converting from
/opt/cds-invenio/var/data/files/g0/11/9812226.fig5.ps.gz;1 to .txt
(<type 'instance'>): [Errno 2] No such file or directory:
'/opt/cds-invenio/etc/websubmit/file_converter_templates/ISOCoatedsb.icc'"
Running 'make install-pdfa-helper-files' fixes this issue.
The problem is that this command is marked as optional when in fact it is
not (otherwise your inbox fills up with exceptions). So we should do one of
the following: mark this command as mandatory, run it silently as part of
'make install', or skip text extraction when the command has not been run.
A single email saying that the extraction could not be completed because
the PDF/A helper files are missing may be enough to warn the user.
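
As a rough illustration of the "skip extraction and warn once" option, here
is a minimal sketch. It is not existing Invenio code: the function names and
the `warn` callback are hypothetical, and the only helper file checked is
the one named in the errors above (other helper files may also be needed).

```python
import os

# Assumption: file name taken from the error messages in this ticket;
# the real list of PDF/A helper files may be longer.
REQUIRED_HELPER_FILES = ("ISOCoatedsb.icc",)


def missing_pdfa_helper_files(template_dir, required=REQUIRED_HELPER_FILES):
    """Return the required helper files that are absent from template_dir."""
    return [name for name in required
            if not os.path.isfile(os.path.join(template_dir, name))]


def can_extract_text(template_dir, warn):
    """Return True if extraction may proceed.

    Otherwise call warn() once with a single explanatory message and
    return False, so the caller can skip extraction instead of raising
    one exception per file.
    """
    missing = missing_pdfa_helper_files(template_dir)
    if missing:
        warn("Text extraction skipped: PDF/A helper files missing (%s). "
             "Run 'make install-pdfa-helper-files'." % ", ".join(missing))
        return False
    return True
```

A caller would run can_extract_text() once, before looping over the demo
records, passing the file_converter_templates directory and whatever
function sends the admin email as `warn`.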
--
Ticket URL: <http://invenio-software.org/ticket/197>
Invenio <http://invenio-software.org>