#197: Errors in text extraction without running 'make install-pdfa-helper-files'
------------------------+---------------------------------------------------
Reporter: bthiell | Owner: skaplun
Type: defect | Status: assigned
Priority: major | Milestone: v1.0
Component: WebSubmit | Version:
Resolution: | Keywords:
------------------------+---------------------------------------------------
Comment (by bthiell):
So I installed pstotext on my CentOS 5 machine.
The installation process for pstotext was straightforward, as all its
dependencies are available in the default CentOS repositories and/or the
EPEL repository. I downloaded an RPM build here:
http://rpm.pbone.net/index.php3/stat/4/idpl/654271/dir/pld/com/pstotext-1.8g-1.i386.rpm.html
and it installed without a problem.
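
For anyone who wants to confirm that the helper is visible before running
the extraction, here is a minimal sketch. It is not part of Invenio: the
function name missing_tools is hypothetical, and it assumes Python 3.3+
for shutil.which. It only checks that an executable named pstotext is on
the PATH, nothing more.

```python
import shutil


def missing_tools(tools=("pstotext",)):
    """Return the names from `tools` that cannot be found on the PATH.

    Hypothetical helper for illustration only; Invenio may detect its
    external converters differently.
    """
    return [name for name in tools if shutil.which(name) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Not found on PATH: " + ", ".join(missing))
    else:
        print("All helper tools found.")
```

An empty list means every listed tool was found; it says nothing about
whether the tool itself works correctly.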
This partially solves the problem, as I no longer receive as many
exceptions. The only exceptions I receive now come from the extraction of
text from images.
BTW, maybe we could move the text extraction out of "inveniocfg
--load-demo-records", as it makes that step very slow. Probably not
everyone needs the text to be extracted. I would suggest something along
the lines of "inveniocfg --extract-text-from-records". Actually, I think
a separate option would put the new feature forward even more than before.
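
To illustrate the split I have in mind, here is a rough sketch using
argparse. It is not the actual inveniocfg code: the program name
"inveniocfg-sketch", the function names, and the step labels are made up
for illustration, and --extract-text-from-records is only the option
proposed above, not an existing one.

```python
import argparse


def build_parser():
    # Hypothetical stand-in for the real inveniocfg option handling.
    parser = argparse.ArgumentParser(prog="inveniocfg-sketch")
    parser.add_argument("--load-demo-records", action="store_true",
                        help="load the demo records (no text extraction)")
    parser.add_argument("--extract-text-from-records", action="store_true",
                        help="proposed: extract text as a separate step")
    return parser


def plan_steps(argv):
    """Return the list of steps that the given arguments would trigger."""
    args = build_parser().parse_args(argv)
    steps = []
    if args.load_demo_records:
        steps.append("load-demo-records")
    if args.extract_text_from_records:
        # The slow step runs only when explicitly requested.
        steps.append("extract-text-from-records")
    return steps


if __name__ == "__main__":
    print(plan_steps(["--load-demo-records"]))
```

With this layout, loading the demo records alone never triggers the slow
extraction; those who want the text pass both options or run the second
one later.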
--
Ticket URL: <http://invenio-software.org/ticket/197#comment:4>
Invenio <http://invenio-software.org>