#197: Errors in text extraction without running 'make install-pdfa-helper-files'
------------------------+---------------------------------------------------
  Reporter:  bthiell    |       Owner:  skaplun 
      Type:  defect     |      Status:  assigned
  Priority:  major      |   Milestone:  v1.0    
 Component:  WebSubmit  |     Version:          
Resolution:             |    Keywords:          
------------------------+---------------------------------------------------

Comment (by bthiell):

 So I installed pstotext on my CentOS 5 machine.

 The installation process for pstotext was straightforward as all its
 dependencies are available on the default CentOS repositories and/or the
 EPEL repository. I downloaded a RPM build here:
 
http://rpm.pbone.net/index.php3/stat/4/idpl/654271/dir/pld/com/pstotext-1.8g-1.i386.rpm.html
 and it installed without a problem.

 This solves partially the problem as I don't receive as many exceptions.
 Now I receive only the exceptions because of the extraction of texts from
 images.

 BTW maybe we could have the text extraction done outside of "inveniocfg
 --load-demo-records" as it makes it very slow. Probably not everyone needs
 the texts to be extracted. I would suggest something along the line of
 "inveniocfg --extract-text-from-records". Actually I think that it does
 put the new feature even more forward than previously.

-- 
Ticket URL: <http://invenio-software.org/ticket/197#comment:4>
Invenio <http://invenio-software.org>

Reply via email to