Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Tika Wiki" for change 
notification.

The "VirtualMachine" page has been changed by TimothyAllison:
https://wiki.apache.org/tika/VirtualMachine?action=diff&rev1=21&rev2=22

  
  5. start: apachectl start
  
+ == pdftotext ==
+ Downloads from: https://www.xpdfreader.com/download.html
+ 
+ Current version: 4.00; Released: 2017 Aug 10
+ 
+ 1. Downloaded 64-bit Linux {{{XpdfReader}}}; executed: 
{{{XpdfReader-linux64-4.00.01.run}}}; unpacked and cp xpdf to /usr/local/bin
+ 
+ 2. Downloaded 64-bit Linux {{{Xpdf tools}}}; unpacked and cp bin64/* to 
/usr/local/bin
+ 
+ 3. Downloaded language support packages: Arabic, Chinese/simplified, 
Chinese/traditional, Cyrillic, Greek, Hebrew, Japanese, Korean, Latin2, Thai 
and Turkish; unzipped them all, cat all add-to-xpdfrc >> tmp_xpdfrc and cp all 
to  /usr/local/share/xpdf
+ 
+ 4. cat xpdf-tools-linux-4.00/doc/sample-xpdfrc tmp_xpdfrc >> 
/usr/local/etc/xpdfrc
+ 
+ 
  == Other data ==
  See ApacheTikaHtmlEncodingStudy for a description of gathering data for 
TIKA-2038.
  See CommonCrawl3 for a description of refreshing data for TIKA-2750.

Reply via email to