Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification.

The "VirtualMachine" page has been changed by TimothyAllison:
https://wiki.apache.org/tika/VirtualMachine?action=diff&rev1=21&rev2=22

   5. start: apachectl start
  
+ == pdftotext ==
+ Downloads from: https://www.xpdfreader.com/download.html
+ 
+ Current version: 4.00; Released: 2017 Aug 10
+ 
+  1. Downloaded 64-bit Linux {{{XpdfReader}}}; executed: {{{XpdfReader-linux64-4.00.01.run}}}; unpacked and cp xpdf to /usr/local/bin
+ 
+  2. Downloaded 64-bit Linux {{{Xpdf tools}}}; unpacked and cp bin64/* to /usr/local/bin
+ 
+  3. Downloaded language support packages: Arabic, Chinese/simplified, Chinese/traditional, Cyrillic, Greek, Hebrew, Japanese, Korean, Latin2, Thai and Turkish; unzipped them all, cat all add-to-xpdfrc >> tmp_xpdfrc and cp all to /usr/local/share/xpdf
+ 
+  4. cat xpdf-tools-linux-4.00/doc/sample-xpdfrc tmp_xpdfrc >> /usr/local/etc/xpdfrc
+ 
+ == Other data ==
  See ApacheTikaHtmlEncodingStudy for a description of gathering data for TIKA-2038.
  
  See CommonCrawl3 for a description of refreshing data for TIKA-2750.
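
Steps 3 and 4 of the pdftotext section above (collect every add-to-xpdfrc fragment, copy the language packages, then append the sample config plus the fragments to xpdfrc) could be scripted roughly as follows. This is only a sketch: the function name build_xpdfrc and its four parameters are hypothetical, it assumes each language package was unzipped into its own subdirectory containing a file named add-to-xpdfrc, and it does not check that the paths referenced inside those fragments match where the packages end up.

```shell
#!/bin/sh
# Sketch of steps 3 and 4. build_xpdfrc is a hypothetical helper name;
# all paths are passed in so nothing is hard-coded.
build_xpdfrc() {
    lang_root=$1   # directory holding the unzipped language packages
    sample_rc=$2   # e.g. xpdf-tools-linux-4.00/doc/sample-xpdfrc
    share_dir=$3   # e.g. /usr/local/share/xpdf
    out_rc=$4      # e.g. /usr/local/etc/xpdfrc

    tmp_rc=$(mktemp) || return 1
    mkdir -p "$share_dir" "$(dirname "$out_rc")" || return 1

    for pkg in "$lang_root"/*; do
        # Skip anything that is not a package directory with a fragment.
        [ -d "$pkg" ] || continue
        [ -f "$pkg/add-to-xpdfrc" ] || continue
        # Step 3: collect the fragment and copy the whole package.
        cat "$pkg/add-to-xpdfrc" >> "$tmp_rc"
        cp -R "$pkg" "$share_dir/"
    done

    # Step 4: sample config first, then the collected fragments.
    # This appends, as in the wiki step, so a second run duplicates entries.
    cat "$sample_rc" "$tmp_rc" >> "$out_rc"
    rm -f "$tmp_rc"
}

# Example call with the paths from the wiki steps (needs write access
# to /usr/local; ./langs is an assumed unzip location):
# build_xpdfrc ./langs xpdf-tools-linux-4.00/doc/sample-xpdfrc \
#     /usr/local/share/xpdf /usr/local/etc/xpdfrc
```

Passing the paths as arguments makes it possible to try the merge in a scratch directory before touching /usr/local.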
