Hello,

> On 25 Dec 2014, at 09:59, A.M. Sabuncu <[email protected]> wrote:
>
> Have a feeling maybe I am missing something rudimentary.
>
> I am running tika-server on an AWS Ubuntu instance, and issuing the curl
> commands from a Windows 7 system. I downloaded and built Tika 1.6 from
> apache.org/dist/tika, with timestamp 2014-09-05 05:42.
>
> Thanks so much, happy holidays.
I have run a couple of tests on OS X, Ubuntu and a Windows box, building the Tika 1.6 source[1] with either Sun Java or OpenJDK 7. Each time this works as expected.

Also, I am able to process the PDF using an instance of the Apache Tika OpenShift cartridge[2] based on tika-server-1.6.jar[3]:

    curl -T GeoSPARQL.pdf http://tikaserver-logicalspark.rhcloud.com/tika

Given the above, I am wondering whether there is something environmental within your EC2 instance. Are you able to share the following:

Java version - i.e. java -version
JDK version - i.e. javac -version
EC2 instance type - e.g. t1.micro, t2.small, etc.
Memory available - i.e. output of less /proc/meminfo

Thanks,
Dave

[1] http://www.apache.org/dist/tika/tika-1.6-src.zip
[2] https://github.com/LogicalSpark/openshift-tika-cartridge
[3] http://www.apache.org/dist/tika/tika-server-1.6.jar
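
The four details requested above could be gathered in one pass on the EC2 instance with a small shell script. This is only a sketch: the function names report and collect_env_info are made up for illustration, and the instance-type lookup assumes the EC2 instance metadata service answers plain requests at its usual link-local address (it may instead require a session token, in which case that step will just report a failure).

```shell
#!/bin/sh
# Hypothetical helper: gathers the environment details requested above.
# Each step is optional and reports a problem instead of aborting.

report() {
    # $1 = section label, remaining arguments = command to run
    label=$1
    shift
    printf '== %s ==\n' "$label"
    if command -v "$1" >/dev/null 2>&1; then
        # java and javac print their version to stderr, so merge the streams
        "$@" 2>&1 || echo "(command failed)"
    else
        echo "($1 not found)"
    fi
    echo
}

collect_env_info() {
    report "Java version" java -version
    report "JDK version" javac -version
    # Assumption: the instance metadata service is reachable without a token;
    # outside EC2 this request simply times out after two seconds.
    report "EC2 instance type" \
        curl -s --max-time 2 http://169.254.169.254/latest/meta-data/instance-type
    report "Memory available" cat /proc/meminfo
}

collect_env_info
```

Saving this as, say, tika-env.sh and running "sh tika-env.sh > env.txt" would give a single file that can be pasted into a reply.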
