Dear Wiki user, You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification.
The "TikaJAXRS" page has been changed by HaydenYoung: https://wiki.apache.org/tika/TikaJAXRS?action=diff&rev1=22&rev2=23 Comment: Updated installation details. = Introduction = - This page is documentation on tika's JSR 311 network server, tika-server. The server package uses the [[http://cxf.apache.org/|Apache CXF]] framework that provides an implementation of JAX-RS for Java. The Tika server component builds to a standalone package in Tika, tika-server. Releases of Tika version 1.2 and beyond will ship with tika-server enabled, but to get the software before then and to experiment you can follow the below steps: + This page is documentation on tika's JSR 311 network server, tika-server. The server package uses the [[http://cxf.apache.org/|Apache CXF]] framework that provides an implementation of JAX-RS for Java. The Tika server component builds to a standalone package in Tika, tika-server. - 1. svn export http://svn.apache.org/repos/asf/tika/trunk/tika-server + == Installation == + To install: + + 1. Download the latest source from the [[http://tika.apache.org/download.html|Apache Tika download page]], + 1. Build source using Maven, + 1. Run the Apache Tika JAXRS server. + + {{{ + wget http://mirror.vorboss.net/apache/tika/tika-x.x-src.zip + unzip tika-x.x-src + cd ./tika-x.x/ - 1. mvn install + mvn install + cd ./tika-server/target/ - 1. java -jar target/tika-server-X.Y.jar + java -jar tika-server-x.x.jar + }}} + Remember to replace x.x with the version you have downloaded. You will then see a message such as the following: @@ -21, +34 @@ Apr 4, 2012 7:48:50 AM org.apache.tika.server.TikaServerCli main INFO: Started }}} + Which lets you know that it started correctly. + + You can specify additional information to change the host name and port number: + + {{{ + java -jar tika-server-x.x.jar --host=intranet.local --port=12345 + }}} - Which lets you know that it started correctly. Below is some basic documentation on how to interact with the services using cURL and HTTP. + Below is some basic documentation on how to interact with the services using cURL and HTTP. = Services = All services that take files use HTTP "PUT" requests. Original file must be sent in request body without any additional encoding (do not use multipart/form-data or other containers). @@ -76, +96 @@ $ curl -T price.xls http://localhost:9998/tika --header "Accept: text/html" $ curl -T price.xls http://localhost:9998/tika --header "Accept: text/plain" }}} - == Detector Resource == {{{ /detect/stream }}} + HTTP PUTs a document and uses the Default Detector from Tika to identify its MIME/media type. The caveat here is that providing a hint for the filename can increase the quality of detection. - HTTP PUTs a document and uses the Default Detector from Tika to identify its MIME/media type. - The caveat here is that providing a hint for the filename can increase the quality of detection. Default return is a string of the Media type name. @@ -100, +118 @@ {{{ $ curl -X PUT -H "Content-Disposition: attachment; filename=foo.csv" --upload-file foo.csv http://localhost:9998/detect/stream }}} - == Unpacker Resource == {{{ /unpacker @@ -132, +149 @@ {{{ /mime-types }}} - Mime types, their aliases, their supertype, and the parser. Available as plain text, json or human readable HTML == Available Detectors == {{{ /detectors }}} - The top level Detector to be used, and any child detectors within it. Available as plain text, json or human readable HTML == Available Parsers ==
