Dear Wiki user, You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification.
The "TikaJAXRS" page has been changed by maxcom: http://wiki.apache.org/tika/TikaJAXRS?action=diff&rev1=8&rev2=9 Comment: fix formatting = Introduction = - This page is documentation on tika's JSR 311 network server, tika-server. The server package uses the [[http://cxf.apache.org/|Apache CXF]] framework that provides an implementation of JAX-RS for Java. The Tika server component builds to a standalone package in Tika, tika-server. Releases of Tika version 1.2 and beyond will ship with tika-server enabled, but to get the software before then and to experiment you can follow the below steps: 1. svn export http://svn.apache.org/repos/tika/trunk/tika-server - 2. mvn install + 1. mvn install - 3. java -jar target/tika-server-X.Y.jar + 1. java -jar target/tika-server-X.Y.jar You will then see a message such as the following: {{{ - $ java -jar target/tika-server-1.2-SNAPSHOT.jar + $ java -jar target/tika-server-1.2-SNAPSHOT.jar Apr 4, 2012 7:48:49 AM org.apache.tika.server.TikaServerCli main INFO: Starting Tikaserver ${project.version} Apr 4, 2012 7:48:50 AM org.apache.cxf.endpoint.ServerImpl initDestination @@ -21, +20 @@ 2012-04-04 07:48:50.399:INFO:oejsh.ContextHandler:started o.e.j.s.h.ContextHandler{,null} Apr 4, 2012 7:48:50 AM org.apache.tika.server.TikaServerCli main INFO: Started - }}} + }}} - - Which lets you know that it started correctly. Below is some basic documentation on how to + Which lets you know that it started correctly. Below is some basic documentation on how to interact with the services using cURL and HTTP. - interact with the services using cURL and HTTP. = Services = - Add services uses HTTP "PUT" request. Original file must be sent in request body without any additional encoding (do not use multipart/form-data or other containers). You may optionally specify content type in "Content-Type" header. If you do not specify mime type, Tika will use its detectors to guess it. @@ -36, +32 @@ Resources may return following HTTP codes: - * 200 Ok - request completed sucessfully + * 200 Ok - request completed sucessfully - * 204 No content - request completed sucessfully, result is empty + * 204 No content - request completed sucessfully, result is empty - * 422 Unprocessable Entity - Unsupported mime-type, encrypted document & etc + * 422 Unprocessable Entity - Unsupported mime-type, encrypted document & etc - * 500 Error - Error while processing document + * 500 Error - Error while processing document == Metadata Resource == - {{{ /meta }}} @@ -54, +49 @@ $ curl -X PUT -d @zipcode.csv http://localhost:9998/meta --header "Content-Type: text/csv" $ curl -T price.xls http://localhost:9998/meta }}} - Returns: {{{ "Content-Encoding","ISO-8859-2" "Content-Type","text/plain" }}} - == Tika Resource == - {{{ /tika }}} @@ -72, +64 @@ Some Example calls with cURL: === Get HELLO message back === - {{{ $ curl -X GET http://localhost:9998/tika - This is Tika Server. Please PUT + This is Tika Server. Please PUT }}} - === Get the Text of a Document === - {{{ $ curl -X PUT -d @GeoSPARQL.pdf http://localhost:9998/tika --header "Content-type: application/pdf" $ curl -T price.xls http://localhost:9998/tika }}} - == Unpacker Resource == - {{{ /unpacker }}} @@ -97, +84 @@ Some Example calls with cURL: === PUT zip file and get back met file zip === - {{{ $ curl -X PUT -d @foo.zip http://localhost:9998/unpacker --header "Content-type: application/zip" }}} - === PUT doc file and get back met file tar === - {{{ $ curl -T Doc1_ole.doc -H "Accept: application/x-tar" http://localhost:9998/unpacker > /var/tmp/x.tar }}} - == "All" resoure == - Get text, metadata and attachements in one request. {{{ $ curl -T Doc1_ole.doc http://localhost:9998/all > /var/tmp/x.zip }}} - Text is stored in "__TEXT__" file, metadata cvs in "__METADATA__". Use "accept" header if you want TAR output.
