Hi,
I'm trying to follow the instructions at the end of http://wiki.apache.org/tika/TikaJAXRS to extract a web page from a remote website. I'd like to see the Tika results for the URL http://www.bbc.co.uk/news, but when I run the following commands, I get these errors:

    curl -i -H "fileUrl:http://http://www.bbc.co.uk/news" -H "Accept: text/plain" -X PUT http://localhost:9998/meta

    HTTP/1.1 406 Not Acceptable
    Content-Length: 0
    Date: Sun, 11 Sep 2016 07:40:24 GMT
    Server: Jetty(8.y.z-SNAPSHOT)

    curl -i -H "fileUrl:http://http://www.bbc.co.uk/news" -H "Accept: text/plain" -X PUT http://localhost:9998/meta

    HTTP/1.1 415 Unsupported Media Type
    Content-Length: 0
    Date: Sun, 11 Sep 2016 07:38:43 GMT
    Server: Jetty(8.y.z-SNAPSHOT)

How do I correctly invoke curl to perform the PUT to the Tika Server and get a valid response for the remote URL?

I'm using:

Apache Tika 1.13 Server
curl 7.40.0 (i386-pc-win32) libcurl/7.40.0 OpenSSL/1.0.0o zlib/1.2.8
on Windows Server 2003 R2 SP2

Thanks,
John
