Hi,

 

I'm trying to follow the instructions at the end of
http://wiki.apache.org/tika/TikaJAXRS to use extract a web page from a
remote website.

 

I'd like to see the Tika results for the URL
http://http://www.bbc.co.uk/news <http://http:/www.bbc.co.uk/news>  , but
when I run the following commands, I get the following errors:

 

 

curl -i  -H "fileUrl:http://http://www.bbc.co.uk/news
<http://http:/www.bbc.co.uk/news> "  -H "Accept: text/plain" -X PUT
http://localhost:9998/meta

 

HTTP/1.1 406 Not Acceptable

Content-Length: 0

Date: Sun, 11 Sep 2016 07:40:24 GMT

Server: Jetty(8.y.z-SNAPSHOT)

 

 

curl -i  -H "fileUrl:http://http://www.bbc.co.uk/news
<http://http:/www.bbc.co.uk/news> "  -H "Accept: text/plain" -X PUT
http://localhost:9998/meta

 

HTTP/1.1 415 Unsupported Media Type

Content-Length: 0

Date: Sun, 11 Sep 2016 07:38:43 GMT

Server: Jetty(8.y.z-SNAPSHOT)

 

How do I correctly invoke curl to perform the PUT to the Tika Server to get
a valid response for the remote url ?

 

I'm using:

Apache Tika 1.13 Server

curl 7.40.0 (i386-pc-win32) libcurl/7.40.0 OpenSSL/1.0.0o zlib/1.2.8

on Windows Server 2003 R2 sp2

 

Thanks,

 

John

 

Reply via email to