Have been playing around with integrating Tika into my PHP app.

I have had great success with Tika on the command line and also SolrCell.

However, I was wondering if there is some way of running Tika in server mode and extracting a document, say, via CURL.

I have had varying degrees of success with:

nc localhost 30000 < /opt/lampp/htdocs/joomla25/tmp/InformationRepository.pdf

but I'm wondering how I pass other params such as for extracting just metadata or content in html format.

Any help would be much appreciated.

Cheers


Hayden

Reply via email to