[ 
https://issues.apache.org/jira/browse/TIKA-169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650251#action_12650251
 ] 

Jukka Zitting commented on TIKA-169:
------------------------------------

Another alternative for cross-platform use is the CLI feature:

    # Extracting structured text content from a file
    java -jar tika-0.2-standalone.jar --xml /path/to/file

    # Extracting plain text content from a file
    java -jar tika-0.2-standalone.jar --text /path/to/file

    # Extracting metadata from a file
    java -jar tika-0.2-standalone.jar --metadata /path/to/file

This way you don't need a separate server process and there won't be any 
concerns about unauthorized users getting access to your files.

I'm a bit concerned about any web service that allows the client to retrieve 
the contents of any file on the local file system. Would it make more sense to 
always require the client to upload the files they want parsed?

Also, the file system traversal feature seems a bit outside the scope of Tika, 
though having something like this in a contrib area might be nice.


> Tika Web Service Servlet
> ------------------------
>
>                 Key: TIKA-169
>                 URL: https://issues.apache.org/jira/browse/TIKA-169
>             Project: Tika
>          Issue Type: New Feature
>          Components: general
>    Affects Versions: 0.2
>            Reporter: Rida Benjelloun
>            Priority: Minor
>         Attachments: tikaServlet.war
>
>
> Tika servlet, use file or directory path to build a list of XML documents. 
> The next version will allow file upload.
> Usage :
> //Extract document content and metadata
> http://localhost:8080/tikaServlet/?filePath=C:\test&start=0&rows=10
> //Extract metadata
> http://localhost:8080/tikaServlet/?filePath=C:\test&start=0&rows=10&extract=metadata
> //Extract document content
> http://localhost:8080/tikaServlet/?filePath=C:\test&start=0&rows=10&extract=content

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to