[jira] Created: (TIKA-433) Tika + Hadoop

Grant Ingersoll (JIRA) Tue, 25 May 2010 14:13:54 -0700

Tika + Hadoop
-------------

                 Key: TIKA-433
                 URL: https://issues.apache.org/jira/browse/TIKA-433
             Project: Tika
          Issue Type: New Feature
          Components: general
            Reporter: Grant Ingersoll
            Priority: Minor



Would be great to have a Tika contrib that took in an HDFS location with "rich" 
documents on it and an output format (or output processor) and converted the 
docs to XHTML or Solr or whatever.  Seems like it should be pretty 
straightforward to do on the Hadoop side of things.  Only tricky part, I 
suppose, is the output format and how to make that pluggable.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Created: (TIKA-433) Tika + Hadoop

Reply via email to