[
https://issues.apache.org/jira/browse/TIKA-433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12917391#action_12917391
]
Chris A. Mattmann commented on TIKA-433:
----------------------------------------
Thanks Grant, sounds cool!
> Tika + Hadoop
> -------------
>
> Key: TIKA-433
> URL: https://issues.apache.org/jira/browse/TIKA-433
> Project: Tika
> Issue Type: New Feature
> Components: general
> Reporter: Grant Ingersoll
> Priority: Minor
>
> Would be great to have a Tika contrib that took in an HDFS location with
> "rich" documents on it and an output format (or output processor) and
> converted the docs to XHTML or Solr or whatever. Seems like it should be
> pretty straightforward to do on the Hadoop side of things. Only tricky part,
> I suppose, is the output format and how to make that pluggable.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.