[
https://issues.apache.org/jira/browse/TIKA-411?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14026654#comment-14026654
]
Nick Burch commented on TIKA-411:
---------------------------------
I'm not sure I'm a fan of having the tika app output a slightly odd apt format,
which is only of slight use to generating the site (given how much extra work
is needed on the text), and no use to anyone else... Happy to see an example
though!
We're about to have an always-on copy of the Tika Server, I'd probably rather
point people there to get an auto-generated list of what parsers, detectors and
types we have, or point them to grab the tika app and ask it. I'd see the
website version as being a friendly, human written and grouped intro, with the
server and app providing up-to-the-minute details as required
> Generate list of supported and detected types automatically
> -----------------------------------------------------------
>
> Key: TIKA-411
> URL: https://issues.apache.org/jira/browse/TIKA-411
> Project: Tika
> Issue Type: Improvement
> Components: documentation
> Reporter: Jukka Zitting
> Priority: Minor
> Attachments: TIKA-411.patch
>
>
> Currently we edit the list of supported types
> (http://lucene.apache.org/tika/0.7/formats.html) manually, which is bound to
> leave the list outdated and incomplete. It would be better if the list was
> automatically generated from the tika-mimetypes.xml file and the
> getSupportedTypes() response of the AutoDetectParser class.
--
This message was sent by Atlassian JIRA
(v6.2#6252)