[ 
https://issues.apache.org/jira/browse/TIKA-411?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14026654#comment-14026654
 ] 

Nick Burch commented on TIKA-411:
---------------------------------

I'm not sure I'm a fan of having the Tika app output a slightly odd APT format, 
which is of only slight use for generating the site (given how much extra work 
is needed on the text), and of no use to anyone else... Happy to see an example 
though!

We're about to have an always-on copy of the Tika Server. I'd probably rather 
point people there to get an auto-generated list of the parsers, detectors and 
types we have, or point them to grab the Tika app and ask it. I'd see the 
website version as a friendly, human-written and grouped intro, with the 
server and app providing up-to-the-minute details as required.

> Generate list of supported and detected types automatically
> -----------------------------------------------------------
>
>                 Key: TIKA-411
>                 URL: https://issues.apache.org/jira/browse/TIKA-411
>             Project: Tika
>          Issue Type: Improvement
>          Components: documentation
>            Reporter: Jukka Zitting
>            Priority: Minor
>         Attachments: TIKA-411.patch
>
>
> Currently we edit the list of supported types 
> (http://lucene.apache.org/tika/0.7/formats.html) manually, which is bound to 
> leave the list outdated and incomplete. It would be better if the list was 
> automatically generated from the tika-mimetypes.xml file and the 
> getSupportedTypes() response of the AutoDetectParser class.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
