> On Fri, Oct 1, 2010 at 11:49 AM, Jan Høydahl / Cominvent
> <[email protected]> wrote:
>> * What is the correct mimetype? tika-mimetypes.xml lists
>> application/vnd.ms-tnef
>> I see references to application/ms-tnef other places, should we support
>> both?
>
> It looks like application/vnd.ms-tnef is the official type [1], but it
> would probably be a good idea to add ../ms-tnef as an alias. Can you
> file an improvement request for that?

TIKA-523 and http://github.com/jukka/jtnef/issues/issue/1

>> * Could we legally include with Tika a maven target or script which downloads
>> 3rd party jars? That would benefit developers (broader distribution) as well
>> as
>> the Tika community (better file format support).
> It would of course be legal to do so (i.e. we wouldn't be going to
> jail for that ;-), but Apache policies (see [2], most notably [3])
> puts some limits on what an official Apache release can include. The
> reason for those policies is to make it easy to include Apache code
> also in commercial products, which I think is a Good Thing (TM).

I'm not thinking of linking against the GPL plugins, but of helping users
find them while requiring explicit action to download and use them. One
way to do that would be to include a PLUGINS-README.TXT file listing all
3rd party plugins and how to obtain them. I think this is more visible
than simply a Wiki entry. We could then quickly expand the number of file
formats supported, and slowly re-implement each of them in Apache clothes.

In short, we as developers care a lot about licenses, but end users very
often care more about features and are more than happy to use GPL plugins.

--
Jan Høydahl, search solution architect
Cominvent AS - www.cominvent.com
