Agree with Tim!

 

Thanks for using Tika, really appreciate it Austin. If it’s not too much trouble would love a link/citation back to Tika in your work.

 

Thanks,

Chris

 

 

 

 

 

On 4/8/25, 11:57AM, "Tim Allison" <talli...@apache.org> wrote:

I'm not a lawyer, but sounds reasonable to me. :D

 

I can't think of anything in our Apache Software License 2.0 that would cause problems, and you're not infringing on trademark, as far as I can tell.

 

Thank you for reaching out, let us know if you find any problems with our data, and enjoy!

 

Cheers,

 

         Tim

 

On 2025/04/08 04:02:47 Austin Ziegler wrote:

> Hi

>

> I'm the maintainer of the Ruby gem mime-types and its associated data

> gem/repo, mime-types/mime-types-data.

>

> While the vast majority of the data is pulled from the IANA media type

> registry, one thing which has always been a bit ad hoc is extensions. This

> is currently enriched from the Apache httpd MIME list, but I have been

> considering extending the data with the `tika-mimetypes.xml` from tika-core.

>

> I have implemented a parser to integrate these at

> it I wanted to raise the question as to whether this would be

> considered fair use by the Tika project. The changes essentially:

>

> 1. Parse the current `tika-mimetypes.xml` from the main branch of Tika on

> GitHub.

> 2. Skip over any `mime-type` record that has attributes (MIME::Types is

> about resolving the primary media types and does not support format or

> version attributes).

> 3. Extracts the `glob` entries for use in the `extensions` field. Globs

> that use `*` in the middle of a filename are excluded, because that's now

> how the Ruby MIME::Types field works (I could add a new `glob` field, but

> that will take a bit more work).

> 4. Updates the `extensions` field for any existing MIME::Type or creates

> new unregistered (not defined in IANA) types for new ones.

>

> -a

> --

>

 

Reply via email to