Hi,

Yes, definitely makes sense to share this. I was not aware of these
existing files, so I threw together a simple application which does a
similar thing, I'll try to do a first run of a POI-mass-test based on that
and then switch to your list of mime-types to re-use the work that you
already do.

On sub-sets of files: yes, might make sense in the long run, I'll take a
look at the information first to see how we would go about it.

Thanks... Dominik.

On Fri, Jun 24, 2016 at 1:09 PM, Allison, Timothy B. <[email protected]>
wrote:

> Hi Dominik,
>   As you mentioned, it is a pain for each of us to run mime-detection on
> the files in our corpus to select those we're interested in.
>   This is somewhat out of date, but should be reasonable for now:
>
> http://162.242.228.174/mimes/mime_comparisons.html
>
>  I'll dump mimes into a tab delimited file ( path\tmime) today and post
> that here: http://162.242.228.174/metadata/
>   I think it would also be useful to do subsets: poi, pdf,
> poi+other_office (msaccess, rtf, odt)...  What do you think?
>
>      Cheers,
>
>                  Tim
>

Reply via email to