Hi,

Yes, it definitely makes sense to share this. I was not aware of these existing files, so I threw together a simple application that does a similar thing. I'll try a first run of a POI mass test based on that, and then switch to your list of mime-types to re-use the work you have already done.
On sub-sets of files: yes, that might make sense in the long run. I'll take a look at the information first to see how we would go about it.

Thanks... Dominik.

On Fri, Jun 24, 2016 at 1:09 PM, Allison, Timothy B. <[email protected]> wrote:
> Hi Dominik,
> As you mentioned, it is a pain for each of us to run mime-detection on
> the files in our corpus to select those we're interested in.
> This is somewhat out of date, but should be reasonable for now:
>
> http://162.242.228.174/mimes/mime_comparisons.html
>
> I'll dump mimes into a tab-delimited file (path\tmime) today and post
> that here: http://162.242.228.174/metadata/
> I think it would also be useful to do subsets: poi, pdf,
> poi+other_office (msaccess, rtf, odt)... What do you think?
>
> Cheers,
>
> Tim
>
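
A minimal sketch of how a subset could be pulled from the tab-delimited file mentioned above, assuming one path<TAB>mime pair per line. The prefix list for a "poi" subset and the function name are hypothetical; they are placeholders for illustration, not taken from the posted files:

```python
# Hypothetical mime prefixes for a "poi" subset; the real list would
# need to be agreed on and checked against the posted mime comparisons.
POI_MIME_PREFIXES = (
    "application/msword",
    "application/vnd.ms-",
    "application/vnd.openxmlformats-officedocument",
)


def select_paths(lines, prefixes=POI_MIME_PREFIXES):
    """Yield the path of every line whose mime type starts with a prefix.

    Each line is assumed to look like "path<TAB>mime".
    """
    for line in lines:
        parts = line.rstrip("\r\n").split("\t")
        if len(parts) < 2:
            continue  # skip blank or malformed lines
        path, mime = parts[0], parts[1].strip().lower()
        # str.startswith accepts a tuple and matches any of its entries
        if mime.startswith(prefixes):
            yield path
```

Passing an open file object as `lines` would stream through the list without loading it all into memory, and a different `prefixes` tuple would give the pdf or poi+other_office subsets.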
