I'm using a build from the 1.1 source. John
On Sun, Nov 20, 2011 at 2:31 PM, Nick Burch <[email protected]> wrote: > On Sun, 20 Nov 2011, John M wrote: >> >> I have a .ppt file that I've renamed to be a .doc file (by only changing >> its extension). If I use the Tika GUI, or the command line, to extract the >> file metadata, then Tika correctly identifies the content type as a >> Powerpoint file. However, if I use the command line -d option to detect its >> content type, the application returns "application/msword", which is of >> course only superficially correct. > > What version of Tika are you trying with? If it isn't 1.0, I'd suggest you > upgrade and re-test. (We've made detectors pluggable like parsers fairly > recently, which changed how the container aware detectors were made > available and used) > > Nick >
