Hi, I've got some OCR'd books in plain text format which are incorrectly marked as application/x-elc, probably because of the junk bytes in the head and sometimes tail. I've also seen some being marked as shockwave files. The additional problem is that the file program also marks these files as Lisp data.
My question is, how do you handle files that are given an incorrect mime-type? Are there some best practices? Here are a few example files: http://ia600400.us.archive.org/34/items/papersfromtortug183121922carn/papersfromtortug183121922carn_djvu.txt http://ia700108.us.archive.org/14/items/reportofcommissi1881unit/reportofcommissi1881unit_djvu.txt http://ia600300.us.archive.org/12/items/stuttgarterbeitr2747197779staa/stuttgarterbeitr2747197779staa_djvu.txt http://ia600100.us.archive.org/9/items/nicolaijosephija01jacq/nicolaijosephija01jacq_djvu.txt Cheers, -- Markus Jelsma - CTO - Openindex http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350
