Hi,

I've got some OCR'd books in plain text format which are incorrectly detected as
application/x-elc, probably because of junk bytes at the start and sometimes at
the end of the files. I've also seen some detected as Shockwave files. An
additional problem is that the file program also identifies these files as Lisp
data.

My question is: how do you handle files that are given an incorrect MIME type?
Are there any best practices?
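
To make the question concrete, one workaround I could imagine is a
post-detection override along these lines. This is only a rough sketch in
Python, not something any detection library provides: the function names, the
64-byte skip, the 0.95 threshold and the exact Shockwave MIME string are all
assumptions made up for illustration.

```python
# Hypothetical post-detection override: if a detector reports one of the
# known-wrong types but the content is overwhelmingly readable text once the
# head and tail are ignored, fall back to text/plain.

# Types seen on the misdetected files. The Shockwave string is an assumption;
# the original report only says "shockwave files".
SUSPICIOUS_TYPES = {"application/x-elc", "application/x-shockwave-flash"}


def looks_like_plain_text(data: bytes, skip: int = 64,
                          threshold: float = 0.95) -> bool:
    """Ignore `skip` bytes at each end, then check the printable ratio."""
    # Only trim when the file is large enough to leave something behind.
    body = data[skip:len(data) - skip] if len(data) > 2 * skip else data
    if not body:
        return False
    # Undecodable bytes become U+FFFD, which is counted as non-text below.
    text = body.decode("utf-8", errors="replace")
    readable = sum(
        1 for ch in text
        if ch != "\ufffd" and (ch.isprintable() or ch in "\n\r\t")
    )
    return readable / len(text) >= threshold


def corrected_mime_type(detected: str, data: bytes) -> str:
    """Return text/plain for suspicious detections that look like text."""
    if detected in SUSPICIOUS_TYPES and looks_like_plain_text(data):
        return "text/plain"
    return detected
```

Types outside the suspicious set are passed through untouched, so a check
like this would only ever affect the cases that are already known to be wrong.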

Here are a few example files:

http://ia600400.us.archive.org/34/items/papersfromtortug183121922carn/papersfromtortug183121922carn_djvu.txt
http://ia700108.us.archive.org/14/items/reportofcommissi1881unit/reportofcommissi1881unit_djvu.txt
http://ia600300.us.archive.org/12/items/stuttgarterbeitr2747197779staa/stuttgarterbeitr2747197779staa_djvu.txt
http://ia600100.us.archive.org/9/items/nicolaijosephija01jacq/nicolaijosephija01jacq_djvu.txt

Cheers,
-- 
Markus Jelsma - CTO - Openindex
http://www.linkedin.com/in/markus17
050-8536620 / 06-50258350
