Nutch is working great for me except on some text files with no extention in
the filename.

For example, a file called "MOI1993-NYN-350-0244-J_62037" that contains text
gets the following error:
  Error parsing:
file:/home/nsnyder/AL_PROBLEM_FILES/MOI1993-NYN-350-0244-J_62037:
org.apache.nutch.parse.ParseException: parser not found for
contentType=application/octet-stream
url=file:/home/nsnyder/AL_PROBLEM_FILES/MOI1993-NYN-350-0244-J_62037

I can open this file in vi and see the plain text.  What can I do to make
nutch get the content type as text and not 
application/octet-stream.
-- 
View this message in context: 
http://www.nabble.com/Text-file-with-no-extension-gets-content-type-of-octet-stream--tp20369073p20369073.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Reply via email to