Hi,
It seems it still doens't work afterall. I updated all config files and the JPEG (and more new as it looks like). But the log still tells me it cannot find a suitable parser. --------------- 2010-05-17 15:20:06,636 WARN parse.ParseUtil - No suitable parser found when trying to parse content http://www.fcgroningen.nl/uploads/media/hollabovenplaat_01.jpg of type image/jpeg 2010-05-17 15:20:06,637 WARN parse.Parser - Error parsing: http://www.fcgroningen.nl/uploads/media/hollabovenplaat_01.jpg: org.apache.nutch.parse.ParseException: parser not found for contentType=image/jpeg url=http://www.fcgroningen.nl/uploads/media/hollabovenplaat_01.jpg at org.apache.nutch.parse.ParseUtil.parse(ParseUtil.java:74) at org.apache.nutch.parse.ParseSegment.map(ParseSegment.java:85) at org.apache.nutch.parse.ParseSegment.map(ParseSegment.java:41) at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50) at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:358) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:307) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:177) --------------- Cheers, On Monday 17 May 2010 14:37:54 Markus Jelsma wrote: > Hi, > > > I've got a copy of the nutch-2010-05-11_04-34-41 nightly build because i > need Tika to parse JPEG images and that would be in 1.1 as i read > somewhere [1]. > > --------------- > 2010-05-17 14:36:13,074 WARN parse.ParseUtil - No suitable parser found > when trying to parse content > http://www.fcgroningen.nl/uploads/media/hollabovenplaat_01.jpg of type > image/jpeg > 2010-05-17 14:36:13,075 WARN parse.Parser - Error parsing: > http://www.fcgroningen.nl/uploads/media/hollabovenplaat_01.jpg: > org.apache.nutch.parse.ParseException: parser not found for > contentType=image/jpeg > url=http://www.fcgroningen.nl/uploads/media/hollabovenplaat_01.jpg > --------------- > > > [1]: http://lucene.472066.n3.nabble.com/Adding-jpeg-parser-to-nutch- > td710135.html > > Cheers, > > Markus Jelsma - Technisch Architect - Buyways BV > http://www.linkedin.com/in/markus17 > 050-8536620 / 06-50258350 > Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350

