Hi,

It seems it still doesn't work after all. I updated all config files and the
JPEG (and more new, as it looks like). But the log still tells me it cannot
find a suitable parser.

---------------
2010-05-17 15:20:06,636 WARN  parse.ParseUtil - No suitable parser found when trying to parse content http://www.fcgroningen.nl/uploads/media/hollabovenplaat_01.jpg of type image/jpeg
2010-05-17 15:20:06,637 WARN  parse.Parser - Error parsing: http://www.fcgroningen.nl/uploads/media/hollabovenplaat_01.jpg:
org.apache.nutch.parse.ParseException: parser not found for contentType=image/jpeg url=http://www.fcgroningen.nl/uploads/media/hollabovenplaat_01.jpg
        at org.apache.nutch.parse.ParseUtil.parse(ParseUtil.java:74)
        at org.apache.nutch.parse.ParseSegment.map(ParseSegment.java:85)
        at org.apache.nutch.parse.ParseSegment.map(ParseSegment.java:41)
        at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50)
        at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:358)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:307)
        at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:177)
---------------


Cheers,

On Monday 17 May 2010 14:37:54 Markus Jelsma wrote:
> Hi,
> 
> 
> I've got a copy of the nutch-2010-05-11_04-34-41 nightly build because I
>  need Tika to parse JPEG images, and that would be in 1.1 as I read
>  somewhere [1].
> 
> ---------------
> 2010-05-17 14:36:13,074 WARN  parse.ParseUtil - No suitable parser found
>  when trying to parse content
> http://www.fcgroningen.nl/uploads/media/hollabovenplaat_01.jpg of type
> image/jpeg
> 2010-05-17 14:36:13,075 WARN  parse.Parser - Error parsing:
> http://www.fcgroningen.nl/uploads/media/hollabovenplaat_01.jpg:
> org.apache.nutch.parse.ParseException: parser not found for
> contentType=image/jpeg
> url=http://www.fcgroningen.nl/uploads/media/hollabovenplaat_01.jpg
> ---------------
> 
> 
> [1]: http://lucene.472066.n3.nabble.com/Adding-jpeg-parser-to-nutch-td710135.html
> 
> Cheers,
> 
> Markus Jelsma - Technisch Architect - Buyways BV
> http://www.linkedin.com/in/markus17
> 050-8536620 / 06-50258350
> 

Markus Jelsma - Technisch Architect - Buyways BV
http://www.linkedin.com/in/markus17
050-8536620 / 06-50258350
