Hi Lewis, Magnus, here is an example segment that consistently reproduces the slowness issue in parsing - https://dl.dropbox.com/u/4027616/segment.tar.gz
I'll appreciate if you guys have any insights. I've documented more details here - http://lucene.472066.n3.nabble.com/Nutch-Parse-Step-Bafflingly-Slow-in-Reduce-Step-with-example-td3988820.html thanks, Sid -- View this message in context: http://lucene.472066.n3.nabble.com/ParseSegment-taking-a-long-time-to-finish-tp3758053p3989072.html Sent from the Nutch - User mailing list archive at Nabble.com.

