I've modified the parser to log long running records and ran your segment. There are quite a few records that run for more than a second on one machine with 2x2.4GHz CPU. It, unfortunately doesn't show me a record it's waiting for.
I ouput a record prior to parsing and after parsing with elasped ms but it's stalling somewhere. It should stall with a `Parsing: ` entry, not a `Parsed` one. Parsing: http://www.target.com/p/somerset-5-drawer-chest-coffee/-/A-12121682 Parsed (34ms):http://www.target.com/p/somerset-5-drawer-chest-coffee/-/A-12121682 Parsing: http://www.target.com/p/tennessee-volunteers-college-party-pack-for-16-guests/-/A-14087806 Parsed (29ms):http://www.target.com/p/tennessee-volunteers-college-party-pack-for-16-guests/-/A-14087806 Parsing: http://www.target.com/p/the-board-dudes-magnetic-dry-erase-board-14-x14/-/A-13617619 Parsed (29ms):http://www.target.com/p/the-board-dudes-magnetic-dry-erase-board-14-x14/-/A-13617619 Parsing: http://www.target.com/p/the-laws-of-charisma-hardcover/-/A-12846523 Parsed (32ms):http://www.target.com/p/the-laws-of-charisma-hardcover/-/A-12846523 ..STALLS This is with a default Nutch checkout but can be a problem of me running it local although it shouldn't. -----Original message----- > From:sidbatra <[email protected]> > Sent: Mon 02-Jul-2012 23:14 > To: [email protected] > Subject: RE: ParseSegment taking a long time to finish > > Thanks a lot Markus. I'll make these changes, re-run and share the result. > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/ParseSegment-taking-a-long-time-to-finish-tp3758053p3992610.html > Sent from the Nutch - User mailing list archive at Nabble.com. >

