I've modified the parser to log long running records and ran your segment. 
There are quite a few records that run for more than a second on one machine 
with 2x2.4GHz CPU. It, unfortunately doesn't show me a record it's waiting for.

I ouput a record prior to parsing and after parsing with elasped ms but it's 
stalling somewhere. It should stall with a `Parsing: ` entry, not a `Parsed` 
one.

Parsing: http://www.target.com/p/somerset-5-drawer-chest-coffee/-/A-12121682
Parsed 
(34ms):http://www.target.com/p/somerset-5-drawer-chest-coffee/-/A-12121682
Parsing: 
http://www.target.com/p/tennessee-volunteers-college-party-pack-for-16-guests/-/A-14087806
Parsed 
(29ms):http://www.target.com/p/tennessee-volunteers-college-party-pack-for-16-guests/-/A-14087806
Parsing: 
http://www.target.com/p/the-board-dudes-magnetic-dry-erase-board-14-x14/-/A-13617619
Parsed 
(29ms):http://www.target.com/p/the-board-dudes-magnetic-dry-erase-board-14-x14/-/A-13617619
Parsing: http://www.target.com/p/the-laws-of-charisma-hardcover/-/A-12846523
Parsed 
(32ms):http://www.target.com/p/the-laws-of-charisma-hardcover/-/A-12846523
..STALLS

This is with a default Nutch checkout but can be a problem of me running it 
local although it shouldn't.

-----Original message-----
> From:sidbatra <[email protected]>
> Sent: Mon 02-Jul-2012 23:14
> To: [email protected]
> Subject: RE: ParseSegment taking a long time to finish
> 
> Thanks a lot Markus. I'll make these changes, re-run and share the result.
> 
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/ParseSegment-taking-a-long-time-to-finish-tp3758053p3992610.html
> Sent from the Nutch - User mailing list archive at Nabble.com.
> 

Reply via email to