I see. It's because the RegexNormalizer is stripping off sessionid query string parameters. Never mind. Nobody need answer my question that no one was going to answer anyway. :-) Kevin
On Mon, Sep 22, 2008 at 2:38 PM, Kevin MacDonald <[EMAIL PROTECTED]>wrote: > Below is a trimmed down snippet from a segment dump file. Note that Recno:: > 2 shows a redirect from "viewtopic.php" to "login.php". The URL field in > Recno:: 1 should match the Url that was redirected to, but part of the > querystring is missing. The "&sid=e4821a83f97666e52bd197f04d4a0598" portion > got trimmed off. There were no other records in the dump file regarding this > domain. > Recno:: 1 > URL:: > http://blythe.swedenunlimited.com/newforum/login.php?redirect=viewtopic.php&t=88596 > > ParseData:: > Status: success(1,0) > > CrawlDatum:: > Status: 33 (fetch_success) > Metadata: _ngt_:1222117995595 _pst_:success(1), lastModified=0 _repr_: > http://blythe.swedenunlimited.com/newforum/viewtopic.php?t=88596 > > Recno:: 2 > URL:: http://blythe.swedenunlimited.com/newforum/viewtopic.php?t=88596 > > CrawlDatum:: > Status: 35 (fetch_redir_temp) > Metadata: _ngt_:1222117995595 _pst_:temp_moved(13), lastModified=0: > http://blythe.swedenunlimited.com/newforum/login.php?redirect=viewtopic.php&t=88596&sid=e4821a83f97666e52bd197f04d4a0598 > > > > Kevin >
