On Fri, 2003-11-14 at 20:12, Neal Richter wrote:
>    Ack!  This would imply that the 'purged document' is still returned in
> the search results AFTER you run htpurge!!  True????
> 
>    I am assuming that you did something like this:
> 
> 1) index pages
> 2) htdump -w
> 3) mv db.docs db.docs1
> 4) htpurge
> 5) htdump -w
> 6) mv db.docs db.docs2
> 7) diff db.docs1 db.docs2


 Sorry, my bad. I had to do a fresh index first (I had already purged
the same one earlier today). After the fresh index, I did a dump, purged
a record and diffed the second dump. Here's what I got:

824a825
> 818   u:http://newfind.mcgill.ca/indexes/ads/?AdsID=1025825   t:*** BASSIST WANTED 
> *** \
       a:0     m:1068859617    s:336   H:  anyone out there play bass? we're a 
groove/funk\
/jazz/rock improv band with influences from medeski martin wood and bela fleck to 
phish, \
pink floyd and hendrix... anything and everything in between... improv skills would 
help...\
email [EMAIL PROTECTED] for details...  h:      l:1068859617    L:0     b:2     c:1    
 g:0\
     e:      n: S:       d:1025825       A:
1357a1359
> 2     u:http://newfind.mcgill.ca/indexes/ads/ t:      a:2     m:1068859603    
> s:112334    \
    H:      h:      l:1068859604    L:1403  b:1     c:0     g:0e:       n:      S:     
 d:   \
   A:



After the purge, it doesn't show up any more. Then after that, I tried
to re-index it by doing this:

[EMAIL PROTECTED] bin]# echo 'http://newfind.mcgill.ca/indexes/ads/?AdsID=1025825' | 
./htdig - -s -v \
-m -c /www/htdig/install/conf/ads.conf

ht://dig Start Time: Fri Nov 14 20:36:08 2003

New server: newfind.mcgill.ca, 80
0:11476:0:http://newfind.mcgill.ca/indexes/ads/?AdsID=1025825:  (changed)  size = 336
htdig: Run complete
htdig: 1 server seen:
htdig:     newfind.mcgill.ca:80 1 document

HTTP statistics
===============
 Persistent connections    : Yes
 HEAD call before GET      : Yes
 Connections opened        : 2
 Connections closed        : 1
 Changes of server         : 0
 HTTP Requests             : 3
 HTTP KBytes requested     : 0.442383
 HTTP Average request time : 0 secs
 HTTP Average speed        : inf KBytes/secs

ht://dig End Time: Fri Nov 14 20:36:08 2003

but it still doesn't show up in the search results (even after I changed
my start_url to be 
'http://newfind.mcgill.ca/indexes/ads/?AdsID=1025825').

Cheers,

Chris


-- 
Christopher Murtagh
Enterprise Systems Administrator
ISR / Web Communications Group 
McGill University
Montreal, Quebec
Canada

Tel.: (514) 398-3122
Fax:  (514) 398-2017


-------------------------------------------------------
This SF. Net email is sponsored by: GoToMyPC
GoToMyPC is the fast, easy and secure way to access your computer from
any Web browser or wireless device. Click here to Try it Free!
https://www.gotomypc.com/tr/OSDN/AW/Q4_2003/t/g22lp?Target=mm/g22lp.tmpl
_______________________________________________
ht://Dig Developer mailing list:
[EMAIL PROTECTED]
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-dev

Reply via email to