[htdig] Re: Reindex

2001-01-17 Thread Gilles Detillieux

According to Elsa Chan:
 We just launched a new site, but the search engine is indexing pages that
 don't exist anymore. I think I just need to restart htdig except I don't
 know how. I trying search for info on theb htdig web site but I couldnjt
 find anything. Would you be able to help me?

Running the standard "rundig" script will rebuild your database from scratch.
You can also manually run "htdig -i" and "htmerge" to do this.
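
For anyone who wants to script the manual route, here is a minimal sketch in Python of the two steps, "htdig -i" followed by "htmerge". It assumes both programs are on your PATH. The function names and the optional config path are hypothetical, for illustration only; -c is the option both tools take to select a configuration file.

```python
import subprocess

def reindex_commands(config_path=None):
    """Return the two commands for a from-scratch rebuild: htdig -i, then htmerge."""
    htdig = ["htdig", "-i"]      # -i: initial dig, discard the old databases
    htmerge = ["htmerge"]
    if config_path is not None:
        # -c selects a non-default configuration file for both tools.
        htdig += ["-c", config_path]
        htmerge += ["-c", config_path]
    return [htdig, htmerge]

def run_reindex(config_path=None):
    """Run the commands in order, stopping at the first one that fails."""
    for cmd in reindex_commands(config_path):
        subprocess.run(cmd, check=True)
```

Calling run_reindex() with no argument uses whatever default configuration your installation was built with. Because of check=True, htmerge is never started if htdig exits with an error.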

-- 
Gilles R. Detillieux  E-mail: [EMAIL PROTECTED]
Spinal Cord Research Centre   WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:            http://www.htdig.org/FAQ.html




[htdig] Re: Reindex

2001-01-17 Thread Gilles Detillieux

According to Elsa Chan:
 I tried doing that, but only one file gets updated by htdig.
 
 /usr/local/htdig/db/db.docdb is the only file that gets updated.
 
 db.docs.index is still old, and db.wordlist.new is created but it has 0 bytes.
 
 When I try to run htmerge it gives me 
 
 htmerge: Unable to open word list file '/usr/local/htdig/db/db.wordlist'

As FAQ 5.16 explains, this happens because htdig didn't index any documents.
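
If you are scripting the rebuild, a quick check before calling htmerge can catch this case early. The sketch below is only an illustration: the function name is hypothetical, and it assumes the word list uses the default file name shown in the error message above and that a missing or empty file means nothing was indexed.

```python
import os

def wordlist_ready(db_dir):
    """Return True if db.wordlist exists in db_dir and is not empty."""
    path = os.path.join(db_dir, "db.wordlist")
    # A missing or zero-byte word list suggests htdig indexed no documents.
    return os.path.isfile(path) and os.path.getsize(path) > 0
```

With the paths from your message, wordlist_ready("/usr/local/htdig/db") would tell you whether running htmerge is worthwhile yet.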

 I also tried running htdig -vvv, but I get this:
 
 1:0:http://www.site.net/
 New server: www.site.net, 80
 
 
 I specified a different port in the config file and put the URL in
 quotes, but it doesn't seem to work properly.
 
 Any ideas?

You can't use quotes in the start_url, because htdig doesn't parse it as
a quoted string list.  See http://www.htdig.org/attrs.html

The port number should be tacked right on to the end of the URL with a
colon, e.g.  start_url: http://www.site.net:8001
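
To sanity-check a start_url value before running a dig, something like the following Python sketch may help. It is not part of htdig; the function name is hypothetical, and it assumes a single URL per value and port 80 when none is given (the default that shows up in your -vvv output).

```python
from urllib.parse import urlsplit

def check_start_url(value):
    """Return (port, problems) for a single start_url value."""
    problems = []
    if '"' in value or "'" in value:
        problems.append("contains quote characters")
    # Parse the URL with any surrounding quotes removed.
    parts = urlsplit(value.strip("\"'"))
    if not parts.scheme or not parts.hostname:
        problems.append("not an absolute URL")
    port = parts.port or 80  # assumed default when no port is given
    return port, problems
```

For example, check_start_url("http://www.site.net:8001") gives port 8001 with no problems, while the same value wrapped in double quotes is flagged for its quote characters.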

As for figuring out why it's hanging, and what constitutes a long while,
please see Geoff's response.


-- 
Gilles R. Detillieux  E-mail: [EMAIL PROTECTED]
Spinal Cord Research Centre   WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930

