According to Rodger Zeisler:
> I indexed a 1400-page site.  Each of the pages was only about 6-8K.  The
> resulting database looks like this:
> 
> -rw-rw-r--  1 infoserv  vuser   4023296 Dec 18 14:15 db.docdb
> -rw-rw-r--  1 infoserv  vuser    271360 Dec 18 14:15 db.docs.index
> -rw-rw-r--  1 infoserv  vuser   3435360 Dec 18 14:15 db.urls
> -rw-rw-r--  1 infoserv  vuser   4041919 Dec 18 14:15 db.wordlist
> -rw-rw-r--  1 infoserv  vuser   4072448 Dec 18 14:15 db.words.db
> 
> When I run htsearch with a small result set (400-500 I'm guessing), it all
> works fine.  When I pick a really common word, that would result in close to
> all the pages being retrieved (like the name of the company, which exists in
> the footer of each page), htsearch dies with an internal server error.  In
> the error_log it says:
> 
> Virtual memory exceeded in `new'
> [Fri Dec 18 15:52:56 1998] [error] Premature end of script headers:
> /usr/local/etc/httpd/cgi-bin/htsearch
> 
> Any suggestions?  If your answer is 'you need more virtual memory', how is
> this accomplished?  I am running on a VServers.com virtual server account.

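There is a memory leak in htsearch, introduced in 3.1.0b3: the thisRef
object used when scoring each match is never freed, so large result sets
can exhaust memory.  If that's the version you're running, give this patch
a try and let us know whether it works for you: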

--- htsearch/Display.cc.membug  Tue Dec 15 10:58:13 1998
+++ htsearch/Display.cc Mon Dec 21 10:11:07 1998
@@ -852,6 +852,7 @@
              links = 1; // It's a hack, but it helps...
            score += config.Double("backlink_factor") 
              * (thisRef->DocBackLinks() / (double)links);
+           delete thisRef;
          }
 
        thisMatch->setIncompleteScore(score);
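
To show why a single missing delete matters for large result sets, here is
a minimal C++ sketch of the same pattern.  It is not ht://Dig's code:
DocRef, lookup() and scoreAll() are hypothetical names made up for the
illustration, and the live counter exists only so the leak can be observed.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical stand-in for a per-document record; NOT ht://Dig's class.
struct DocRef {
    static int live;   // number of DocRef objects currently allocated
    int backlinks;
    int links;
    DocRef(int b, int l) : backlinks(b), links(l) { ++live; }
    ~DocRef() { --live; }
};
int DocRef::live = 0;

// Hypothetical lookup: returns a heap object that the CALLER owns.
DocRef *lookup(int id) {
    return new DocRef(id % 7, id % 3);
}

// Scores `count` matches.  With release == false the loop leaks one
// DocRef per match, so memory use grows with the size of the result set.
double scoreAll(int count, double backlinkFactor, bool release) {
    double total = 0.0;
    for (int id = 0; id < count; ++id) {
        DocRef *ref = lookup(id);
        assert(ref != nullptr);
        int links = ref->links;
        if (links == 0)
            links = 1;   // avoid dividing by zero
        total += backlinkFactor * (ref->backlinks / (double)links);
        if (release)
            delete ref;  // the one-line fix: free the record after use
    }
    return total;
}
```

In this sketch, scoreAll(1400, 0.1, false) leaves 1400 DocRef objects
allocated when it returns, while scoreAll(1400, 0.1, true) leaves none.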

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930
----------------------------------------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the body of the message.
