If you have only 1 host, then it can take long time to index >100000 URLs.
Possibly problem is that HTTP server gives response with delay.
Try to run "top" during indexing and see how much is the percentage of CPU time for "index", "mysqld" and "httpd" (if httpd is run on the same computer)
 
Alexander.
----- Original Message -----
From: Bob Couch
Sent: Thursday, July 12, 2001 12:53 PM
Subject: Re: [aseek-users] Indexing is taking a long time

Just 1 host and 1 thread, I cleared out the database, removed the var/aspseek directory
and started from scratch tonight. this is how I started it

echo `date` > howlong (save start date and time to a file)
./index -a
echo `date` >> howlong (save end date and time to a file)

When I tried starting it over again it wouldn't do more than 1100 documents. Then I went into
the apache conf and turned Indexes on and now it's running again. I normally shut that off so people
can not list directories.
I also notice that aspseek has a problem trying to index hrefs that have mailto: or news: in then such as
< a href="mailto:[EMAIL PROTECTED]">Help</a>, it calls them unsupported protocol...


DocumentRoot "/usr/local/apache/htdocs"
<Directory "/usr/local/apache/htdocs">
Options Includes Indexes FollowSymLinks
#Options Includes FollowSymLinks
AllowOverride None
Order allow,deny
Allow from all
</Directory>



PS:
Very Nice Program, and thanks for the help.


At 11:40 AM 7/12/2001 +0800, you wrote:
How many threads (-N parameter) do you run ?
How many unique hosts are you indexing ?

Alexander.

----- Original Message -----
From: "Bob Couch" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Thursday, July 12, 2001 1:32 AM
Subject: [aseek-users] Indexing is taking a long time


>
> I'm indexing around 170,000 files on a dual PIII 933mhz with 1 gig of ram
> and nothing else running.
> I started the index 7/10 at 4:30PM EST its now 7/11 and 1:30PM and it's
> still not done. Why does it
> make several passes through the same documents over and over again. Is
> something wrong?
>
> PS:
> I'm running 1.2.4a
>

Reply via email to