The run_status_index is only a file with the output on index process (index -r).
The output of "index -S:
ASPSeek database URL statistics
Status Expired Total
-----------------------------
0 2534940 2534940 Not indexed yet
1 0 13 Unknown status
200 2090805 5295002 OK
204 2 2 No content
299 5 5 Unknown status
300 63 104 Multiple Choices
301 5702 26726 Moved Permanently
302 16640 51540 Moved Temporarily
303 2 2 See Other
400 29 83 Bad Request
401 3407 19703 Unauthorized
403 15782 29490 Forbidden
404 133706 376890 Not found
406 1 2 Not Acceptable
407 2 4 Proxy Authentication Required
500 863 2313 Internal Server Error
502 33 254 Bad Gateway
503 85 110 Service Unavailable
504 1 3 Gateway Timeout
-----------------------------
Total 4802068 8337186
"Alexander F. Avdonkin" ha scritto:
> What is the command run_status_index ?
> Could you give me exact output of "index -S" ?
> Number in the "total" file is usually less than generated by "index -S"
> because "total" contains number of not empty URLs only.
>
> Alexander.
>
> ----- Original Message -----
> From: "Massimo Miccoli" <[EMAIL PROTECTED]>
> To: <[EMAIL PROTECTED]>
> Sent: Friday, March 16, 2001 2:07 AM
> Subject: Re: [aseek-users] Aspseek limit
>
> > Hi,
> >
> > I'm sorry,
> > I send you more data about my problem.
> > I've changed the period in the aspseek.conf from prevuis 25d to 7d. Is a
> problem?
> > Ended thread: 10. Start: 0.000. End: 0.000-984668298.634.
> > Duration: 0.000. URL: http://www.nouvellesfrontieres.it
> > /robots.txt
> > Ended thread: 12. Start: 0.000. End: 0.000-984668298.635.
> > Duration: 0.000. URL: http://www.comunie.messina.it/rob
> > ots.txt
> > Saving real-time database ... done.
> > Saving delta files [..................................................]
> done.
> > Loading ranks [..................................................]
> done.
> > Saving citation [..................................................]
> done.
> > Calculating ranks [..................................................]
> done.
> > In: 83185017. Out: 83185017. Rank: 3416857.449604
> > Urls: 8306827. Hrefs: 83185017
> > index process finished.
> >
> > Massimo Miccoli ha scritto:
> >
> > > Hi,
> > > I'm sorry but the switch not work.
> > > The result is:
> > > Ended thread: 14. Start: 0.000. End:
> 0.000-984668298.649.
> > > Duration: 0.000. URL: http://adecco.it/robots.txt
> > > Ended thread: 15. Start: 0.000. End:
> 0.000-984668298.649.
> > > Duration: 0.000. URL: http://www.javasoft-mirror.java.t
> > > Saving real-time database ... done.
> > > Saving delta files [..................................................]
> done.
> > >
> > > The command i've used is:
> > > /usr/local/aspseek/sbin/index -N 16 -s 0 -f
> /usr/local/aspseek/etc/url_index -r
> > > /usr/local/aspseek/etc/logs/run_status_index -R 8
> > >
> > > In the ursl file are the urls with I started the first indexing.
> > > The result of index -S
> > > 8.300.345 urls
> > > In total file: 5.200.231
> > >
> > > Thank
> > >
> > > Massimo
> > >
> > > "Alexander F. Avdonkin" ha scritto:
> > >
> > > > You can use command line switch "-s 0", that is index only documents
> which
> > > > have not been indexed yet.
> > > >
> > > > Alexander.
> > > >
> > > > ----- Original Message -----
> > > > From: "Massimo Miccoli" <[EMAIL PROTECTED]>
> > > > To: <[EMAIL PROTECTED]>
> > > > Sent: Wednesday, March 14, 2001 11:03 PM
> > > > Subject: Re: [aseek-users] Aspseek limit
> > > >
> > > > > Hi,
> > > > > The qestion is...
> > > > > How can I index the rest of the urls the are in the statistics
> result?
> > > > > The command I've used:
> > > > > /usr/local/aspseek/sbin/index -N 16 -f
> > > > usr/local/aspseek/etc/url_index -r
> > > > > /usr/local/aspseek/etc/logs/run_status_index -R 8 &
> > > > >
> > > > > My box have Linux kernel 2.4.2 and work fine.
> > > > >
> > > > > Thank
> > > > >
> > > > > Massimo
> > > > >
> > > > > "Alexander F. Avdonkin" ha scritto:
> > > > >
> > > > > > No, 5M URLs is approximate limit. With this number of URLs,
> ASPseek
> > > > requires
> > > > > > about 700M of RAM to calculate ranks of pages.
> > > > > > If number of URLs will grow, then swapping will occur during ranks
> > > > > > calculation.
> > > > > >
> > > > > > Alexander.
> > > > > >
> > > > > > ----- Original Message -----
> > > > > > From: "Massimo Miccoli" <[EMAIL PROTECTED]>
> > > > > > To: "aspseeklist" <[EMAIL PROTECTED]>
> > > > > > Sent: Wednesday, March 14, 2001 3:12 AM
> > > > > > Subject: [aseek-users] Aspseek limit
> > > > > >
> > > > > > > 5.000.000 of urls is an hard limit for Aspessek?
> > > > > > > How may page can I index on a Linux box dual Pentium III 900 and
> one
> > > > GB
> > > > > > > Ram and 132 GB disk?
> > > > > > > I've see in index statistics (index -S) that indexed page is
> 5.209.600
> > > > > > > and not index 8.300.334.
> > > > > > > So, i re-run the index again (index -N 16 -f /urlfile -R 8) and
> at the
> > > > > > > end the page indexed is the same.
> > > > > > > The first time I've run index I never stoped it, the work is
> finish
> > > > > > > normal at the end urls list and the urls discovered.
> > > > > > >
> > > > > > > Thank for response,
> > > > > > >
> > > > > > > Massimo
> > > > > > >
> > > > >
> >