See my comments below.

"Alexander F. Avdonkin" wrote:
> 
> Hello, John.
> My comments are below.
> 
> ----- Original Message -----
> From: "Dr. John Takacs" <[EMAIL PROTECTED]>
> To: <[EMAIL PROTECTED]>
> Sent: Thursday, June 21, 2001 11:47 AM
> Subject: [aseek-users] New feature request: Automatically delete bad URLs
> 
> > Greetings!
> >
> > I'd like to request this as a new feature, but hopefully I'm mistaken and
> > it already is a feature.
> >
> > Something that I have been wondering about as my document total gets
> > larger is why doesn't aspseek automatically delete or get rid of bad URLs?
> >
> > For example, let's say I index 10,000 URLs but 1,000 of these are bad
> > (site doesn't exist).  Well the way I see my aspseek performing on my
> > server, aspseek constantly goes back and tries to index these 1,000 URLs.
> > This isn't such a big deal now, but what if I had 500,000 URLs and 50,000
> > were bad?  50,000 attempts to reindex bad URLs every time you run index
> > seems like a waste of resources, bandwidth, time, etc.  Can't aspseek keep
> > track of the fact that it couldn't retrieve pages for the URL after n
> > tries and simply delete the URL from its ToDo list?
> 
>     We can't delete the URL completely, because ASPseek will add it again
> after indexing a URL that refers to it.
>     A better solution is to increase the reindexing period of each bad URL
> after each reindexing, using an algorithm that takes some new
> configuration settings into account.
> 
>     Alexander.

What Alexander really means is that there should be some options like this:

BadUrlPeriodAbsoluteIncrease
BadUrlPeriodPerCentIncrease
BadUrlPeriodMax

so that the next indexing time for a "bad URL" grows with every reindexing
(as long as the URL is still "bad"), and after some time you won't notice it.
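
A minimal sketch of how such options might combine, in Python. None of this
exists in ASPseek; the function names are hypothetical, and the semantics
(a fixed increase plus a percentage of the current period, capped at a
maximum, all in seconds) are only an assumption about what the three options
would mean:

```python
# Hypothetical sketch only: ASPseek has no such options today. The parameters
# mirror the proposed names BadUrlPeriodAbsoluteIncrease,
# BadUrlPeriodPerCentIncrease and BadUrlPeriodMax; their exact meaning and
# units (seconds) are assumptions made for illustration.

def next_bad_url_period(current_period, absolute_increase,
                        percent_increase, period_max):
    """Return the next reindexing period for a URL that is still bad."""
    grown = (current_period
             + absolute_increase
             + current_period * percent_increase // 100)
    # Never wait longer than the configured maximum.
    return min(grown, period_max)


def backoff_schedule(initial_period, absolute_increase,
                     percent_increase, period_max, failures):
    """List the periods used after each consecutive failed reindexing."""
    periods = []
    period = initial_period
    for _ in range(failures):
        period = next_bad_url_period(period, absolute_increase,
                                     percent_increase, period_max)
        periods.append(period)
    return periods
```

Once a URL becomes reachable again, its period would presumably drop back to
the normal value, but that part is not sketched here.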

If you want this feature, it can be done, but it would be a matter of custom
development, because most users are fine with the current behaviour.

--  [EMAIL PROTECTED]  http://kir.sever.net   ICQ 7551596  --
Bend the facts to fit the conclusion. It's easier that way.
--  |_ | |\| |_| ><   --  |_| |\| | ><   --   | ) |\/|   --
