Before re-injecting a new set of urls to webdb, I'll wait until all fetch operations (generate + fetch + updatedb) are done. I'm not sure it's necessary or not, but it's cleaner.

One more question: Should I run UpdateSegmentsFromDb to update the segments before any new injection? Does segment updating affect url injection and fetch list generation?
-AJ

Jay Lorenzo wrote:

What about the issue of maintaining some semblance of ACIDity? Don't you have to make sure that the generation of fetchlists and the updates are run synchronously, ie one update or generate at a time?

On 9/13/05, Michael Ji <[EMAIL PROTECTED]> wrote:
I think this scenario will work.

Just a bit worry about the filter performance if the
domain site number is in scale of thundreds of
thousands.

Michael Ji

--- AJ Chen <[EMAIL PROTECTED]> wrote:

Once I create a webDB, can I inject new root urls to
the same webDB
repeatly? After each injection, run as many cycles
of
generate/fetch/updatedb to fetch all web pages from
the new sites. I think
this will allow me to gradually build a
comprehensive vertical site. Any
comment or suggestion?
-AJ



__________________________________
Yahoo! Mail - PC Magazine Editors' Choice 2005
http://mail.yahoo.com



--
AJ (Anjun) Chen, Ph.D.
Canova Bioconsulting Marketing * BD * Software Development
748 Matadero Ave., Palo Alto, CA 94306, USA
Cell 650-283-4091, [EMAIL PROTECTED]
---------------------------------------------------

Reply via email to