Matthias W. wrote:
Hi, I've got a text file with all the URLs to index; I don't want to crawl URLs before indexing.
Just having the URLs isn't the same as having an index; you would still need to fetch them. You can inject your URL list into a clean crawldb and fetch only those URLs with the inject, generate, and fetch commands. Then you can use the index command to index them.
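A minimal sketch of that sequence, assuming a Nutch 0.8/1.x-style command line; the crawl/ and urls/ paths are just placeholders (urls/ holds your text file), and exact arguments can vary by version:

  # Seed a fresh crawldb from the URL list; urls/ contains the text file.
  bin/nutch inject crawl/crawldb urls

  # One generate/fetch round fetches exactly the injected URLs,
  # so no outlinks get crawled.
  bin/nutch generate crawl/crawldb crawl/segments
  s=`ls -d crawl/segments/* | tail -1`
  bin/nutch fetch $s

  # Update the crawldb, build the linkdb, and index the segment.
  bin/nutch updatedb crawl/crawldb $s
  bin/nutch invertlinks crawl/linkdb $s
  bin/nutch index crawl/index crawl/crawldb crawl/linkdb $s

Because you stop after a single generate/fetch round, nothing beyond your injected list gets fetched.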
How to do this? Also, I'm creating an index in a temporary folder, and on success I want to overwrite the old index. How do I check in the shell script whether the crawl (index) command was successful?
You could check the size of the new index. You could also check it programmatically through Lucene, e.g. by opening it with an IndexReader and verifying the document count.
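For the shell side, here is a sketch assuming the bin/nutch wrapper propagates the JVM's exit status (worth verifying on your version); the NEW_INDEX and LIVE_INDEX paths are hypothetical, and $s is the fetched segment from the commands above:

  #!/bin/sh
  NEW_INDEX=/tmp/index.new     # index built in the temporary folder
  LIVE_INDEX=/data/index       # index your searcher reads

  bin/nutch index "$NEW_INDEX" crawl/crawldb crawl/linkdb $s
  # A nonzero exit status means the indexer failed.
  if [ $? -ne 0 ]; then
      echo "indexing failed, keeping old index" >&2
      exit 1
  fi

  # Crude size check: older Lucene index directories contain a
  # 'segments' file; a missing or empty one means the index is unusable.
  if [ ! -s "$NEW_INDEX/segments" ]; then
      echo "new index looks empty, keeping old index" >&2
      exit 1
  fi

  # Swap the new index into place, keeping the old one as a backup.
  rm -rf "$LIVE_INDEX.old"
  [ -d "$LIVE_INDEX" ] && mv "$LIVE_INDEX" "$LIVE_INDEX.old"
  mv "$NEW_INDEX" "$LIVE_INDEX"

Note that a running searcher usually keeps the old index files open, so it needs to reopen (or be restarted) after the swap to pick up the new index.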
Dennis
