Wahib Nackad's bits of Sat, 9 Mar 2002 translated to: >Yes, Im really tired. Can someone try to index the www.workers.org and let >me know how much result of indexed page he receive. This will surelly help >me to know if the problem comes from the web server or htdig. > ... >>more >> >than 503 pages even if we have 5500 pages. The config is as follow: >> >start_url: http://www.workers.org/ >> >limit_urls_to: ${start_url}
Do you have any links from the main pages to your archives (e.g. /ww/2000/ /ww/2001/)? ht://Dig locates documents by following links, so unless there is some way to traverse links from www.workers.org/ to the files in these directories, the files will not be indexed. I suspect that this is your problem. You might want try adding some of the other directories to your start_url and see if that makes a difference. For example start_url: http://www.workers.org/ http://www.workers.org/ww/2000/ Jim _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

