Hi, I've been using ht://dig for a while now, and it's been quite good to me, which is why I haven't been saying anything :-)
Anyhow, I recently noticed that my latest update dig (started 1 September) was still running. Normally, a run takes some 5-6 hours, not 15 days! I killed the htdig process (I make sure to use alternate working files for exactly this reason) and figured out why it was still running, blocked the &[EMAIL PROTECTED] PHP script that was feeding htdig an infinite selection of links using the exclude_urls list, and tried to re-start. As you may have guessed by the fact that I'm emailing the list with a question, it didn't work. Using the identical command line as used in the cron job (htdig -sai) I got the error: htdig: Retriever.cc:79: Retriever::Retriever(RetrieverLog = Retriever_noLog): Assertion `l && buffer[l -1] == '\n'' failed. Aborted Substituting a 'v' for the 's' showed that it found the first new server, then ended with the same error message. Adding a ridiculous number of 'v's (7 total because I didn't remember the highest debug level) showed that it connected to the server and got the robots.txt file correctly, then started parsing the default page; about 19,000 lines worth (according to less) of URLs were pushed into htdig's list of URLs to fetch - every one of them containing the string I had just added to the exclude_urls list. sample line: (I added 'PNphpBB2' to the exclude_urls list.) 2:1:http://community.jedit.oss/index.php?name=PNphpBB2&file=login&sid=8637090206340df675599fd5fb7d21ed pushed All lines are variants on this, differing after the 'file=', and all are reported as "pushed". Any clues? Any more information needed? My search engine is available at http://paradox.homeip.net/htdig/ (just in case you needed to know that for some reason...) Thanks in advance, -Rhonda -- www.write-on.indy || www.write-on.org \/ http://history.ubcengineers.ca/ Discuss the art and craft of writing /\ UBC Engineers History Project That's the problem with world domination. Nobody is willing to wait for it anymore, work slowly towards it, drink more and enjoy the ride more. ------------------------------------------------------- This sf.net email is sponsored by:ThinkGeek Welcome to geek heaven. http://thinkgeek.com/sf _______________________________________________ ht://Dig general mailing list: <[EMAIL PROTECTED]> ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html List information (subscribe/unsubscribe, etc.) https://lists.sourceforge.net/lists/listinfo/htdig-general

