Hi,

I've been using ht://dig for a while now, and it's been quite good to
me, which is why I haven't been saying anything :-)

Anyhow, I recently noticed that my latest update dig (started 1
September) was still running.  Normally, a run takes some 5-6 hours, not
15 days!  I killed the htdig process (I make sure to use alternate
working files for exactly this reason) and figured out why it was still
running, blocked the &[EMAIL PROTECTED] PHP script that was feeding htdig an infinite
selection of links using the exclude_urls list, and tried to re-start.
As you may have guessed by the fact that I'm emailing the list with a
question, it didn't work.

Using the identical command line as used in the cron job (htdig -sai) I
got the error:

htdig: Retriever.cc:79: Retriever::Retriever(RetrieverLog = Retriever_noLog): 
Assertion `l && buffer[l -1] == '\n'' failed.  Aborted

Substituting a 'v' for the 's' showed that it found the first new
server, then ended with the same error message.

Adding a ridiculous number of 'v's (7 total because I didn't remember
the highest debug level) showed that it connected to the server and got
the robots.txt file correctly, then started parsing the default page;
about 19,000 lines worth (according to less) of URLs were pushed into
htdig's list of URLs to fetch - every one of them containing the string
I had just added to the exclude_urls list.

sample line: (I added 'PNphpBB2' to the exclude_urls list.)
2:1:http://community.jedit.oss/index.php?name=PNphpBB2&file=login&sid=8637090206340df675599fd5fb7d21ed
 pushed

All lines are variants on this, differing after the 'file=', and all are
reported as "pushed".

Any clues?  Any more information needed?  My search engine is available
at http://paradox.homeip.net/htdig/ (just in case you needed to know
that for some reason...)

Thanks in advance,
  -Rhonda
-- 
 www.write-on.indy || www.write-on.org   \/  http://history.ubcengineers.ca/
  Discuss the art and craft of writing   /\   UBC Engineers History Project
   That's the problem with world domination. Nobody is willing to wait for 
   it anymore, work slowly towards it, drink more and enjoy the ride more.


-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
ht://Dig general mailing list: <[EMAIL PROTECTED]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general

Reply via email to