On Wed, 12 Sep 2001, Geoff Hutchison wrote:
> At 12:25 AM -0700 9/12/01, Rhonda Hyslop wrote:
> >started going too far. In addition to requesting www.google.com, it also
>
> I'm a little suspicious about the "gle" portion of google.com which
> certainly resembles the glue you mention. Are you sure of both the
> limit_urls_to and the URL? (i.e. it's not www.googlue.com or
> somesuch?) Or do you have a ".goo" pattern?
I'm quite certain about the patterns; in fact, I copy/pasted them to make
sure I had them typed exactly the same.
> Keep in mind that you can get more concrete input from htdig when
> indexing by adding verbose flags, e.g. "htdig -vvv". In particular,
> these will give reasons for rejecting URLs.
3*v is the level it starts giving reasons? I'll try that next time around
then :)
Anyhow, I told htdig that .com wasn't allowed, and it doesn't seem to have
hit google this time around.
Thanks,
-Rhonda
--
That's the problem with world domination... Nobody is willing to
wait for it anymore, work slowly towards it, drink more and enjoy
the ride more
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html