Re: [htdig] problem indexing a site

2001-01-17 Thread Geoff Hutchison

On Wed, 17 Jan 2001, Elsa Chan wrote:

 not indicate any errors. It looks like as follows: 
 1:0:http://www.site.net/
 New server: www.site.net, 80
 It just sits there for a long while.

The first thing you should check is if you can contact this site with
another browser, e.g. lynx, Netscape, etc. The first thing htdig must
do is to retrieve the robots.txt file from the server. So if you cannot
connect to the server using other means, htdig will not be able to either
and you will have to look at networking issues.

That said, it should not just "hang" since there is a timeout set in the
connection code and the 3.1.5 version should be good about killing
connections if they timeout. How long is "a long while?"

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/



To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




RE: [htdig] problem indexing a site

2001-01-17 Thread Geoff Hutchison


If you aren't using port 80, you will need to set this in the start_url,
e.g.:

start_url: http://www.foo.com:81/

Cheers,
--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/

On Wed, 17 Jan 2001, Elsa Chan wrote:

 It just hangs for 10 to 15 minutes.
 
 If port 80 is not what we use, do I go and change this in the robots.txt
 file?
 
 Where is this file?
 
 Thanks



To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




RE: [htdig] problem indexing a site

2001-01-17 Thread Elsa Chan

I try that but I still get the same message. 

1:0:http://www.site.com
New Server www.site.com , 80
And it hangs there, I also try putting the url in quotes as well in the
config file. 

Thanks




-Original Message-
From: Geoff Hutchison [EMAIL PROTECTED]
To: Elsa Chan [EMAIL PROTECTED]
CC: [EMAIL PROTECTED] [EMAIL PROTECTED]
Sent: Wed Jan 17 11:42:00 2001
Subject: RE: [htdig] problem indexing a site


If you aren't using port 80, you will need to set this in the start_url,
e.g.:

start_url: http://www.foo.com:81/

Cheers,
--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/

On Wed, 17 Jan 2001, Elsa Chan wrote:

 It just hangs for 10 to 15 minutes.
 
 If port 80 is not what we use, do I go and change this in the robots.txt
 file?
 
 Where is this file?
 
 Thanks


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




RE: [htdig] problem indexing a site

2001-01-17 Thread Geoff Hutchison

On Wed, 17 Jan 2001, Elsa Chan wrote:

 1:0:http://www.site.com
 New Server www.site.com , 80

I think we need to see your config file--if you did change your
htdig.conf, then you have done it in a manner that htdig does not
recognize.

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/



To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




Re: [htdig] problem indexing a site - no errors but nothing is

1999-09-10 Thread Geoff Hutchison


On Fri, 10 Sep 1999, Jay Tsao wrote:

 sites within our intranet.  I am running with -v output but the output does
 not indicate any errors.  It looks like as follows:
 
 New server: site1.hp.com, 80
 
 New server: site2.hp.com, 80
 0:0:0:http://site2.hp.com/:
 *+*+++--++-+++--+---+-+-- size = 17070

You'll probably see what's going on better with -vvv or -. This will
show the connection status, any HTTP headers, and the results of the
robots.txt file.

-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/



To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word unsubscribe in
the SUBJECT of the message.



Re: htdig: problem indexing secure site

1998-12-16 Thread Denis Bazinet


 Tried setting the variable local_urls, so
 http://bla.bla.com/=/directory/path/to/DocumentRoot


By secure, I believe you mean you implement SSL.  But shouldn't your site
to index be https:// ?

--
---
Denis Bazinet, Systems Administrator
Online Strategy  Development
Bell Emergis

[EMAIL PROTECTED]
(613) 781-3974


--
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the body of the message.