Hi again,
I ran htdig with -vvv, and here is what I'm getting:
-------------------------------
bash-2.04$ /usr/local/htdig-3.2.0b3/bin/htdig -vvv -s -a -c /usr/local/htdig-3.2.0b3/conf/htdig.conf
ht://dig Start Time: Wed Sep 26 11:13:16 2001
1:1:http://dev-www.internal/search/htdig_show_issue_links.php?issue_id=4136&start=0
New server: dev-www.internal, 80
- Persistent connections: enabled
- HEAD before GET: disabled
- Timeout: 30
- Connection space: 0
- Max Documents: -1
- TCP retries: 1
- TCP wait time: 5
Trying to retrieve robots.txt file
Making HTTP request on http://dev-www.internal/robots.txt
Request time: 0 secs
pushed
pick: dev-www.internal, # servers = 1
> dev-www.internal supports HTTP persistent connections (infinite)
0:2:0:http://dev-www.internal/search/htdig_show_issue_links.php?issue_id=4136&start=0: Making HTTP request on http://dev-www.internal/search/htdig_show_issue_links.php?issue_id=4136&start=0
Request time: 1 secs
title: 1
href: http://dev-www.internal/search/htdig_show_issue_links.php?paper_name=blah&ca=34&si=377385&issue_id=4136&start=1 (ne)
resolving 'http://dev-www.internal/search/htdig_show_issue_links.php?paper_name=blah&ca=34&si=377385&issue_id=4136&start=1'
pushing http://dev-www.internal/search/htdig_show_issue_links.php?paper_name=blah&ca=34&si=377385&issue_id=4136&start=1
+ size = 231
1:3:1:http://dev-www.internal/search/htdig_show_issue_links.php?paper_name=blah&ca=34&si=377385&issue_id=4136&start=1: Making HTTP request on http://dev-www.internal/search/htdig_show_issue_links.php?paper_name=blah&ca=34&si=377385&issue_id=4136&start=1
Request time: 0 secs
title: Farmers Rights Must Not be Undermined
href: http://dev-www.internal/search/htdig_show_issue_links.php?paper_name=blah&ca=47&si=377387&issue_id=4136&start=2 (next)
resolving 'http://dev-www.internal/search/htdig_show_issue_links.php?paper_name=blah&ca=47&si=377387&issue_id=4136&start=2'
pushing http://dev-www.internal/search/htdig_show_issue_links.php?paper_name=blah&ca=47&si=377387&issue_id=4136&start=2
+ size = 1599
2:4:2:http://dev-www.internal/search/htdig_show_issue_links.php?paper_name=blah&ca=47&si=377387&issue_id=4136&start=2: Making HTTP request on http://dev-www.internal/search/htdig_show_issue_links.php?paper_name=blah&ca=47&si=377387&issue_id=4136&start=2
Request time: 0 secs
.......
pick: dev-www.internal, # servers = 1
> dev-www.internal supports HTTP persistent connections (infinite)
htdig: Run complete
htdig: 1 server seen:
htdig: dev-www.internal:80 52 documents
HTTP statistics
===============
Persistent connections : Yes
HEAD call before GET : No
Connections opened : 2
Connections closed : 1
Changes of server : 0
HTTP Requests : 53
HTTP KBytes requested : 11.1953
HTTP Average request time : 0.113208 secs
HTTP Average speed : 1.86589 KBytes/secs
ht://dig End Time: Wed Sep 26 11:14:09 2001
--------------------------------
I don't know whether this is the output I should expect. Nothing in it
mentions the Last-Modified header.
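
One way to check this mechanically, rather than by eye, is to save the -vvv output to a file and search it for the header name. The following is only a minimal Python sketch: the function name find_header_lines is hypothetical, and it makes no assumption about how htdig formats response headers in its debug output. It simply reports any line that mentions the header, case-insensitively. The sample text is taken from the output above.

```python
def find_header_lines(log_text, header="last-modified"):
    """Return (line_number, line) pairs for lines that mention the header name."""
    needle = header.lower()
    return [
        (number, line.rstrip())
        for number, line in enumerate(log_text.splitlines(), start=1)
        if needle in line.lower()
    ]


# A few lines copied from the posted output; none of them mentions the header.
sample = """Request time: 0 secs
title: Farmers Rights Must Not be Undermined
+ size = 1599
"""

matches = find_header_lines(sample)
print(matches)
```

An empty result only means the header name does not appear in the text that was searched; it does not by itself show whether the server sent the header.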
thanks
Sean
-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] On Behalf Of Gilles Detillieux
Sent: Tuesday 25 September 2001 11:28 PM
To: [EMAIL PROTECTED]
Cc: [EMAIL PROTECTED]
Subject: Re: [htdig] Last Modified Date
According to Sean Downey:
> I'm having a problem in that htdig is not picking up the last modified date
> in my documents
>
> I'm using htdig to index PHP documents, in the php code I'm setting the last
> modified date
> The date is working in the documents because I can see it in the page info
> in Netscape
> I believe that the format is correct
>
> Last-Modified: Thu, 30 Dec 1999 00:00:00 GMT
>
> Something I've seen that could be related is:
> HEAD before GET: disabled
>
> which appears in the output from htdig - but I can't find a reference to
> that anywhere in the documentation
http://www.htdig.org/dev/htdig-3.2/attrs.html#head_before_get
But that's extremely unlikely to be the problem here. I suggest looking
through the output of htdig -vvv to see the headers returned by the server
for some of these PHP documents, and how htdig parses the dates in these
Last-Modified headers. If the headers look fine, but htdig isn't parsing
them correctly, please try the latest 3.2.0b4 development snapshot to see
if that helps. If the problem persists with the latest snapshot, please
get back to us with details about the system you're running htdig on (OS,
distribution if applicable, hardware, etc.), as well as the settings in
include/htconfig.h for HAVE_TIMEGM and HAVE_STRPTIME.
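
To illustrate what a correct parse of such a header should produce, here is a minimal Python sketch. It is not htdig's own code; it only mirrors the general strptime-then-timegm approach that the two settings above refer to, using the header value quoted earlier in this thread. The names HTTP_DATE_FORMAT and parse_http_date are hypothetical.

```python
import calendar
import time

# Format of the date quoted in this thread: "Thu, 30 Dec 1999 00:00:00 GMT".
# Note: %a and %b depend on the locale; this assumes English day/month names.
HTTP_DATE_FORMAT = "%a, %d %b %Y %H:%M:%S GMT"


def parse_http_date(value):
    """Convert a Last-Modified value in the format above to seconds since the epoch (UTC)."""
    parsed = time.strptime(value.strip(), HTTP_DATE_FORMAT)
    # timegm interprets the broken-down time as UTC, unlike mktime,
    # which would apply the local time zone.
    return calendar.timegm(parsed)


header_value = "Thu, 30 Dec 1999 00:00:00 GMT"
timestamp = parse_http_date(header_value)
print(timestamp)  # 946512000
```

If a date in this format parses to a different value, or fails to parse at all, the fault is more likely in the date-handling routines than in the header itself.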
--
Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba Phone: (204)789-3766
Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to
<[EMAIL PROTECTED]> with a subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html