According to Norbert Hartl:
>Hi there,
>
>i discovered a strange problem with $(MODIFIED).
>I am indexing our site via htdig. We are storing our files in an
>archive with schema of /year/month/day/. While I am not willing
>to let htdig decide what URLs are new I am solving this via 
>start_urls and limit_url.
>I am doing this via 
>find /page/archive/year/month/day -name "*.html" > /htdig/tmp/start_urls
>After this I am copying start_urls to limit_url which is working
>great. To prevent any logging to the Weblogs I do also a
>local_urls:  http:/host/=/doc/root
>
>I made this setup on our internal host and after it was working
>on the Web Server.  The difference between the two hosts is that
>on the internal host there is always the modified information
>while on the Web Server there is only sometimes the modified
>value filled out. I would say on the Webserver there are only
>5 percent of the URLs shown with the modified value.
>
>Any suggestions?

This looks more like a server problem, not a problem with $(MODIFIED).
$(MODIFIED) is based upon the last modification date of a document.
If you store your documents in a YYYY/MM/DD directory scheme, you should
also make sure that they have the correct file modification time.
If you have the possibility of using some server-side programming, you
could force the modification time of a document in this directory tree
to always match the storing scheme.  Try telnet'ing to your server and
request some of the documents in question.  Check what your server returns
for the modification date of that document.


hth,
  Torsten

--
InWise - Wirtschaftlich-Wissenschaftlicher Internet Service GmbH
Waldhofstra�e 14                            Tel: +49-4101-403605
D-25474 Ellerbek                            Fax: +49-4101-403606
E-Mail: [EMAIL PROTECTED]            Internet: http://www.inwise.de

------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the SUBJECT of the message.

Reply via email to