We have a website that is hosted on multiple machines (load balanced). On the hour we rotate the access_logs and pull them down to a backend machine where we do some simple stats on them ("grep foo | wc-l" type stuff). The logs are stored in a tree:
<machine_name>/<year>/<month>/<day>/log_1am
<machine_name>/<year>/<month>/<day>/log_2am
<machine_name>/<year>/<month>/<day>/log_3am
...
We've been manually running analog/rmagic from time to time, but now I am interested in having Analog run on an hourly or daily basis.
I have looked at the various helper apps/scripts for calling analog via cron, but from what I can see many of them are designed for single-log-multi-vhost type setups rather than ours. I've tried figuring out how to nicely map these scripts onto our scenario, but I have a feeling that I'm missing something. None of these configuration seem to use cache files at all, and I think this is something that we should be making use of (??)
Does anyone have suggestions on an approach that I should be taking? Ideally, we'd like to maintain all historical reports sot hat we can take old log files offline (archive to tape).
Thanks in advance,
greg.fenton
--
greg.fenton ^_at_^ gmail (yes, dot com)
+------------------------------------------------------------------------ | TO UNSUBSCRIBE from this list: | http://lists.meer.net/mailman/listinfo/analog-help | | Analog Documentation: http://analog.cx/docs/Readme.html | List archives: http://www.analog.cx/docs/mailing.html#listarchives | Usenet version: news://news.gmane.org/gmane.comp.web.analog.general +------------------------------------------------------------------------

