[PERFORM] Need for speed 3

Ulrich Wisser Thu, 01 Sep 2005 06:25:43 -0700

Hi again,

first I want to say ***THANK YOU*** for everyone who kindly shared theirthoughts on my hardware problems. I really appreciate it. I started tolook for a new server and I am quite sure we'll get a serious hardware"update". As suggested by some people I would like now to look closer atpossible algorithmic improvements.

My application basically imports Apache log files into a Postgresdatabase. Every row in the log file gets imported in one of three (rawdata) tables. My columns are exactly as in the log file. The import isrun approx. every five minutes. We import about two million rows a month.


Between 30 and 50 users are using the reporting at the same time.

Because reporting became so slow, I did create a reporting table. Inthat table data is aggregated by dropping time (date is preserved), ip,referer, user-agent. And although it breaks normalization some data froma master table is copied, so no joins are needed anymore.

After every import the data from the current day is deleted from thereporting table and recalculated from the raw data table.



Is this description understandable? If so

What do you think of this approach? Are there better ways to do it? Isthere some literature you recommend reading?


TIA

Ulrich


---------------------------(end of broadcast)---------------------------
TIP 5: don't forget to increase your free space map settings

[PERFORM] Need for speed 3

Reply via email to