Author: coar
Date: Tue Mar  9 03:26:27 2010
New Revision: 920643

URL: http://svn.apache.org/viewvc?rev=920643&view=rev
Log:
Start documenting

Added:
    labs/pulse/README.txt

Added: labs/pulse/README.txt
URL: http://svn.apache.org/viewvc/labs/pulse/README.txt?rev=920643&view=auto
==============================================================================
--- labs/pulse/README.txt (added)
+++ labs/pulse/README.txt Tue Mar  9 03:26:27 2010
@@ -0,0 +1,62 @@
+The Pulse lab project collects information from the Apache Software
+Foundation's mailing lists, such as the numer of subscribers, who the
+moderators are, and so on.  The information is loaded into a MySQL
+database, and is used to generate reports and graphs for the Web site
+(http://Pulse.Apache.Org/).  In addition, the information can be
+queried via email requests.
+
+The project is broken into several parts:
+
+o Data collection
+-----------------
+This occurs on the Mail.Apache.Org system, under the username
+'apmail'.  The '~apmailbin/collect-listinfo.sh' script collects the
+info and records it in XML, and then sends it to
+'[email protected]' eddress.  Qmail forwards it according to
+'~apmail/.qmail-collect-listinfo'.  Currently that points to
+<[email protected]>.
+
+o Analysis
+----------
+On Pulse.Apache.Org, the eddress <pulse-load> feeds the
+'pulse/analysis/pulse-demail.pl' script, which strips the mail
+information from the XML data and stores the latter in the
+'pulse/data/queue/' directory.
+
+A cron job running as the 'pulsestat' user runs the
+'pulse/analysis/pulse-load.pl' script to load files from the queue
+into the database.  After loading, the files are moved to the
+'pulse/data/processed/' directory.
+
+o Generation
+------------
+'Generation' only applies to the Web-based reports, and is centred on
+the 'pulse/web-reports/' directory.  The key here is the 'mlists.php'
+script, which is run each day after all queued data files have been
+processed.  Also in this directory are the files that should be copied
+to the Web document tree (such as the CSS files).
+
+o Report upload
+Uploading is performed by the 'mlists.php' script mentioned in the
+previous section.  See the cron files for details.
+
+o Email query
+TBS
+
+Files
+=====
+ o SVN Repository:
+    https://svn.apache.org/repos/asf/labs/pulse/
+
+ o Working directory
+   /usr/local/labs/pulse
+       /analysis                Scripts to unpack data and load them
+                                into the DB
+       /collection              Copies of scripts used on Mail.Apache.Org
+       /cron.d                  Cron jobs
+       /data                    
+           /queue               XML files awaiting processing
+           /processed           XML files after loading
+       /email-query             Scripts used to answer email queries
+       /test                    Stuff for testing
+       /web-reports             Scripts for generating statistics for Web



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to