https://bugzilla.wikimedia.org/show_bug.cgi?id=59222

       Web browser: ---
            Bug ID: 59222
           Summary: Request to access redacted webproxy logfiles of (Tool)
                    Labs
           Product: Wikimedia Labs
           Version: unspecified
          Hardware: All
                OS: All
            Status: UNCONFIRMED
          Severity: normal
          Priority: Unprioritized
         Component: General
          Assignee: wikibugs-l@lists.wikimedia.org
          Reporter: metat...@online.ms
                CC: abog...@wikimedia.org, benap...@gmail.com,
                    rlan...@gmail.com
    Classification: Unclassified
   Mobile Platform: ---

I want to integrate the pagecounts of Tool Labs (or of Labs in general) into the
tool https://tools.wmflabs.org/wikiviewstats/ . For this, it would be necessary
to have access to redacted webproxy logs that cover both the old web (Apache)
and the new web (lighttpd) setups.

It would be very helpful if these logs could be structured in the same way as
the current pagecount dumps and be released on an hourly basis.

Further suggestions:
- The identifier could be toollabs or labs.toollabs
- The query-string part of the URL (?xyz=..) should be removed completely
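
To illustrate the two suggestions above, here is a minimal sketch of how a
logged URL could be redacted and turned into a pagecount-style line. The
function names, the choice of labs.toollabs as identifier, and the use of the
URL path as the title are assumptions for illustration only, not an existing
implementation.

```python
from urllib.parse import urlsplit

# Assumption: one of the identifiers suggested above.
IDENTIFIER = "labs.toollabs"


def redact_url(url):
    """Return only the path of the URL, dropping query string and fragment."""
    return urlsplit(url).path


def to_pagecount_line(url, hits, bytes_sent):
    """Format one aggregated entry as 'identifier title hits bytes'."""
    # Assumption: the redacted path without its leading slash serves as title.
    title = redact_url(url).lstrip("/")
    return f"{IDENTIFIER} {title} {hits} {bytes_sent}"


example_url = "https://tools.wmflabs.org/wikiviewstats/?xyz=1"
print(to_pagecount_line(example_url, 3, 1024))
```

Dropping everything after the path keeps user-supplied parameters out of the
published data, which is the point of the redaction.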


Reference:
1.) IRC Petan Jan 2, 2014

2.) WIP: Tools: Add infrastructure for AWStats
https://gerrit.wikimedia.org/r/#/c/80332/

3.) IRC scfc_de Jan 2, 2014
scfc_de: hedonil: I hope to have finished puppetizing tools-webproxy by the end
of the week (the AWStats stuff is done IIRC).  As -webproxy is the heart of the
web access, review & deployment will then be *very* careful :-), but in
general, depending on Coren's schedule, it should be deployable by between the
end of next week and the end of the month.


The current pagecount dumps are generated on an hourly basis and share the
following structure:

Filename, e.g.:
pagecounts-20140101-020000.gz

1.: identifier  2.: pagetitle  3.: hits  4.: bytes

En.d perform 3 60088
En.d rainforest 3 33780
En.d servers 3 22471
En.d situation 1 107043
En.d upwards 1 32565
En.d variety 2 59495
En Allergy 3 324964
En Arthur_Rubinstein 1 0
En Article 1 0
En British_cuisine 1 191021
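
As a rough sketch of how a consumer such as wikiviewstats might read this
format, the following parses the four space-separated fields of each line. The
names PageCount and parse_pagecount_line are hypothetical, and the sketch
assumes that titles contain no spaces.

```python
from typing import NamedTuple


class PageCount(NamedTuple):
    identifier: str
    title: str
    hits: int
    size: int  # bytes


def parse_pagecount_line(line):
    """Split one line into identifier, page title, hits and bytes."""
    # Assumption: exactly four whitespace-separated fields per line.
    identifier, title, hits, size = line.split()
    return PageCount(identifier, title, int(hits), int(size))


sample = """En.d perform 3 60088
En Allergy 3 324964
En Arthur_Rubinstein 1 0"""

records = [parse_pagecount_line(line) for line in sample.splitlines()]
total_hits = sum(record.hits for record in records)
print(total_hits)
```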

Hierarchical structure of the identifier:

en    - Wikipedia   (en)
en.b  - Wikibooks   (en)
en.d  - Wiktionary  (en)
en.n  - Wikinews    (en)
etc.
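
The identifier scheme above could be decoded roughly as follows. This is only
a sketch: the mapping contains just the four projects listed here, the function
name is hypothetical, and lowercasing the language code is an assumption made
because the sample data shows both "En" and "en" spellings.

```python
# Only the suffixes listed above; an empty suffix means Wikipedia.
PROJECT_SUFFIXES = {
    "": "Wikipedia",
    "b": "Wikibooks",
    "d": "Wiktionary",
    "n": "Wikinews",
}


def decode_identifier(identifier):
    """Split 'en.d' into a language code and a project name."""
    language, _, suffix = identifier.partition(".")
    # Suffixes not in the mapping are reported as 'unknown'.
    project = PROJECT_SUFFIXES.get(suffix, "unknown")
    return language.lower(), project


print(decode_identifier("en.d"))
print(decode_identifier("En"))
```

A new identifier such as labs.toollabs would fit the same dotted pattern and
would only need its own entry in the mapping.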
