https://bugzilla.wikimedia.org/show_bug.cgi?id=59222
Web browser: ---
Bug ID: 59222
Summary: Request to access redacted webproxy logfiles of (Tool)
Labs
Product: Wikimedia Labs
Version: unspecified
Hardware: All
OS: All
Status: UNCONFIRMED
Severity: normal
Priority: Unprioritized
Component: General
Assignee: [email protected]
Reporter: [email protected]
CC: [email protected], [email protected],
[email protected]
Classification: Unclassified
Mobile Platform: ---
I want to integrate the pagecounts of Tool Labs resp. Labs into tool
https://tools.wmflabs.org/wikiviewstats/ .For this, it would be necessary to
have access to redacted webproxy logs, which include old web (apache) and new
web (lighttpd) setups.
It would be very helpful, if these logs could be structured in the same way as
the current pagecount-dumps and be released on an per hour basis.
Further suggestions:
- identifier could be toollabs resp. labs.toollabs
- querystring part of url (?xyz=..) should be removed completely
Reference:
1.) IRC Petan Jan 2, 2014
2.) WIP: Tools: Add infrastructure for AWStats
https://gerrit.wikimedia.org/r/#/c/80332/
3:) IRC scfc_de Jan 2, 2014
scfc_de: hedonil: I hope to have finished puppetizing tools-webproxy by the end
of the week (the AWStats stuff is done IIRC). As -webproxy is the heart of the
web access, review & deployment will then be *very* careful :-), but in
general, depending on Coren's schedule, it should be deployable by between the
end of next week and the end of the month.
The current pagecount-dumps are generated on an per hour basis and share the
following structure:
filename eg:
pagecounts-20140101-020000.gz
1.: identifier 2.: pagetitle 3.: hits 4.: bytes
En.d perform 3 60088
En.d rainforest 3 33780
En.d servers 3 22471
En.d situation 1 107043
En.d upwards 1 32565
En.d variety 2 59495
En Allergy 3 324964
En Arthur_Rubinstein 1 0
En Article 1 0
En British_cuisine 1 191021
hierarchical structure of identifier
en - Wikipedia (en)
en.b - Wikibooks (en)
en.d - Wikdionary (en)
en.n - Wikinews (en)
etc.
--
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l