[Bug 27807] Restore missing CheckUser logs

2014-01-22 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=27807

Sam Reed (reedy)  changed:

   What|Removed |Added

 CC||s...@reedyboy.net

--- Comment #11 from Sam Reed (reedy)  ---
So, Legoktm and I were just looking at it. There's a few broken entries that
can be easily fixed with common sense (newlines in the middle and such).

The date regex is fair to naive to cater for all the localised date formats.

$rxTimestamp = '(?P\d+:\d+, \d+ \w+ \d+)';

We tried using '(?P.*?)'. It's a bit better, but with the optional
comma after, but then causes issues with dates with early commas

[bad timestamp] ۲۱:۲۰, ۲۰ اکتبر ۲۰۰۶ Jon Harald Søby got edits
XXX.XXX.XXX.XXX on fawiki

And others such as 2006-10-25T20:29:01

$regexes = array(
'ipedits-xff' => "!^$rxTimestamp,? $rxUser got edits for XFF
$rxTarget on $rxWiki$rxReason!",
'ipedits' => "!^$rxTimestamp,? $rxUser got edits for" ."
$rxTarget on $rxWiki$rxReason!",
'ipusers-xff' => "!^$rxTimestamp,? $rxUser got users for XFF
$rxTarget on $rxWiki$rxReason!",
'ipusers' => "!^$rxTimestamp,? $rxUser got users for" ."
$rxTarget on $rxWiki$rxReason!",
'userips' => "!^$rxTimestamp,? $rxUser got IPs for".   "
$rxTarget on $rxWiki$rxReason!"
);

The first comma seems to be optional between some formats, so was easily
improved on.

The code is also using strtotime(), which isn't so good for these localised
formats "Parse about any English textual datetime description into a Unix
timestamp" - http://us1.php.net/strtotime


I'm guessing that the timestamp is in whatever format the person who did the
action has set in their preferences. Awesome, no?

There seems to be 10-20% of rows that won't be processed without at least some
manipulation of the code as it currently is

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 27807] Restore missing CheckUser logs

2014-01-08 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=27807

--- Comment #10 from Tim Starling  ---
The log was in /home/wikipedia/logs. That directory was repurposed for MW UDP
logs with automatic rotation, it's possible that the files were lost by the
automatic rotation script at around that time. I couldn't find any backup on
the server. However, I happen to have the relevant files on my hard drive, for
June 2005 to May 2007. Note that that range overlaps with the range that is
said to be in the database already, so duplicates will have to be removed
somehow.

I copied them up to /home/wikipedia/logs/norotate/checkuser

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 27807] Restore missing CheckUser logs

2013-12-20 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=27807

Trijnstel  changed:

   What|Removed |Added

 CC||tstarl...@wikimedia.org

--- Comment #9 from Trijnstel  ---
CCd Tim Starling, see
https://en.wikipedia.org/w/index.php?title=User_talk:Dominic&oldid=587005235#Old_CU_logs

Tim, do you remember this? Do you know how to obtain the missing logs?

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 27807] Restore missing CheckUser logs

2013-12-05 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=27807

Kunal Mehta (Legoktm)  changed:

   What|Removed |Added

   Keywords||shell

--- Comment #8 from Kunal Mehta (Legoktm)  ---
(In reply to comment #7)
> You can find some information on these old bugs:
> * https://bugzilla.wikimedia.org/show_bug.cgi?id=8710
> * https://bugzilla.wikimedia.org/show_bug.cgi?id=13789

Thanks. Turns out the code has already been written:
https://github.com/wikimedia/mediawiki-extensions-CheckUser/blob/master/importLog.php

A shell user will need to get the log file, and then run the import script.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 27807] Restore missing CheckUser logs

2013-12-05 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=27807

--- Comment #7 from Trijnstel  ---
(In reply to comment #6)
> I don't know the history behind the CU extension, so how did the global log
> work? Where should we restore the log entries to?

You can find some information on these old bugs:
* https://bugzilla.wikimedia.org/show_bug.cgi?id=8710
* https://bugzilla.wikimedia.org/show_bug.cgi?id=13789

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 27807] Restore missing CheckUser logs

2013-12-05 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=27807

Kunal Mehta (Legoktm)  changed:

   What|Removed |Added

   Priority|Normal  |Low
 CC||legoktm.wikipe...@gmail.com

--- Comment #6 from Kunal Mehta (Legoktm)  ---
(In reply to comment #3)
> I have a .log file (i.e., plain text in a text editor) with hundreds of lines
> like:
> 
> 23:35, June 13, 2006 Dmcdevit got IPs for Dmcdevit on enwiki
> 
> The main complication may be that this is from the days of the single global
> log, so there are also entries like:
> 
> 20:46, 1 lip 2005 Taw got IPs for [user]
> 
> Or it may be that however the local logs were originally created can also be
> applied to these log entries.

Do all the log entries state which wiki the check was run on? Your first
example does and the second doesn't.

I don't know the history behind the CU extension, so how did the global log
work? Where should we restore the log entries to?

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 27807] Restore missing CheckUser logs

2013-12-05 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=27807

Trijnstel  changed:

   What|Removed |Added

 Blocks||41492

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 27807] Restore missing CheckUser logs

2013-12-05 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=27807

Trijnstel  changed:

   What|Removed |Added

 CC||trijns...@hotmail.com

--- Comment #5 from Trijnstel  ---
(In reply to comment #4)
> Has this been completed or does it still need to be completed?

It still needs to be done. I always wondered why these logs were missing.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 27807] Restore missing CheckUser logs

2012-05-01 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=27807

Thehelpfulone  changed:

   What|Removed |Added

 CC||thehelpfulonew...@gmail.com

--- Comment #4 from Thehelpfulone  2012-05-01 
13:46:37 UTC ---
Has this been completed or does it still need to be completed?

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 27807] Restore missing CheckUser logs

2011-03-01 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=27807

--- Comment #3 from Dmcdevit  2011-03-01 20:27:27 UTC ---
I have a .log file (i.e., plain text in a text editor) with hundreds of lines
like:

23:35, June 13, 2006 Dmcdevit got IPs for Dmcdevit on enwiki

The main complication may be that this is from the days of the single global
log, so there are also entries like:

20:46, 1 lip 2005 Taw got IPs for [user]

Or it may be that however the local logs were originally created can also be
applied to these log entries.

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 27807] Restore missing CheckUser logs

2011-03-01 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=27807

--- Comment #2 from Reedy  2011-03-01 19:20:02 UTC ---
What's the format of the file?

I'm guessing it's gonna be very simple, string split comma's, and then just do
a database insert

Depending of course on how Tim generated that before

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 27807] Restore missing CheckUser logs

2011-03-01 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=27807

p858snake  changed:

   What|Removed |Added

 CC||p858sn...@gmail.com
  Component|CheckUser   |Site requests
Version|any |unspecified
 AssignedTo|jschulz_4...@msn.com|wikibugs-l@lists.wikimedia.
   ||org
Product|MediaWiki extensions|Wikimedia

--- Comment #1 from p858snake  2011-03-01 12:34:26 UTC ---
Not shelling yet, probably needs a maintaince script or something written
first.

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l