https://bugzilla.wikimedia.org/show_bug.cgi?id=65217

--- Comment #5 from Bawolff (Brian Wolff) <[email protected]> ---
I was reading over the thread on the multimedia list. I'm not entirely sure the
Special:NewFiles theory makes sense; I think it's more likely that someone
viewed a category of the tiff uploads from gwtoolset or something like that.

So we have this graph of April 21, with a peak from about 2:55 to 3:20 UTC:
http://lists.wikimedia.org/pipermail/multimedia/attachments/20140420/35015082/attachment-0001.png

However, when you look at the uploads from around that time, the peak in large
tiff uploads does not correspond with the peak in the graph:

MariaDB [commonswiki_p]> select
    ->   substring(img_timestamp, 9, 3) "time",
    ->   count(*) "# images",
    ->   round(max(img_width*img_height/1000000)) "max Mpx",
    ->   round(avg(img_width*img_height/1000000)) "avg mpx",
    ->   round(avg(img_size/(1024*1024))) "avg MB",
    ->   round(sum(img_size/(1024*1024))) "total mb",
    ->   round(max(img_size/(1024*1024))) "max mb"
    -> from image
    -> where img_timestamp > '20140421010000'
    ->   and img_timestamp < '20140421050000'
    ->   and img_minor_mime = 'tiff'
    ->   and img_user_text = 'Fæ'
    -> group by substring(img_timestamp, 1, 11);
+------+----------+---------+---------+--------+----------+--------+
| time | # images | max Mpx | avg mpx | avg MB | total mb | max mb |
+------+----------+---------+---------+--------+----------+--------+
| 010  |       40 |      60 |      42 |    121 |     4822 |    172 |
| 011  |       40 |      39 |      39 |    110 |     4409 |    112 |
| 012  |       19 |      60 |      42 |    120 |     2280 |    172 |
| 013  |       37 |      60 |      60 |    171 |     6328 |    173 |
| 014  |       17 |      60 |      60 |    172 |     2916 |    173 |
| 015  |       20 |      60 |      60 |    171 |     3427 |    173 |
| 020  |       35 |      60 |      60 |    171 |     5986 |    173 |
| 021  |       15 |      60 |      60 |    170 |     2555 |    172 |
| 022  |       26 |      60 |      60 |    172 |     4463 |    173 |
| 023  |       18 |      60 |      60 |    171 |     3079 |    173 |
| 030  |        6 |      60 |      59 |    170 |     1018 |    173 |
| 032  |        5 |      60 |      60 |    171 |      857 |    173 |
| 033  |        2 |      60 |      60 |    172 |      343 |    173 |
+------+----------+---------+---------+--------+----------+--------+
13 rows in set (0.01 sec)
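
For anyone reading the "time" column: it is characters 9-11 of the 14-digit
img_timestamp (YYYYMMDDHHMMSS), i.e. the hour plus the tens digit of the
minute, so each row is a 10-minute bucket ("025" means 02:50-02:59). A rough
Python sketch of that bucketing, for illustration only; the helper names and
the sample timestamps are made up and are not from the database:

```python
def bucket(img_timestamp: str) -> str:
    """Mimic SQL substring(img_timestamp, 9, 3): hour + tens digit of the minute."""
    # SQL substring() is 1-indexed, so position 9 with length 3 is Python's [8:11].
    return img_timestamp[8:11]


def bucket_label(b: str) -> str:
    """Turn a bucket such as '025' into a readable range such as '02:50-02:59'."""
    hour, tens = b[:2], b[2]
    return f"{hour}:{tens}0-{hour}:{tens}9"


# Hypothetical timestamps in the YYYYMMDDHHMMSS format.
for ts in ("20140421010312", "20140421025959", "20140421030000"):
    b = bucket(ts)
    print(ts, b, bucket_label(b))
```

A bucket with no matching uploads simply has no row, which is why "025" and
"031" are absent from the output.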

That is, between 2:50 and 3:20 a total of 6 tiff files were uploaded by Fæ
with gwtoolset (out of 141 total uploads in that time period, about 4.3%),
compared to, say, 1:00-1:30, which didn't have a spike but had 99 tiff files
uploaded by Fæ (out of 373 total, about 27%). If the spike was caused by
people viewing Special:NewFiles, I would expect it to come when the 99 tiffs
were uploaded, not when the 6 tiffs were uploaded.
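
The arithmetic behind those two comparisons can be checked with a small
Python sketch. The per-bucket counts are copied from the "# images" column of
the query output; the 141 and 373 totals are the figures quoted in this
comment (the query that produced them isn't shown), and the variable and
function names are just for illustration:

```python
# "# images" per 10-minute bucket, copied from the query output.
tiff_uploads = {
    "010": 40, "011": 40, "012": 19, "013": 37, "014": 17, "015": 20,
    "020": 35, "021": 15, "022": 26, "023": 18, "030": 6, "032": 5, "033": 2,
}

# Total uploads (all users, all file types) per window, as quoted in the text.
total_uploads = {"01:00-01:30": 373, "02:50-03:20": 141}

# The 10-minute buckets that make up each 30-minute window.
windows = {
    "01:00-01:30": ("010", "011", "012"),
    "02:50-03:20": ("025", "030", "031"),
}


def tiff_count(buckets):
    # Buckets with no row in the query output had zero matching uploads.
    return sum(tiff_uploads.get(b, 0) for b in buckets)


for name, buckets in windows.items():
    n = tiff_count(buckets)
    share = 100 * n / total_uploads[name]
    print(f"{name}: {n} tiffs out of {total_uploads[name]} uploads ({share:.1f}%)")
```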

Which leads me to suspect the issue was not people viewing Special:NewFiles a
lot, but maybe people viewing something else that had a lot of uncached
thumbnail hits associated with it. Maybe the category for the batch upload,
which would show up to 200 images (probably a lot of them over
$wgMaxImageArea, so triggering what I mentioned in comment 4, and the rest
possibly never viewed before), was viewed by several people at the same time.
[[Commons:Category:NYPL maps (over 50 megapixels)]] was linked from the VP at
the time (although it had been for about a day); maybe somebody just hit
reload on that page repeatedly for some unknown reason and that overloaded
things. Or something.

With all that said, I guess even if it wasn't Special:NewFiles, it probably
doesn't change much, as it's still related to on-demand thumbnailing.
