https://bugzilla.wikimedia.org/show_bug.cgi?id=45178
Diederik van Liere <[email protected]> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|UNCONFIRMED |ASSIGNED Ever confirmed|0 |1 --- Comment #1 from Diederik van Liere <[email protected]> --- Hey Andrew, Thanks for reaching out! Yes you are right, there are a couple of 1000's titles that have spaces in the titles and this indeed happened after the tab introduction but in an unexpected way. Prior to the tab introduction, the title of the page would be truncated (because we used space as a delimiter) and so incorrect / incomplete titles would show up in the dumps data. Now, with the introduction of the space we really surfaced this bug. The space is introduced because under very rare conditions, the Nginx server does not encode the space as %20; so far I have only see this happening if the request comes from Googlebot, and the server response is 301 (Moved Permanently). We tried to replicate the conditions so we could fix our Nginx server configuration but we have not yet been able to do so. We could add a function in webstatscollector (the software that generates the data) to replace those spaces with %20 but I am worried that this will introduce performance regressions. My plan is: 1) We will test webstatscollector with a replace function, if this all works, great! problem solved. 2) If the replace function introduces a performance regression then I will mark this bug as WONTFIX. Rest assured, it affects only a really really small set of articles and those views are not real views in the first place as they come from Googlebot. -- You are receiving this mail because: You are on the CC list for the bug. You are the assignee for the bug. You are watching all bug changes. _______________________________________________ Wikibugs-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
