https://bugzilla.wikimedia.org/show_bug.cgi?id=45178

Diederik van Liere <[email protected]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |ASSIGNED
     Ever confirmed|0                           |1

--- Comment #1 from Diederik van Liere <[email protected]> ---
Hey Andrew,

Thanks for reaching out! Yes you are right, there are a couple of 1000's titles
that have spaces in the titles and this indeed happened after the tab
introduction but in an unexpected way.

Prior to the tab introduction, the title of the page would be truncated
(because we used space as a delimiter) and so incorrect / incomplete titles
would show up in the dumps data. Now, with the introduction of the space we
really surfaced this bug. 

The space is introduced because under very rare conditions, the Nginx server
does not encode the space as %20; so far I have only see this happening if the
request comes from Googlebot, and the server response is 301 (Moved
Permanently).

We tried to replicate the conditions so we could fix our Nginx server
configuration but we have not yet been able to do so. We could add a function
in webstatscollector (the software that generates the data) to replace those
spaces with %20 but I am worried that this will introduce performance
regressions. 

My plan is:
1) We will test webstatscollector with a replace function, if this all works,
great! problem solved.
2) If the replace function introduces a performance regression then I will mark
this bug as WONTFIX. 

Rest assured, it affects only a really really small set of articles and those
views are not real views in the first place as they come from Googlebot.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
You are the assignee for the bug.
You are watching all bug changes.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to