Re: [Wikitech-l] unexpected error info in HTML

2013-08-22 Thread Liangent
On Fri, Aug 23, 2013 at 8:33 AM, Jiang BIAN  wrote:

> Thanks for the link. But I think this is targeting the language variant
> related fix.
>

This is the root cause of that behavior you mentioned. (It only happens /
happened on zhwiki and maybe as well as some wikis with variants, right?)

-Liangent


>
> We actually observed stale cache in a wider range, see the bug entry:
> https://bugzilla.wikimedia.org/show_bug.cgi?id=46014
>
>
> On Thu, Aug 22, 2013 at 5:26 PM, Liangent  wrote:
>
> > On Fri, Aug 23, 2013 at 8:13 AM, Jiang BIAN 
> wrote:
> >
> > > We are actually crawling the HTML via bot, so the bug is not actually
> > fixed
> > > for non-login user, right?
> > >
> >
> > I can't think of a good way to fix the problem from this aspect besides
> > waiting for old cached page to expire, unless some sysadmin is happy to
> > nuke all existing Squid cached pages.
> >
> > However if you have a list of affected pages as you're crawling HTML,
> which
> > we don't have, you can simply purge them in batch and recrawl those
> pages.
> >
> >
> > > Could you share the bug's link?
> > >
> >
> > There was no bug created in bugzilla... I submitted a patch[1] directly
> to
> > fix the bug once it was spotted.
> >
> > [1] https://gerrit.wikimedia.org/r/#/c/76060/
> >
> > -Liangent
> >
> >
> > >
> > > Thanks
> > >
> > >
> > > On Thu, Aug 22, 2013 at 4:38 PM, Liangent  wrote:
> > >
> > > > On Fri, Aug 23, 2013 at 7:06 AM, Sumana Harihareswara <
> > > > suma...@wikimedia.org
> > > > > wrote:
> > > >
> > > > > On 08/01/2013 03:08 AM, Jiang BIAN  wrote:
> > > > > > Hi,
> > > > > >
> > > > > > I noticed some pages we crawled containing error message like
> this;
> > > > > >
> > > > > >  > > > > class="mw-content-ltr"> > > > > > class="error">Failed to render property P373:
> > > > > > Wikibase\LanguageWithConversion::factory: given languages do not
> > have
> > > > the
> > > > > > same parent language
> > > > > >
> > > > > >
> > > > > > But when I open the url in browser, there is no such message. And
> > > using
> > > > > > index.php can also get normal content without error messages.
> > > > > >
> > > > > > Here are examples you can retry:
> > > > > >
> > > > > > bad
> > > > > > $ wget 'http://zh.wikipedia.org/zh-cn/Google'
> > > > > >
> > > > > > good
> > > > > > $ wget 'http://zh.wikipedia.org/w/index.php?title=Google'
> > > > > >
> > > > > >
> > > > > > Looks like something is wrong on Wikipedia side, anything need to
> > > fix?
> > > > > >
> > > > > >
> > > > > >
> > > > > > Thanks
> > > > >
> > > > > I checked with Jiang Bian and found out that this is still
> happening
> > --
> > > > > can anyone help Google out here? :-)
> > > > >
> > > > > --
> > > > > Sumana Harihareswara
> > > > > Engineering Community Manager
> > > > > Wikimedia Foundation
> > > > >
> > > > > ___
> > > > > Wikitech-l mailing list
> > > > > Wikitech-l@lists.wikimedia.org
> > > > > https://lists.wikimedia.org/mailman/listinfo/wikitech-l
> > > > >
> > > >
> > > > There was a bug in some Wikibase version deployed in July which
> caused
> > > this
> > > > error, but a fix was backported soon and since then I've never seen
> any
> > > > similar error as a logged in user. If you still see some errors only
> > when
> > > > unlogged in at particular URLs (like what you described) now, it's
> > likely
> > > > that those URLs got cached in Squid when the bug was live... In this
> > case
> > > > purging those pages[1] should be able to fix the issue.
> > > >
> > > > [1] https://en.wikipedia.org/wiki/Wikipedia:Purge
> > > >
> > > > -Liangent
> > > > ___
> > > > Wikitech-l mailing list
> > > > Wikitech-l@lists.wikimedia.org
> > > > https://lists.wikimedia.org/mailman/listinfo/wikitech-l
> > > >
> > >
> > >
> > >
> > > --
> > > Jiang BIAN
> > >
> > > This email may be confidential or privileged.  If you received this
> > > communication by mistake, please don't forward it to anyone else,
> please
> > > erase all copies and attachments, and please let me know that it went
> to
> > > the wrong person.  Thanks.
> > > ___
> > > Wikitech-l mailing list
> > > Wikitech-l@lists.wikimedia.org
> > > https://lists.wikimedia.org/mailman/listinfo/wikitech-l
> > >
> > ___
> > Wikitech-l mailing list
> > Wikitech-l@lists.wikimedia.org
> > https://lists.wikimedia.org/mailman/listinfo/wikitech-l
> >
>
>
>
> --
> Jiang BIAN
>
> This email may be confidential or privileged.  If you received this
> communication by mistake, please don't forward it to anyone else, please
> erase all copies and attachments, and please let me know that it went to
> the wrong person.  Thanks.
> ___
> Wikitech-l mailing list
> Wikitech-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l
>
___
Wikitech-l mailing l

Re: [Wikitech-l] unexpected error info in HTML

2013-08-22 Thread Jiang BIAN
Thanks for the link. But I think this is targeting the language variant
related fix.

We actually observed stale cache in a wider range, see the bug entry:
https://bugzilla.wikimedia.org/show_bug.cgi?id=46014


On Thu, Aug 22, 2013 at 5:26 PM, Liangent  wrote:

> On Fri, Aug 23, 2013 at 8:13 AM, Jiang BIAN  wrote:
>
> > We are actually crawling the HTML via bot, so the bug is not actually
> fixed
> > for non-login user, right?
> >
>
> I can't think of a good way to fix the problem from this aspect besides
> waiting for old cached page to expire, unless some sysadmin is happy to
> nuke all existing Squid cached pages.
>
> However if you have a list of affected pages as you're crawling HTML, which
> we don't have, you can simply purge them in batch and recrawl those pages.
>
>
> > Could you share the bug's link?
> >
>
> There was no bug created in bugzilla... I submitted a patch[1] directly to
> fix the bug once it was spotted.
>
> [1] https://gerrit.wikimedia.org/r/#/c/76060/
>
> -Liangent
>
>
> >
> > Thanks
> >
> >
> > On Thu, Aug 22, 2013 at 4:38 PM, Liangent  wrote:
> >
> > > On Fri, Aug 23, 2013 at 7:06 AM, Sumana Harihareswara <
> > > suma...@wikimedia.org
> > > > wrote:
> > >
> > > > On 08/01/2013 03:08 AM, Jiang BIAN  wrote:
> > > > > Hi,
> > > > >
> > > > > I noticed some pages we crawled containing error message like this;
> > > > >
> > > > >  > > > class="mw-content-ltr"> > > > > class="error">Failed to render property P373:
> > > > > Wikibase\LanguageWithConversion::factory: given languages do not
> have
> > > the
> > > > > same parent language
> > > > >
> > > > >
> > > > > But when I open the url in browser, there is no such message. And
> > using
> > > > > index.php can also get normal content without error messages.
> > > > >
> > > > > Here are examples you can retry:
> > > > >
> > > > > bad
> > > > > $ wget 'http://zh.wikipedia.org/zh-cn/Google'
> > > > >
> > > > > good
> > > > > $ wget 'http://zh.wikipedia.org/w/index.php?title=Google'
> > > > >
> > > > >
> > > > > Looks like something is wrong on Wikipedia side, anything need to
> > fix?
> > > > >
> > > > >
> > > > >
> > > > > Thanks
> > > >
> > > > I checked with Jiang Bian and found out that this is still happening
> --
> > > > can anyone help Google out here? :-)
> > > >
> > > > --
> > > > Sumana Harihareswara
> > > > Engineering Community Manager
> > > > Wikimedia Foundation
> > > >
> > > > ___
> > > > Wikitech-l mailing list
> > > > Wikitech-l@lists.wikimedia.org
> > > > https://lists.wikimedia.org/mailman/listinfo/wikitech-l
> > > >
> > >
> > > There was a bug in some Wikibase version deployed in July which caused
> > this
> > > error, but a fix was backported soon and since then I've never seen any
> > > similar error as a logged in user. If you still see some errors only
> when
> > > unlogged in at particular URLs (like what you described) now, it's
> likely
> > > that those URLs got cached in Squid when the bug was live... In this
> case
> > > purging those pages[1] should be able to fix the issue.
> > >
> > > [1] https://en.wikipedia.org/wiki/Wikipedia:Purge
> > >
> > > -Liangent
> > > ___
> > > Wikitech-l mailing list
> > > Wikitech-l@lists.wikimedia.org
> > > https://lists.wikimedia.org/mailman/listinfo/wikitech-l
> > >
> >
> >
> >
> > --
> > Jiang BIAN
> >
> > This email may be confidential or privileged.  If you received this
> > communication by mistake, please don't forward it to anyone else, please
> > erase all copies and attachments, and please let me know that it went to
> > the wrong person.  Thanks.
> > ___
> > Wikitech-l mailing list
> > Wikitech-l@lists.wikimedia.org
> > https://lists.wikimedia.org/mailman/listinfo/wikitech-l
> >
> ___
> Wikitech-l mailing list
> Wikitech-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l
>



-- 
Jiang BIAN

This email may be confidential or privileged.  If you received this
communication by mistake, please don't forward it to anyone else, please
erase all copies and attachments, and please let me know that it went to
the wrong person.  Thanks.
___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Re: [Wikitech-l] unexpected error info in HTML

2013-08-22 Thread Liangent
On Fri, Aug 23, 2013 at 8:13 AM, Jiang BIAN  wrote:

> We are actually crawling the HTML via bot, so the bug is not actually fixed
> for non-login user, right?
>

I can't think of a good way to fix the problem from this aspect besides
waiting for old cached page to expire, unless some sysadmin is happy to
nuke all existing Squid cached pages.

However if you have a list of affected pages as you're crawling HTML, which
we don't have, you can simply purge them in batch and recrawl those pages.


> Could you share the bug's link?
>

There was no bug created in bugzilla... I submitted a patch[1] directly to
fix the bug once it was spotted.

[1] https://gerrit.wikimedia.org/r/#/c/76060/

-Liangent


>
> Thanks
>
>
> On Thu, Aug 22, 2013 at 4:38 PM, Liangent  wrote:
>
> > On Fri, Aug 23, 2013 at 7:06 AM, Sumana Harihareswara <
> > suma...@wikimedia.org
> > > wrote:
> >
> > > On 08/01/2013 03:08 AM, Jiang BIAN  wrote:
> > > > Hi,
> > > >
> > > > I noticed some pages we crawled containing error message like this;
> > > >
> > > >  > > class="mw-content-ltr"> > > > class="error">Failed to render property P373:
> > > > Wikibase\LanguageWithConversion::factory: given languages do not have
> > the
> > > > same parent language
> > > >
> > > >
> > > > But when I open the url in browser, there is no such message. And
> using
> > > > index.php can also get normal content without error messages.
> > > >
> > > > Here are examples you can retry:
> > > >
> > > > bad
> > > > $ wget 'http://zh.wikipedia.org/zh-cn/Google'
> > > >
> > > > good
> > > > $ wget 'http://zh.wikipedia.org/w/index.php?title=Google'
> > > >
> > > >
> > > > Looks like something is wrong on Wikipedia side, anything need to
> fix?
> > > >
> > > >
> > > >
> > > > Thanks
> > >
> > > I checked with Jiang Bian and found out that this is still happening --
> > > can anyone help Google out here? :-)
> > >
> > > --
> > > Sumana Harihareswara
> > > Engineering Community Manager
> > > Wikimedia Foundation
> > >
> > > ___
> > > Wikitech-l mailing list
> > > Wikitech-l@lists.wikimedia.org
> > > https://lists.wikimedia.org/mailman/listinfo/wikitech-l
> > >
> >
> > There was a bug in some Wikibase version deployed in July which caused
> this
> > error, but a fix was backported soon and since then I've never seen any
> > similar error as a logged in user. If you still see some errors only when
> > unlogged in at particular URLs (like what you described) now, it's likely
> > that those URLs got cached in Squid when the bug was live... In this case
> > purging those pages[1] should be able to fix the issue.
> >
> > [1] https://en.wikipedia.org/wiki/Wikipedia:Purge
> >
> > -Liangent
> > ___
> > Wikitech-l mailing list
> > Wikitech-l@lists.wikimedia.org
> > https://lists.wikimedia.org/mailman/listinfo/wikitech-l
> >
>
>
>
> --
> Jiang BIAN
>
> This email may be confidential or privileged.  If you received this
> communication by mistake, please don't forward it to anyone else, please
> erase all copies and attachments, and please let me know that it went to
> the wrong person.  Thanks.
> ___
> Wikitech-l mailing list
> Wikitech-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l
>
___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Re: [Wikitech-l] unexpected error info in HTML

2013-08-22 Thread Jiang BIAN
We are actually crawling the HTML via bot, so the bug is not actually fixed
for non-login user, right?

Could you share the bug's link?

Thanks


On Thu, Aug 22, 2013 at 4:38 PM, Liangent  wrote:

> On Fri, Aug 23, 2013 at 7:06 AM, Sumana Harihareswara <
> suma...@wikimedia.org
> > wrote:
>
> > On 08/01/2013 03:08 AM, Jiang BIAN  wrote:
> > > Hi,
> > >
> > > I noticed some pages we crawled containing error message like this;
> > >
> > >  > class="mw-content-ltr"> > > class="error">Failed to render property P373:
> > > Wikibase\LanguageWithConversion::factory: given languages do not have
> the
> > > same parent language
> > >
> > >
> > > But when I open the url in browser, there is no such message. And using
> > > index.php can also get normal content without error messages.
> > >
> > > Here are examples you can retry:
> > >
> > > bad
> > > $ wget 'http://zh.wikipedia.org/zh-cn/Google'
> > >
> > > good
> > > $ wget 'http://zh.wikipedia.org/w/index.php?title=Google'
> > >
> > >
> > > Looks like something is wrong on Wikipedia side, anything need to fix?
> > >
> > >
> > >
> > > Thanks
> >
> > I checked with Jiang Bian and found out that this is still happening --
> > can anyone help Google out here? :-)
> >
> > --
> > Sumana Harihareswara
> > Engineering Community Manager
> > Wikimedia Foundation
> >
> > ___
> > Wikitech-l mailing list
> > Wikitech-l@lists.wikimedia.org
> > https://lists.wikimedia.org/mailman/listinfo/wikitech-l
> >
>
> There was a bug in some Wikibase version deployed in July which caused this
> error, but a fix was backported soon and since then I've never seen any
> similar error as a logged in user. If you still see some errors only when
> unlogged in at particular URLs (like what you described) now, it's likely
> that those URLs got cached in Squid when the bug was live... In this case
> purging those pages[1] should be able to fix the issue.
>
> [1] https://en.wikipedia.org/wiki/Wikipedia:Purge
>
> -Liangent
> ___
> Wikitech-l mailing list
> Wikitech-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l
>



-- 
Jiang BIAN

This email may be confidential or privileged.  If you received this
communication by mistake, please don't forward it to anyone else, please
erase all copies and attachments, and please let me know that it went to
the wrong person.  Thanks.
___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Re: [Wikitech-l] unexpected error info in HTML

2013-08-22 Thread Liangent
On Fri, Aug 23, 2013 at 7:06 AM, Sumana Harihareswara  wrote:

> On 08/01/2013 03:08 AM, Jiang BIAN  wrote:
> > Hi,
> >
> > I noticed some pages we crawled containing error message like this;
> >
> >  class="mw-content-ltr"> > class="error">Failed to render property P373:
> > Wikibase\LanguageWithConversion::factory: given languages do not have the
> > same parent language
> >
> >
> > But when I open the url in browser, there is no such message. And using
> > index.php can also get normal content without error messages.
> >
> > Here are examples you can retry:
> >
> > bad
> > $ wget 'http://zh.wikipedia.org/zh-cn/Google'
> >
> > good
> > $ wget 'http://zh.wikipedia.org/w/index.php?title=Google'
> >
> >
> > Looks like something is wrong on Wikipedia side, anything need to fix?
> >
> >
> >
> > Thanks
>
> I checked with Jiang Bian and found out that this is still happening --
> can anyone help Google out here? :-)
>
> --
> Sumana Harihareswara
> Engineering Community Manager
> Wikimedia Foundation
>
> ___
> Wikitech-l mailing list
> Wikitech-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l
>

There was a bug in some Wikibase version deployed in July which caused this
error, but a fix was backported soon and since then I've never seen any
similar error as a logged in user. If you still see some errors only when
unlogged in at particular URLs (like what you described) now, it's likely
that those URLs got cached in Squid when the bug was live... In this case
purging those pages[1] should be able to fix the issue.

[1] https://en.wikipedia.org/wiki/Wikipedia:Purge

-Liangent
___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Re: [Wikitech-l] unexpected error info in HTML

2013-08-22 Thread Sumana Harihareswara
On 08/01/2013 03:08 AM, Jiang BIAN  wrote:
> Hi,
> 
> I noticed some pages we crawled containing error message like this;
> 
>  class="error">Failed to render property P373:
> Wikibase\LanguageWithConversion::factory: given languages do not have the
> same parent language
> 
> 
> But when I open the url in browser, there is no such message. And using
> index.php can also get normal content without error messages.
> 
> Here are examples you can retry:
> 
> bad
> $ wget 'http://zh.wikipedia.org/zh-cn/Google'
> 
> good
> $ wget 'http://zh.wikipedia.org/w/index.php?title=Google'
> 
> 
> Looks like something is wrong on Wikipedia side, anything need to fix?
> 
> 
> 
> Thanks

I checked with Jiang Bian and found out that this is still happening --
can anyone help Google out here? :-)

-- 
Sumana Harihareswara
Engineering Community Manager
Wikimedia Foundation

___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

[Wikitech-l] unexpected error info in HTML

2013-08-01 Thread Jiang BIAN
Hi,

I noticed some pages we crawled containing error message like this;

Failed to render property P373:
Wikibase\LanguageWithConversion::factory: given languages do not have the
same parent language


But when I open the url in browser, there is no such message. And using
index.php can also get normal content without error messages.

Here are examples you can retry:

bad
$ wget 'http://zh.wikipedia.org/zh-cn/Google'

good
$ wget 'http://zh.wikipedia.org/w/index.php?title=Google'


Looks like something is wrong on Wikipedia side, anything need to fix?



Thanks

-- 
Jiang BIAN

This email may be confidential or privileged.  If you received this
communication by mistake, please don't forward it to anyone else, please
erase all copies and attachments, and please let me know that it went to
the wrong person.  Thanks.
___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

[Wikitech-l] unexpected error info in HTML

2013-07-31 Thread Jiang BIAN
Hi,

I noticed some pages we crawled containing error message like this;

Failed to render property P373:
Wikibase\LanguageWithConversion::factory: given languages do not have the
same parent language


But when I open the url in browser, there is no such message. And using
index.php can also get normal content without error messages.

Here are examples you can retry:

bad
$ wget 'http://zh.wikipedia.org/zh-cn/Google'

good
$ wget 'http://zh.wikipedia.org/w/index.php?title=Google'


Looks like something is wrong on Wikipedia side, anything need to fix?



Thanks


-- 
Jiang BIAN

This email may be confidential or privileged.  If you received this
communication by mistake, please don't forward it to anyone else, please
erase all copies and attachments, and please let me know that it went to
the wrong person.  Thanks.
___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l