EBernhardson added a comment.
Looks like it's hitting this:
https://gerrit.wikimedia.org/r/c/search/highlighter/+/435282/5/experimental-highlighter-lucene/src/main/java/org/wikimedia/highlighter/experimental/lucene/hit/AutomatonHitEnum.java#189
EBernhardson added a comment.
Hmm, the request with only 222 items is interesting, will look at that
TASK DETAIL
https://phabricator.wikimedia.org/T239004
To: EBernhardson
Cc: EBernhardson, Aklapper,
EBernhardson added a comment.
While it doesn't say timeout, the request spins for some time and eventually
the backend gives up and reports a failure. As I said, the error messages shown
to users could be improved, but the request would still fail.
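
For illustration only (this is not the CirrusSearch or highlighter code): a minimal Python sketch of a regex scan that gives up once a time budget is spent, which is roughly the behaviour described above. The names `scan_titles`, `ScanTimedOut`, `budget_seconds` and `check_every` are hypothetical, and unanchored matching is an assumption.

```python
import re
import time


class ScanTimedOut(Exception):
    """Raised when the scan exceeds its time budget (hypothetical name)."""


def scan_titles(titles, pattern, budget_seconds, check_every=1000):
    """Return the titles matching `pattern`, or raise ScanTimedOut.

    Assumption: the regex is unanchored, so it may match anywhere in a title.
    """
    regex = re.compile(pattern)
    deadline = time.monotonic() + budget_seconds
    matches = []
    for i, title in enumerate(titles):
        # Reading the clock for every title is wasteful, so only check
        # the deadline once every `check_every` titles.
        if i % check_every == 0 and time.monotonic() > deadline:
            raise ScanTimedOut(f"gave up after {i} titles")
        if regex.search(title):
            matches.append(title)
    return matches
```

With a small list the scan finishes well inside the budget. With tens of millions of titles the deadline is hit first and the caller only sees a failure, whatever the wording of the error message.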
Bugreporter added a comment.
This does not time out, but the number of results is also wrong:
https://www.wikidata.org/w/index.php?sort=relevance=haswbstatement%3AP31%3DQ6256+intitle%3A%2F%5Ba-z%5D*%2F=Special%3ASearch=advanced=1=%7B%7D=1=1=1
Actually there are 222 items with `P31=Q6256`,
EBernhardson added a comment.
The request basically asks to run a regex against 76M titles and fails with
timeouts. While the error messages could be improved, this is such a niche
thing that I don't think it's particularly important. This happens to fail the
timeout in a different way than