AKhatun_WMF added a comment.
- For June, the average daily successful parsing rate was **~85%**, ranging
from 75% to 90%. Note that this only includes queries with status 200 and 500.
- 11% of the distinct queries ran into errors related to prefixes. The number
of distinct queries affected by each prefix is shown below. After adding the
first 4 prefixes (mwapi, geof, foaf, gas) to the query processors' prefix list,
the average daily successful parsing rate rose to >96%. A few prefixes were
slightly off (data instead of wdata, ref instead of wdref). These account for
very few queries, but I fixed them nevertheless.
| **prefix_name** | **count** |
| mwapi | 7419357 |
| geof | 54183 |
| foaf | 17198 |
| gas | 13753 |
| wds | 2761 |
| wdv | 216 |
| fn | 62 |
| dc | 50 |
| mediawiki | 23 |
| wdref | 22 |
| wdata | 3 |
Total distinct queries: 67467327
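
As a quick cross-check of the 11% figure, the counts in the table can be summed and divided by the total number of distinct queries. This is only a sketch: it assumes each distinct query is counted under a single prefix, which the numbers above do not confirm.

```python
# Counts copied from the table above.
prefix_counts = {
    "mwapi": 7419357,
    "geof": 54183,
    "foaf": 17198,
    "gas": 13753,
    "wds": 2761,
    "wdv": 216,
    "fn": 62,
    "dc": 50,
    "mediawiki": 23,
    "wdref": 22,
    "wdata": 3,
}
total_distinct = 67467327
first_four = ("mwapi", "geof", "foaf", "gas")

# Share of distinct queries hitting any prefix error
# (assumes no query is counted under more than one prefix).
share_all = sum(prefix_counts.values()) / total_distinct

# Share covered by the four prefixes added to the prefix list.
share_first_four = sum(prefix_counts[p] for p in first_four) / total_distinct

print(f"all prefixes: {share_all:.2%}")
print(f"first four:   {share_first_four:.2%}")
```

Under that assumption, the first four prefixes cover nearly all of the prefix-related failures, which matches the jump from ~85% to >96%.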
- Other errors included:
  - `Variable used when already in-scope`. This happens when the same
    variable is reused in a query. Running such queries in WDQS returns results
    without errors. These form 2% of the errors in distinct queries.
  - Another notable error involves the `WITH` clause. Although it runs fine in
    WDQS, the parser doesn't accept it. These form 2.5% of the distinct queries.
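
One possible explanation for the `WITH` failures, which this analysis has not verified, is that these queries use Blazegraph's named-subquery extension (`WITH { ... } AS %name` together with `INCLUDE %name`). That syntax is not part of standard SPARQL, so a standard parser would be expected to reject it. A rough heuristic to flag such queries before parsing could look like this; the function name and the sample query are hypothetical:

```python
import re

# Assumption: the failing queries use Blazegraph named subqueries.
# Standard SPARQL Update's `WITH <iri>` is followed by an IRI, not `{`,
# so `WITH {` is used here as the distinguishing pattern.
NAMED_SUBQUERY = re.compile(r"\bWITH\s*\{", re.IGNORECASE)
INCLUDE = re.compile(r"\bINCLUDE\s+%\w+", re.IGNORECASE)


def uses_named_subquery(query: str) -> bool:
    """Heuristically detect Blazegraph named-subquery syntax.

    This is a plain text match, so string literals or comments that
    contain the same pattern would be flagged as well.
    """
    return bool(NAMED_SUBQUERY.search(query) or INCLUDE.search(query))


example = (
    "SELECT ?item WITH { SELECT ?item WHERE { ?item wdt:P31 wd:Q5 } LIMIT 10 } "
    "AS %people WHERE { INCLUDE %people }"
)
print(uses_named_subquery(example))
```

Counting how many of the 2.5% match this pattern would show whether the extension syntax accounts for them.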
It seems that including the prefixes should fix most of the failures, but
should we also consider fixing the other two errors, even though they are small
in number? I am not sure why Jena cannot parse them, though.
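
To illustrate the prefix fix, here is a minimal sketch that prepends a declaration for any of the four prefixes that a query uses but does not declare. It is not the query processor's actual code. The function name is hypothetical, and the IRIs are the ones I understand WDQS to use for these prefixes, so they should be checked against the real prefix list before use.

```python
import re

# Assumed IRIs for the four prefixes; verify against the WDQS prefix list.
EXTRA_PREFIXES = {
    "mwapi": "https://www.mediawiki.org/ontology#API/",
    "geof": "http://www.opengis.net/def/geosparql/function/",
    "foaf": "http://xmlns.com/foaf/0.1/",
    "gas": "http://www.bigdata.com/rdf/gas#",
}


def add_missing_prefixes(query, prefixes=EXTRA_PREFIXES):
    """Prepend PREFIX lines for prefixes that are used but not declared."""
    header = []
    for name, iri in prefixes.items():
        escaped = re.escape(name)
        declared = re.search(rf"\bPREFIX\s+{escaped}:", query, re.IGNORECASE)
        # Match `name:` only when it is not part of a longer token.
        used = re.search(rf"(?<![\w:]){escaped}:", query)
        if used and not declared:
            header.append(f"PREFIX {name}: <{iri}>")
    return "\n".join(header + [query])


query = "SELECT ?n WHERE { ?p foaf:name ?n }"
print(add_missing_prefixes(query))
```

Queries that already declare the prefix, or do not use it at all, are returned unchanged.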
TASK DETAIL
https://phabricator.wikimedia.org/T285465