[Wikidata-bugs] [Maniphest] [Commented On] T248308: Analyse a small sample of the most often used query patterns on WDQS

GoranSMilovanovic Mon, 27 Apr 2020 15:38:55 -0700

GoranSMilovanovic added a comment.


  Update `Mon 27 Apr 2020 10:31:05 PM UTC`:
  
  **The most frequently observed SPARQL queries dataset**
  
  - Selection criteria: the query was observed >= 50 times in the WDQS endpoint 
sample (approx. `1M` queries, `2020/04/01` - `2020/04/21`).
  - For each query we report the mean WDQS processing time, the median WDQS 
processing time, and the standard deviation of processing time;
  - the dataset is sorted in descending order of mean WDQS processing time;
  - the `Percent` column stands for the `%` of the total number of queries in 
the sample represented by the respective (repeatedly observed) query and does 
not sum up to `100%` because, again, we report only on the queries that were 
observed 50 or more times on the endpoint.
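
  A minimal sketch of how a summary like this could be computed from raw (query, processing time) observations. This is not the code used to produce the dataset: the function name `summarise_repeated_queries`, the input shape, and the use of the sample standard deviation are all assumptions made for illustration.

```python
from collections import defaultdict
from statistics import mean, median, stdev


def summarise_repeated_queries(observations, min_observations=50):
    """Summarise repeatedly observed queries.

    `observations` is an iterable of (sparql, processing_time) pairs; this
    input shape is an assumption, not the format of the actual WDQS sample.
    """
    times_by_query = defaultdict(list)
    total = 0
    for sparql, query_time in observations:
        times_by_query[sparql].append(query_time)
        total += 1

    rows = []
    for sparql, times in times_by_query.items():
        # Selection criterion: keep queries observed >= min_observations times.
        if len(times) < min_observations:
            continue
        rows.append({
            "sparql": sparql,
            "Num_Observations": len(times),
            "mean_query_time": mean(times),
            "median_query_time": median(times),
            # Sample standard deviation (assumption; needs >= 2 data points).
            "stdev_query_time": stdev(times) if len(times) > 1 else 0.0,
            # Share of ALL queries in the sample, not only the retained ones,
            # so the column does not sum to 100 after filtering.
            "Percent": 100.0 * len(times) / total,
        })

    # Sort in descending order of mean processing time.
    rows.sort(key=lambda row: row["mean_query_time"], reverse=True)
    return rows
```

  Because `Percent` is computed against the full sample size before filtering, dropping the rarely observed queries leaves the column summing to less than `100%`, as described above.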
  
  Here is the dataset:
  
  F31783527: repeatedQueries_Filter50.csv 
<https://phabricator.wikimedia.org/F31783527>
  
  Columns:
  
  - `uniqueSparqlId` - the unique ID of the query; it can be ignored, I only 
need it for some join operations on data frames;
  - `sparql` - the query itself
  - `Num_Observations` - how many times the query was observed in the sample;
  - `mean_query_time` - the mean WDQS processing time for this query
  - `median_query_time` - the median WDQS processing time for this query
  - `stdev_query_time` - the standard deviation of the WDQS processing time 
for this query
  - `Percent` - explained above.
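
  A minimal sketch of reading the file with these columns using Python's standard `csv` module. It assumes the CSV has a header row with exactly the column names listed above, is comma-delimited, and stores `Num_Observations` as a plain integer; none of this has been checked against the actual file, and the helper name `read_repeated_queries` is hypothetical.

```python
import csv

# Columns assumed to hold numbers, with the type each is converted to.
NUMERIC_COLUMNS = {
    "Num_Observations": int,
    "mean_query_time": float,
    "median_query_time": float,
    "stdev_query_time": float,
    "Percent": float,
}


def read_repeated_queries(handle):
    """Read rows from an open text handle, converting numeric columns.

    `uniqueSparqlId` and `sparql` are left as strings.
    """
    rows = []
    for record in csv.DictReader(handle):
        for column, cast in NUMERIC_COLUMNS.items():
            record[column] = cast(record[column])
        rows.append(record)
    return rows
```

  With a local copy of the file, this could be called as `read_repeated_queries(open("repeatedQueries_Filter50.csv", newline="", encoding="utf-8"))`; the UTF-8 encoding is also an assumption.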

TASK DETAIL
  https://phabricator.wikimedia.org/T248308

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: GoranSMilovanovic
Cc: MGerlach, JAllemandou, Lucas_Werkmeister_WMDE, Simon_Villeneuve, dcausse, 
Jakob_WMDE, Gehel, Addshore, Lydia_Pintscher, WMDE-leszek, Aklapper, 
darthmon_wmde, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, EBjune, merbst, 
LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, 
Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331

_______________________________________________
Wikidata-bugs mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
