GoranSMilovanovic added a comment.
Update `Mon 27 Apr 2020 10:31:05 PM UTC`: **The most frequently observed SPARQL queries dataset** - Selection criteria: the query was observed >= 50 times in the WDQS endpoint sample (approx. `1M` queries, `2020/04/01` - `2020/04/21`). - For each query we report the mean WDQS processing time, the median WDQS processing time, and the standard deviation of processing time; - the dataset is sorted in descending order of mean WDQS processing time; - the `Percent` column stands for the `%` of the total number of queries in the sample represented by the respective (repeatedly observed) query and does not sum up to `100%` because, again, we report only on the queries that were observed 50 or more times on the endpoint. Here goes the dataset: F31783527: repeatedQueries_Filter50.csv <https://phabricator.wikimedia.org/F31783527> Columns: - `uniqueSparqlId` - the unique ID of the query - never mind, I need it for some join operations on data frames; - `sparql` - the query itself - `Num_Observations` - how many times was the query observed in the sample; - `mean_query_time` - the mean WDQS processing time for this query - `median_query_time` - the median WDQS processing time for this query - `stdev_query_time` - the standard deviation of the WDQS processing time for this query - `Percent` - explained above. TASK DETAIL https://phabricator.wikimedia.org/T248308 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GoranSMilovanovic Cc: MGerlach, JAllemandou, Lucas_Werkmeister_WMDE, Simon_Villeneuve, dcausse, Jakob_WMDE, Gehel, Addshore, Lydia_Pintscher, WMDE-leszek, Aklapper, darthmon_wmde, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331
_______________________________________________ Wikidata-bugs mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
