GoranSMilovanovic added a comment.
Update `Mon 27 Apr 2020 10:10:23 PM UTC`: **Final reports** - Here goes the **Part A** of the Final Report which encompasses the Exploratory Data Analysis (EDA) only, encompassing: (1) the characteristics of the sample of SPARQL queries used in this study, (2) the overview of the number of queries run per (a) day of week, (b) hour of day, (c) WMF Datacenter/Host, (d) HTTP method of request, (e) server HTTP response code, and (f) the desired output format, (3) the mean and median WDQS query processing times across the mentioned (a) - (f) variables, and (4) the distributions of WDQS processing times across WMF Datacenter/Hosts and output format. F31783509: WDQS Endpoint Analytics_20200427_A.nb.html <https://phabricator.wikimedia.org/F31783509> **Summary** - The `eqiad` data center is receiving tons of queries in comparison to `codfw`. - The `XML` output format seems to take much more to process in comparison to `JSON` and `text/plain` (except for we really have only few observations of `text/plain` in the sample). - The distributions of the WDQS processing time across the crucial variables (WMF Datacenter/Host, Output format) are highly skewed towards short processing times - so we really need to focus on the outliers seriously (as already did in the ML approach). **Next:** - share the dataset of most frequently observed SPARQL queries at the WDQS endpoint; - share Part B of the Report: optimizing the WDQS processing times w. XGBoost and features parsed from SPARQL (nothing new, all covered in our meetings with thephp.cc, just a wrap-up). TASK DETAIL https://phabricator.wikimedia.org/T248308 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GoranSMilovanovic Cc: MGerlach, JAllemandou, Lucas_Werkmeister_WMDE, Simon_Villeneuve, dcausse, Jakob_WMDE, Gehel, Addshore, Lydia_Pintscher, WMDE-leszek, Aklapper, darthmon_wmde, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331
_______________________________________________ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs