[Wikidata-bugs] [Maniphest] T261849: Benchmark old and new model accuracy on new labeled data

2020-09-20 Thread Lydia_Pintscher
Lydia_Pintscher closed this task as "Resolved". Lydia_Pintscher added a comment. \o/ TASK DETAIL https://phabricator.wikimedia.org/T261849 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Ladsgroup, Lydia_Pintscher Cc: GoranSMilovanovic, Aklapper,

[Wikidata-bugs] [Maniphest] T261849: Benchmark old and new model accuracy on new labeled data

2020-09-16 Thread Lydia_Pintscher
Lydia_Pintscher added a comment. Current state: Lydia needs to review. Amir will explain :D TASK DETAIL https://phabricator.wikimedia.org/T261849 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Ladsgroup, Lydia_Pintscher Cc: GoranSMilovanovic,

[Wikidata-bugs] [Maniphest] T261849: Benchmark old and new model accuracy on new labeled data

2020-09-14 Thread Michael
Michael closed subtask T261850: compare model accuracy with and without property suggester as Resolved. TASK DETAIL https://phabricator.wikimedia.org/T261849 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Ladsgroup, Michael Cc: GoranSMilovanovic,

[Wikidata-bugs] [Maniphest] T261849: Benchmark old and new model accuracy on new labeled data

2020-09-10 Thread Ladsgroup
Ladsgroup moved this task from Doing to Peer Review on the Item Quality Scoring Improvement (Item Quality Scoring Improvement - Sprint 3) board. Ladsgroup added a comment. So I took a 750 sample of current items in wikidata. I took a stratified sample, like 150 from a certain range of size

[Wikidata-bugs] [Maniphest] T261849: Benchmark old and new model accuracy on new labeled data

2020-09-08 Thread Ladsgroup
Ladsgroup claimed this task. Ladsgroup moved this task from To Do to Doing on the Item Quality Scoring Improvement (Item Quality Scoring Improvement - Sprint 3) board. Restricted Application added a project: User-Ladsgroup. TASK DETAIL https://phabricator.wikimedia.org/T261849 WORKBOARD

[Wikidata-bugs] [Maniphest] T261849: Benchmark old and new model accuracy on new labeled data

2020-09-02 Thread Lydia_Pintscher
Lydia_Pintscher moved this task from Backlog to Item Quality Scoring Improvement - Sprint 3 on the Item Quality Scoring Improvement board. Lydia_Pintscher edited projects, added Item Quality Scoring Improvement (Item Quality Scoring Improvement - Sprint 3); removed Item Quality Scoring

[Wikidata-bugs] [Maniphest] T261849: Benchmark old and new model accuracy on new labeled data

2020-09-02 Thread Lydia_Pintscher
Lydia_Pintscher set the point value for this task to "3". TASK DETAIL https://phabricator.wikimedia.org/T261849 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Lydia_Pintscher Cc: Aklapper, Lydia_Pintscher, guergana.tzatchkova, Akuckartz,

[Wikidata-bugs] [Maniphest] T261849: Benchmark old and new model accuracy on new labeled data

2020-09-02 Thread Lydia_Pintscher
Lydia_Pintscher created this task. Lydia_Pintscher added projects: Item Quality Scoring Improvement, Wikidata. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION **Problem:** We would like to see if the new model is better than the old model in predicting the quality of