[Wikidata-bugs] [Maniphest] T328813: Develop a ML-based service to detect vandalism on Wikidata
diego closed subtask T341820: Evaluate and improve the Revert Risk model for Wikidata. as "Resolved". TASK DETAIL https://phabricator.wikimedia.org/T328813 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Michael, calbon, achou, MunizaA, Lydia_Pintscher, leila, diego, Aklapper, Danny_Benjafield_WMDE, S8321414, KinneretG, Astuthiodit_1, YLiou_WMF, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, Dringsim, Nandana, Gnoeee, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, KimKelting, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T343419: Move Wikidata tools to Lift Wing
diego added a parent task: T341820: Evaluate and improve the Revert Risk model for Wikidata.. TASK DETAIL https://phabricator.wikimedia.org/T343419 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: diego, achou, Arian_Bozorg, Ladsgroup, Lucas_Werkmeister_WMDE, Michael, ItamarWMDE, Aklapper, Lydia_Pintscher, elukey, Danny_Benjafield_WMDE, fbalicchia, isarantopoulos, Astuthiodit_1, karapayneWMDE, Simonmaignan, Invadibot, amy_rc, maantietaja, calbon, Anerka, Akuckartz, Nandana, Lahi, Gq86, Xinbenlv, Vacio, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Zache, Wikidata-bugs, aude, Alchimista, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T343419: Move Wikidata tools to Lift Wing
diego added a comment. Also the experimental model is available through the Knowledge Integrity package <https://gitlab.wikimedia.org/repos/research/knowledge_integrity>. Here you have an example Python notebook on how to use it from PAWS (or from your local machine). <https://public-paws.wmcloud.org/User:Diego_%28WMF%29/WikidataRevertRisk/wikidata_ki_example_notebook.ipynb> TASK DETAIL https://phabricator.wikimedia.org/T343419 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: diego, achou, Arian_Bozorg, Ladsgroup, Lucas_Werkmeister_WMDE, Michael, ItamarWMDE, Aklapper, Lydia_Pintscher, elukey, Danny_Benjafield_WMDE, fbalicchia, isarantopoulos, Astuthiodit_1, karapayneWMDE, Simonmaignan, Invadibot, amy_rc, maantietaja, calbon, Anerka, Akuckartz, Nandana, Lahi, Gq86, Xinbenlv, Vacio, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Zache, Wikidata-bugs, aude, Alchimista, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T343419: Move Wikidata tools to Lift Wing
diego added a comment. And if you want to help with the evaluation, please go to this site: https://annotool.toolforge.org/ and help us to annotate data :) TASK DETAIL https://phabricator.wikimedia.org/T343419 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: diego, achou, Arian_Bozorg, Ladsgroup, Lucas_Werkmeister_WMDE, Michael, ItamarWMDE, Aklapper, Lydia_Pintscher, elukey, Danny_Benjafield_WMDE, fbalicchia, isarantopoulos, Astuthiodit_1, karapayneWMDE, Simonmaignan, Invadibot, amy_rc, maantietaja, calbon, Anerka, Akuckartz, Nandana, Lahi, Gq86, Xinbenlv, Vacio, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Zache, Wikidata-bugs, aude, Alchimista, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T343419: Move Wikidata tools to Lift Wing
diego added a comment. In T343419#9068806 <https://phabricator.wikimedia.org/T343419#9068806>, @achou wrote: > @elukey Research team's plan for the RevertRisk Wikidata model is to evaluate it in Q1, and then improve and deploy it in Q2. I can confirm this! TASK DETAIL https://phabricator.wikimedia.org/T343419 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: diego, achou, Arian_Bozorg, Ladsgroup, Lucas_Werkmeister_WMDE, Michael, ItamarWMDE, Aklapper, Lydia_Pintscher, elukey, Danny_Benjafield_WMDE, fbalicchia, isarantopoulos, Astuthiodit_1, karapayneWMDE, Simonmaignan, Invadibot, amy_rc, maantietaja, calbon, Anerka, Akuckartz, Nandana, Lahi, Gq86, Xinbenlv, Vacio, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Zache, Wikidata-bugs, aude, Alchimista, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata
diego closed this task as "Resolved". TASK DETAIL https://phabricator.wikimedia.org/T333892 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Isaac, achou, Lydia_Pintscher, MunizaA, Aklapper, leila, mrephabricator, KinneretG, Astuthiodit_1, YLiou_WMF, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata
diego added a comment. **Weekly Updates** - The Wikidata Revert Risk model is now available for testing on this PAWS notebook <https://public-paws.wmcloud.org/User:Diego_(WMF)/WikidataRevertRisk/wikidata_ki_example_notebook.ipynb>. I'm going to resolve this task and add the evaluation and improvements in a new ticket. TASK DETAIL https://phabricator.wikimedia.org/T333892 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Isaac, achou, Lydia_Pintscher, MunizaA, Aklapper, leila, mrephabricator, KinneretG, Astuthiodit_1, YLiou_WMF, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata
diego added a subscriber: Isaac. diego added a comment. **Weekly Updates** - @MunizaA has released an alpha version of the evaluation tool. Results for Wikidata Model can be found here <https://annotool.toolforge.org/projects/6>. - For Wikidata Revert Risk, I'm going to upload thetraining and testing code, plus the model on public repo, and then open another task for model's evaluation and improvements. - Regarding the Item Quality model, I'm going to coordinate with @Isaac for the follow-ups on that project. TASK DETAIL https://phabricator.wikimedia.org/T333892 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Isaac, achou, Lydia_Pintscher, MunizaA, Aklapper, leila, mrephabricator, KinneretG, Astuthiodit_1, YLiou_WMF, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata
diego added a comment. **Weekly updates** - I'm currently working on the Model Card for this algorithm. - @MunizaA please notify us in this ticket when the annotation tool app is ready. - We are preparing the code to be shared with @Lydia_Pintscher and (through her) with volunteer developers to test the current algorithm on their own datasets. TASK DETAIL https://phabricator.wikimedia.org/T333892 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: achou, Lydia_Pintscher, MunizaA, Aklapper, leila, mrephabricator, Astuthiodit_1, YLiou_WMF, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata
diego added a comment. - Weekly Updates** - We have met with Lydia and community developers. We are going to share our code with them and we have also learn about their efforts on automatic content patrolling in Wikidata. - The evaluation tool code is ready, this week @MunizaA would upload this to a public end-point (toolforge or wmfcloud). TASK DETAIL https://phabricator.wikimedia.org/T333892 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: achou, Lydia_Pintscher, MunizaA, Aklapper, leila, mrephabricator, Astuthiodit_1, YLiou_WMF, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata
diego added a comment. **Weekly Updates** - We are still working on the evaluation tool. TASK DETAIL https://phabricator.wikimedia.org/T333892 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: achou, Lydia_Pintscher, MunizaA, Aklapper, leila, mrephabricator, Astuthiodit_1, YLiou_WMF, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata
diego added a comment. **Weekly Updates** - @MunizaA is working on evaluation tool that would be usable by all the Revert Risk Models, including the Wikidata on as well as the LA and Multilingual for Wikipedia TASK DETAIL https://phabricator.wikimedia.org/T333892 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: achou, Lydia_Pintscher, MunizaA, Aklapper, leila, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata
diego added a comment. **Weekly Updates** - The model card for Multilingual model is available here <https://meta.wikimedia.org/wiki/Machine_learning_models/Proposed/Multilingual_revert_risk_model_card>. - We are working with Lydia to evaluate the model, and update if needed. TASK DETAIL https://phabricator.wikimedia.org/T333892 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: achou, Lydia_Pintscher, MunizaA, Aklapper, leila, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata
diego added a subscriber: achou. diego added a comment. **Weekly Updates** - The first version of this model is ready to go to LiftWing. - @MunizaA has submitted a merge request <https://gitlab.wikimedia.org/repos/research/knowledge_integrity/-/merge_requests/16>. Now @achou is reviewing the code. - I'll be meeting with @Lydia_Pintscher next week to show the results and discuss next steps. - We are planning to create and upload the model card next week. TASK DETAIL https://phabricator.wikimedia.org/T333892 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: achou, Lydia_Pintscher, MunizaA, Aklapper, leila, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata
diego added a comment. **Weekly Updates** - We are finalizing the feature extraction pipeline code and the code to serve the model on LiftWing. TASK DETAIL https://phabricator.wikimedia.org/T333892 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Lydia_Pintscher, MunizaA, Aklapper, leila, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata
diego added a comment. **Weekly Updates** - We have develop a meta-model. This model has two main components. - The first one is a Catboost based classifier, designed to assess the Revert Risk for claims set and updates. - The second model is an hybrid approach, designed to evaluate Revert Risk on Wikidata Item Descriptions. This model uses mBert <https://huggingface.co/bert-base-multilingual-cased>. - @MunizaA has developed a methodology for creating clean training data for the mBert Model - @MunizaA is now working on implementing this model, and the feature extraction pipeline by updating the Knowledge Integrity Repo <https://gitlab.wikimedia.org/repos/research/knowledge_integrity>. TASK DETAIL https://phabricator.wikimedia.org/T333892 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Lydia_Pintscher, MunizaA, Aklapper, leila, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata
diego added a comment. **Weekly updates** - @MunizaA has created an efficient pipeline to train HuggingFace Transformers, using the GPUs from the stat machines, and data coming from the Data Lake. - We are experimenting with different LLM such as mBert and Roberta, to detect vandalism on Item Descriptions. TASK DETAIL https://phabricator.wikimedia.org/T333892 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: MunizaA, Aklapper, leila, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata
diego added a subscriber: MunizaA. diego added a comment. **Weekly Updates** - @MunizaA has been testing the feasibility and utility of using Wikidata Embeddings, both for Item Quality and Revert Risk. We have studied different implementations, and experimenting with the PyTorch BigGraph model <https://github.com/facebookresearch/PyTorch-BigGraph>. We have been able to train on medium-size subgraphs. While the training on large graphs seems to be possible, we are still evaluating the value of such embeddings for the proposed tasks. - We have tested specific approaches for different types of actions. Eg: One language-based model to assess quality of descriptions and labels, and other models for claims containing triples (Q_x P_y Q_z). This is improving the performance and quality of our results. TASK DETAIL https://phabricator.wikimedia.org/T333892 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: MunizaA, Aklapper, leila, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T328813: Develop a ML-based service to detect vandalism on Wikidata
diego added a comment. **Update** - I'm testing a Deep Learning approach, to see if offers relevant advantages over the current XGBOOST model. TASK DETAIL https://phabricator.wikimedia.org/T328813 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Michael, calbon, achou, MunizaA, Lydia_Pintscher, leila, diego, Aklapper, Astuthiodit_1, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T332021: Wikidata Articlequality ORES/ML model needs updating after MUL
diego added subscribers: Isaac, diego. diego added a comment. @Michael FYI: @Isaac has done interesting progress on Wikidata Item Quality automatic evaluation T321224 <https://phabricator.wikimedia.org/T321224>. Also, I'm leading another work on vandalism detection on Wikidata T328813 <https://phabricator.wikimedia.org/T328813>. TASK DETAIL https://phabricator.wikimedia.org/T332021 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: diego, Isaac, Aklapper, Lydia_Pintscher, Manuel, Michael, Astuthiodit_1, Gethan, karapayneWMDE, Simonmaignan, Invadibot, Theofpa, maantietaja, calbon, guergana.tzatchkova, Anerka, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, Xinbenlv, Vacio, Capankajsmilyo, GoranSMilovanovic, Fz-29, QZanden, LawExplorer, elukey, _jensen, rosalieper, Mkdw, Scott_WUaS, notconfusing, Wikidata-bugs, aude, Ricordisamoa, Alchimista, He7d3r, Ladsgroup, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T328813: Develop a ML-based service to detect vandalism on Wikidata
diego added a comment. **Update** - New features had slightly improved the accuracy (now is 75%), I'm still working on improving the model. TASK DETAIL https://phabricator.wikimedia.org/T328813 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: calbon, achou, MunizaA, Lydia_Pintscher, leila, diego, Aklapper, Astuthiodit_1, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T328813: Develop a ML-based service to detect vandalism on Wikidata
diego added a comment. **Update** - Currently I'm working on featuring engineering. The current model has around 72% accuracy on balanced data. TASK DETAIL https://phabricator.wikimedia.org/T328813 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: calbon, achou, MunizaA, Lydia_Pintscher, leila, diego, Aklapper, Astuthiodit_1, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T328813: Develop a ML-based service to detect vandalism on Wikidata
diego added a comment. **Update** - Still working on the data evaluation. Currently I'm studying the use of tags and user groups and their relation with reverts. TASK DETAIL https://phabricator.wikimedia.org/T328813 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: calbon, achou, MunizaA, Lydia_Pintscher, leila, diego, Aklapper, Astuthiodit_1, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T321224: Wikidata Item Quality Model
diego added a comment. I'm trying to implement a link-prediction task on Wikidata, to be used as proxy for claims coverage. I'm building on top of Goyal & Ferrara <https://arxiv.org/pdf/1705.02801.pdf>'s work. The existing libraries might require some tweaks to work on the full Wikidata Graph, but before addressing the scalability issues I want to test this approach on a small sample to see the suitability of this approach. TASK DETAIL https://phabricator.wikimedia.org/T321224 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Isaac, diego Cc: diego, Miriam, Isaac, Astuthiodit_1, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Lydia_Pintscher, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T307323: WMDE Machine Learning (ORES)
diego added a subscriber: Lydia_Pintscher. diego added a comment. Hey @DAbad, as part of this proposal <https://docs.google.com/document/d/1qAF7nJNAMw3yOwoKP2HuvkpuhwjaWs-BX9dSGRTkJc8/edit?usp=sharing>, I'm in conversations with @Lydia_Pintscher and @calbon to develop new models for Wikidata that works directly on Liftwing. It would great to know in detail the requirements for the models mentioned in this ticket, and also whether you have labeled that we can use to train the algorithms. TASK DETAIL https://phabricator.wikimedia.org/T307323 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: DAbad, diego Cc: Lydia_Pintscher, diego, calbon, DAbad, SWakiyama, Aklapper, Astuthiodit_1, STH, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Shizhao, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **Updates** - We finished this project, results can be found on Meta <https://meta.wikimedia.org/wiki/Research:Identifying_Controversial_Content_in_Wikidata>, the code and models could be found in Gitlab <https://gitlab.wikimedia.org/repos/research/controveriesWikidata>. - I'll discuss future work with @Lydia_Pintscher. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego closed this task as "Resolved". diego updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **Updates** - I was comparing the results when adding anonymous edits, until now I haven't find major differences with the previous results. I'll continue working on this during the next week before my next meeting with Lydia. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **Updates** - I've presented the main results of this work during the Tuesday Research Sessions, slides can be find here <https://docs.google.com/presentation/d/1JUqUqhlwPwCx6koy5t8oKiYC4flHNEUrpBMUkvt76xE/edit?usp=sharing>. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **Updates** - We meet with Lydia and discussed the current results. - We reviewed the results confirming that most co-edited items corresponds to on going events, even when we change the time window to be considered. - Now, I'll be studying the relevance/prevalence of anonymous edits on popular content. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, karapayneWMDE, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **Updates** - No updates TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, karapayneWMDE, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **Updates** - I'm working in identifying collaborative edits on wikidata items not related to current events. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **Updates** - No updates TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **Updates** - We are now focusing in understanding collaborations patterns: when/how more than user edits the same item in a given period of time. - We found that in Wikidata such collaborations are less frequent than in other Wikimedia projects. - We also found that items edited by more than one user are usually related to on going events (awards, deaths, releases) - I'll present some of these findings: - On research meeting (Tuesday) in March - And @Lydia_Pintscher will propose a date probably in April to present these results to the Wikidata folks. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **Updates** - I'm organizing the new results to be discussed with the stakeholder. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego moved this task from FY2021-22-Research-Oct-Dec to FY2021-22-Research-Jan-March on the Research board. diego edited projects, added Research (FY2021-22-Research-Jan-March); removed Research (FY2021-22-Research-Oct-Dec). TASK DETAIL https://phabricator.wikimedia.org/T287946 WORKBOARD https://phabricator.wikimedia.org/project/board/45/ EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **Updates** - I'm focusing on modeling the relationship between topics and collaborations/controversies. - I'm working on graph representation of these components TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **Updates** - We have seen that few items are edited by more than one user. - We are currently researching about the item and users characteristics related to collaborative work. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **Updates** - No updates this week. I'm going to meet with the stakeholder next week. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **Updates** - I've been working on classifier to predict reverts. - The current classifier uses article (item), revision and user information. - On a balance test set, the actual model gets results over 70% of accuracy - However, there is a set of caveats to be considered: - 'auto-reverts': users can revert themselves, this shouldn't be consider as signal of controversy. We need to analyze more this behavior. - power-users: we need to take in account that a small set of users produces most of the edits and reverts, this behavior could affect our results. We are working on different sampling method to address this issue. - The meta page was updated with the results in Q1 and partial results in Q2. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **Updates** - Working on modeling the reverting behavior. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **Updates** No updates this week. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **Updates** - Preliminary results presented to our stakeholder. - Next weeeks we will be focusing a deeper understanding of reverting behavior. **TODO** - Update meta page (within the next 3 weeks) TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **Updates** We presented this work at the TTO'21 conference <https://truthandtrustonline.com/>. We received interesting feedback, including questions about the definition of controversial content. Some potential collaboration for a second round on this research were opened. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego moved this task from FY2021-22-Research-July-Sept to FY2021-22-Research-Oct-Dec on the Research board. diego edited projects, added Research (FY2021-22-Research-Oct-Dec); removed Research (FY2021-22-Research-July-Sept). TASK DETAIL https://phabricator.wikimedia.org/T287946 WORKBOARD https://phabricator.wikimedia.org/project/board/45/ EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **Updates** - I've started gathering and organizing the different results, to write a first report. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **updates** I've created a page <https://meta.wikimedia.org/wiki/Research:Identifying_Controversial_Content_in_Wikidata> on meta about this project. In the following weeks I'll be uploading some of the analysis and main results there. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **Updates** - I've been crunching data to study the "disputed by" qualifier. The plan is to have some statistics on this and compare with the reverts behavior. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **Updates** No updates this week. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **Updates** - I've been running analysis on the predictability of reverts on Wikidata, including page, user and edit characteristics such as the property and the action summary explained above. - Probably not surprising I've found that the user characteristics such as the "account age" (the difference between a given edit and the user account creation) is the most related with the revert probability: - I've also noted that bots are less likely than humans to reverted, and that edits in new articles (items) are more likely to be reverted. - I'm now analyzing a set of properties that showed some correlation with reverts: - P 9157 - P 97 - P 3602 - P 646 - P 3782 - P 183 - P 2860 - P 7902 - P 9339 - P 2671 - Next steps is to model interactions between users, and also analyze the usage of the "disputed by" qualifiers. F34631288: image.png <https://phabricator.wikimedia.org/F34631288> TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. **Updates** - I'm focusing on reverted revisions. - Developed a methodology to characterize Wikidata edits according different dimensions, such as the property edited, the edit type (from edit summaries), and user characteristics. (popular edit types) F34618187: image.png <https://phabricator.wikimedia.org/F34618187> - Exploring the differences on reverts done/received by bots and humans. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. No updates this week. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. @Lydia_Pintscher , regarding your question about the number of users co-editing a Wikidata page, I found that for all edits to namespace 0, in July 2021, considering items that have at least one sitelink: - 84% of pages were edited just by one user. - 14% by two users , and the reminding 2% of pages, by more than 2 users. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego added a comment. As a very initial exploration, we analyzed a subset of Wikidata items, categorized them by topic, and checked which of them received more **updates**, as proxy for conroversiality. More specifically, - We selected all the Wikidata items with sitelinks to enwiki. - We counted the number edits summaries containing the keyword:`wbsetclaim-update` - We found that claims related to Software and computing are the ones - proportionally - more updated within this subsabe F34574587: image.png <https://phabricator.wikimedia.org/F34574587> - We also found that most updated property is P31 <https://phabricator.wikimedia.org/P31> F34574595: image.png <https://phabricator.wikimedia.org/F34574595> These last results are not normalized yet. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego triaged this task as "High" priority. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata
diego created this task. diego added projects: Wikidata, Research (FY2021-22-Research-July-Sept). Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION The aim of this project is to identify controversial content in Wikidata. Specifically we will develop the following tasks: [ ] Create and test different definitions of controversiality in Wikidata, [ ] Develop a model to early identify controversial content. TASK DETAIL https://phabricator.wikimedia.org/T287946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T272192: Migrate to new Wikidata Analytics
diego added a comment. I see. I was asking because we wrote these address on published papers, and those are immutable. But if is not possible, is not possible. TASK DETAIL https://phabricator.wikimedia.org/T272192 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GoranSMilovanovic, diego Cc: diego, WMDE-leszek, Lea_Lacroix_WMDE, Lydia_Pintscher, GoranSMilovanovic, Aklapper, Invadibot, maantietaja, Akuckartz, Michael, Nandana, Lahi, Gq86, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T272192: Migrate to new Wikidata Analytics
diego added a comment. Would be possible to add redirects from the old urls to the new ones? TASK DETAIL https://phabricator.wikimedia.org/T272192 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GoranSMilovanovic, diego Cc: diego, WMDE-leszek, Lea_Lacroix_WMDE, Lydia_Pintscher, GoranSMilovanovic, Aklapper, Invadibot, maantietaja, Akuckartz, Michael, Nandana, Lahi, Gq86, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T204438: finding statements that need a reference
diego added a comment. https://dl.acm.org/doi/abs/10.1145/3366424.3383571 TASK DETAIL https://phabricator.wikimedia.org/T204438 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: diego, Hjfocs, Nandana, GoranSMilovanovic, Aklapper, Lydia_Pintscher, Akuckartz, Dinadineke, DannyS712, tabish.shaikh91, Lahi, Gq86, Soteriaspace, Jayprakash12345, JakeTheDeveloper, QZanden, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, TheDJ, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T90881: Framework for checking sources on Wikidata (Does the source actually say what we claim it says?)
diego added a comment. Hi all This problem is called Natural Language Inference (NLI) also known as textual entitlement . It is a super hot problem now in the NLP community, but imho research is still far away from producing usable tools in the Wikipedia context. This also requires a lot of computational resources (GPUs) to train. Anyhow, I'm exploring if would be possible to create a usable API where you could send a claim and a document and the API will tell the relation between those pieces (confirm, reject, no information). I think the algorithm won't work well with subtle issues (eg. the references is talking about the main topic of the item, but does not content the specific information about the claim), but could be able to catch if the document (reference) is completely unrelated. I'll keep you updated. TASK DETAIL https://phabricator.wikimedia.org/T90881 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: maiarocg, diego Cc: diego, Abbe98, Tamslo, Ricordisamoa, Liuxinyu970226, Lydia_Pintscher, Aklapper, Akuckartz, Dinadineke, DannyS712, Nandana, tabish.shaikh91, Lahi, Gq86, GoranSMilovanovic, Soteriaspace, Jayprakash12345, JakeTheDeveloper, QZanden, merbst, LawExplorer, Culex, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, TheDJ, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T155560: Linked fact checker
diego added a comment. @leila I see some overlap although this task seems to be broader than the one I'm working on. Given that I don't see much documentation nor code about this task, I prefer to not take responsibility on this. TASK DETAIL https://phabricator.wikimedia.org/T155560 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: diego, Cirdan, Capankajsmilyo, PokestarFan, Natalia, leila, Cervisiarius, Tarrow, Hjfocs, srijan, DarTar, GLCiampaglia, Tgr, Harej, Zppix, Jseddon, Basvb, Halfak, Aklapper, Akuckartz, darthmon_wmde, Nandana, Zambujo, Lahi, Gq86, GoranSMilovanovic, Fz-29, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Daniel_Mietchen, jayvdb, Ricordisamoa, He7d3r, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T215616: Improve interlingual links across wikis through Wikidata IDs
diego added a comment. I think we are talking about three different things: i) page_id -> CurrentWikidataItem: this was my original request, and I think @JAllemandou 's script solves this issue. Having that table updated would be great. ii) revision_id-> CurrentWikidataItem: This can be obtained by joining the previous table with the revision table. Having that table pre-computed would save time and resources on joining, but we can also do the join just when is needed. iii)revision_id ->HistoricalWikidataItem: I was not looking for that, although it would be very interesting information. TASK DETAIL https://phabricator.wikimedia.org/T215616 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: diego Cc: Isaac, Tbayer, jcrespo, EBernhardson, Halfak, Nuria, JAllemandou, diego, Nandana, Akovalyov, Banyek, AndyTan, Rayssa-, Lahi, Gq86, GoranSMilovanovic, QZanden, Marostegui, LawExplorer, Avner, Minhnv-2809, _jensen, Luke081515, Wikidata-bugs, aude, Capt_Swing, Dinoguy1000, Mbch331, Jay8g, Krenair, jeremyb ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T215616: Improve interlingual links across wikis through Wikidata IDs
diego added a comment. @JAllemandou , yes. Having this by revision would be great!TASK DETAILhttps://phabricator.wikimedia.org/T215616EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: diegoCc: Isaac, Tbayer, jcrespo, EBernhardson, Halfak, Nuria, JAllemandou, diego, Nandana, Akovalyov, Banyek, AndyTan, Rayssa-, Lahi, Gq86, GoranSMilovanovic, QZanden, Marostegui, LawExplorer, Avner, Minhnv-2809, _jensen, Luke081515, Wikidata-bugs, aude, Capt_Swing, Dinoguy1000, Mbch331, Jay8g, Krenair, jeremyb___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T215616: Improve interlingual links across wikis through Wikidata IDs
diego added a comment. @Tbayer , great. Thanks.TASK DETAILhttps://phabricator.wikimedia.org/T215616EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: diegoCc: Tbayer, jcrespo, EBernhardson, Halfak, Nuria, JAllemandou, diego, Nandana, Akovalyov, Banyek, AndyTan, Rayssa-, Lahi, Gq86, GoranSMilovanovic, QZanden, Marostegui, LawExplorer, Avner, Minhnv-2809, _jensen, Luke081515, Wikidata-bugs, aude, Capt_Swing, Dinoguy1000, Mbch331, Jay8g, Krenair, jeremyb___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T215616: Improve interlingual links across wikis through Wikidata IDs
diego added a comment. @jcrespo, the API works good for query specific pages/entities, not for example to know which pages that existing in X_wiki are missing on the Y_wiki. My point here it is that the wikidata identifier is currently the main identifier for a page/concept, and that this fact is not reflected on the DB structure. I understand that this might be due historical reasons, but it would be good to think in a way that our DBs make easier to link content across wikis.TASK DETAILhttps://phabricator.wikimedia.org/T215616EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: diegoCc: jcrespo, EBernhardson, Halfak, Nuria, JAllemandou, diego, Nandana, Akovalyov, Banyek, AndyTan, Rayssa-, Lahi, Gq86, GoranSMilovanovic, QZanden, Marostegui, LawExplorer, Avner, Minhnv-2809, _jensen, Luke081515, Wikidata-bugs, aude, Capt_Swing, Dinoguy1000, Mbch331, Jay8g, Krenair, jeremyb___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T215616: Improve interlingual links across wikis through Wikidata IDs
diego added a comment. @EBernhardson , this looks exactly what I was looking for, initially. Thank you very much for that. However, I wont close this task, because wikibase_item is still missing the page_id information. Joining by page_title does not seems very 'healthy'. We should keep discussing how to solve that. ThanksTASK DETAILhttps://phabricator.wikimedia.org/T215616EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: diegoCc: EBernhardson, Halfak, Nuria, JAllemandou, diego, Nandana, Akovalyov, Banyek, AndyTan, Rayssa-, Lahi, Gq86, GoranSMilovanovic, QZanden, Marostegui, LawExplorer, Avner, Minhnv-2809, _jensen, Luke081515, Wikidata-bugs, aude, Capt_Swing, Dinoguy1000, Mbch331, Jay8g, Krenair, jeremyb___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T215616: Improve interlingual links across wikis through Wikidata IDs
diego added a comment. Looks good @JAllemandou, thanks. This is a good workaround, but imho, we should have an structure or schema that makes this kind of tasks easier, specially for people outside without access to a cluster.TASK DETAILhttps://phabricator.wikimedia.org/T215616EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: diegoCc: Nuria, JAllemandou, diego, Nandana, Akovalyov, Banyek, AndyTan, Rayssa-, Lahi, Gq86, GoranSMilovanovic, QZanden, Marostegui, LawExplorer, Avner, Minhnv-2809, _jensen, Luke081515, Wikidata-bugs, aude, Capt_Swing, Dinoguy1000, Mbch331, Jay8g, Krenair, jeremyb___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Updated] T215616: Improve interlingual links across wikis through Wikidata IDs
diego added a project: Wikidata. TASK DETAILhttps://phabricator.wikimedia.org/T215616EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: diegoCc: Nuria, JAllemandou, diego, Nandana, Akovalyov, AndyTan, Lahi, Gq86, GoranSMilovanovic, QZanden, Marostegui, LawExplorer, Avner, Minhnv-2809, _jensen, Luke081515, Wikidata-bugs, aude, Capt_Swing, Mbch331, Jay8g, Krenair, jeremyb___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T182849: Identify unhelpful file names on commons
diego added a comment. Hi @chelsyx , Check this notebook, apparently the number of white spaces are a pretty good indicator of the filename quality.TASK DETAILhttps://phabricator.wikimedia.org/T182849EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: chelsyx, diegoCc: diego, Base, Liuxinyu970226, thiemowmde, Aklapper, Abit, Ramsey-WMF, mpopov, chelsyx, Nandana, JKSTNK, Lahi, PDrouin-WMF, Gq86, E1presidente, Cparle, Anooprao, SandraF_WMF, GoranSMilovanovic, QZanden, Tramullas, Acer, LawExplorer, Silverfish, _jensen, Susannaanas, Jane023, Wikidata-bugs, matthiasmullie, aude, Ricordisamoa, Wesalius, Lydia_Pintscher, Fabrice_Florin, Raymond, Steinsplitter, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T178249: Parameter for linking a new page to the Wikidata
diego added a comment. Hi, Kateryna is working on this: https://meta.wikimedia.org/wiki/Research:Matching_Red_Links_with_Wikidata_Items Please ping or write something in the discussion page if you want to know more about that projecy.TASK DETAILhttps://phabricator.wikimedia.org/T178249EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: diegoCc: diego, IKhitron, SerDIDG, putnik, Aklapper, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs