[Wikidata-bugs] [Maniphest] T328813: Develop a ML-based service to detect vandalism on Wikidata

2024-04-29 Thread diego
diego closed subtask T341820: Evaluate  and improve the Revert Risk model for 
Wikidata. as Resolved.

TASK DETAIL
  https://phabricator.wikimedia.org/T328813

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Michael, calbon, achou, MunizaA, Lydia_Pintscher, leila, diego, Aklapper, 
Danny_Benjafield_WMDE, S8321414, KinneretG, Astuthiodit_1, YLiou_WMF, 
karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, 
Dringsim, Nandana, Gnoeee, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
KimKelting, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, 
aude, Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T343419: Move Wikidata tools to Lift Wing

2023-08-04 Thread diego
diego added a parent task: T341820: Evaluate  and improve the Revert Risk model 
for Wikidata..

TASK DETAIL
  https://phabricator.wikimedia.org/T343419

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: diego, achou, Arian_Bozorg, Ladsgroup, Lucas_Werkmeister_WMDE, Michael, 
ItamarWMDE, Aklapper, Lydia_Pintscher, elukey, Danny_Benjafield_WMDE, 
fbalicchia, isarantopoulos, Astuthiodit_1, karapayneWMDE, Simonmaignan, 
Invadibot, amy_rc, maantietaja, calbon, Anerka, Akuckartz, Nandana, Lahi, Gq86, 
Xinbenlv, Vacio, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, 
Scott_WUaS, Zache, Wikidata-bugs, aude, Alchimista, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T343419: Move Wikidata tools to Lift Wing

2023-08-04 Thread diego
diego added a comment.


  Also the experimental model is available through the Knowledge Integrity 
package <https://gitlab.wikimedia.org/repos/research/knowledge_integrity>.
  
  Here you have an example Python notebook on how to use it from PAWS (or from 
your local machine). 
<https://public-paws.wmcloud.org/User:Diego_%28WMF%29/WikidataRevertRisk/wikidata_ki_example_notebook.ipynb>

TASK DETAIL
  https://phabricator.wikimedia.org/T343419

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: diego, achou, Arian_Bozorg, Ladsgroup, Lucas_Werkmeister_WMDE, Michael, 
ItamarWMDE, Aklapper, Lydia_Pintscher, elukey, Danny_Benjafield_WMDE, 
fbalicchia, isarantopoulos, Astuthiodit_1, karapayneWMDE, Simonmaignan, 
Invadibot, amy_rc, maantietaja, calbon, Anerka, Akuckartz, Nandana, Lahi, Gq86, 
Xinbenlv, Vacio, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, 
Scott_WUaS, Zache, Wikidata-bugs, aude, Alchimista, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T343419: Move Wikidata tools to Lift Wing

2023-08-04 Thread diego
diego added a comment.


  And if you want to help with the evaluation, please go to this site: 
https://annotool.toolforge.org/ and help us to annotate data :)

TASK DETAIL
  https://phabricator.wikimedia.org/T343419

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: diego, achou, Arian_Bozorg, Ladsgroup, Lucas_Werkmeister_WMDE, Michael, 
ItamarWMDE, Aklapper, Lydia_Pintscher, elukey, Danny_Benjafield_WMDE, 
fbalicchia, isarantopoulos, Astuthiodit_1, karapayneWMDE, Simonmaignan, 
Invadibot, amy_rc, maantietaja, calbon, Anerka, Akuckartz, Nandana, Lahi, Gq86, 
Xinbenlv, Vacio, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, 
Scott_WUaS, Zache, Wikidata-bugs, aude, Alchimista, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T343419: Move Wikidata tools to Lift Wing

2023-08-04 Thread diego
diego added a comment.


  In T343419#9068806 <https://phabricator.wikimedia.org/T343419#9068806>, 
@achou wrote:
  
  > @elukey Research team's plan for the RevertRisk Wikidata model is to 
evaluate it in Q1, and then improve and deploy it in Q2.
  
  I can confirm this!

TASK DETAIL
  https://phabricator.wikimedia.org/T343419

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: diego, achou, Arian_Bozorg, Ladsgroup, Lucas_Werkmeister_WMDE, Michael, 
ItamarWMDE, Aklapper, Lydia_Pintscher, elukey, Danny_Benjafield_WMDE, 
fbalicchia, isarantopoulos, Astuthiodit_1, karapayneWMDE, Simonmaignan, 
Invadibot, amy_rc, maantietaja, calbon, Anerka, Akuckartz, Nandana, Lahi, Gq86, 
Xinbenlv, Vacio, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, 
Scott_WUaS, Zache, Wikidata-bugs, aude, Alchimista, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata

2023-07-07 Thread diego
diego closed this task as "Resolved".

TASK DETAIL
  https://phabricator.wikimedia.org/T333892

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Isaac, achou, Lydia_Pintscher, MunizaA, Aklapper, leila, mrephabricator, 
KinneretG, Astuthiodit_1, YLiou_WMF, karapayneWMDE, Invadibot, Ywats0ns, 
maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, 
GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, 
Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata

2023-07-07 Thread diego
diego added a comment.


  **Weekly Updates**
  
  - The Wikidata Revert Risk model is now available for testing on this PAWS 
notebook 
<https://public-paws.wmcloud.org/User:Diego_(WMF)/WikidataRevertRisk/wikidata_ki_example_notebook.ipynb>.
  
  I'm going to resolve this task and add the evaluation and improvements in a 
new ticket.

TASK DETAIL
  https://phabricator.wikimedia.org/T333892

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Isaac, achou, Lydia_Pintscher, MunizaA, Aklapper, leila, mrephabricator, 
KinneretG, Astuthiodit_1, YLiou_WMF, karapayneWMDE, Invadibot, Ywats0ns, 
maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, 
GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, 
Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata

2023-06-30 Thread diego
diego added a subscriber: Isaac.
diego added a comment.


  **Weekly Updates**
  
  - @MunizaA has released an alpha version of the evaluation tool. Results for 
Wikidata Model can be found here <https://annotool.toolforge.org/projects/6>.
  - For Wikidata Revert Risk, I'm going to upload thetraining and testing code, 
plus the model on public repo, and then open another task for model's 
evaluation and improvements.
  - Regarding the Item Quality model, I'm going to coordinate with @Isaac for 
the follow-ups on that project.

TASK DETAIL
  https://phabricator.wikimedia.org/T333892

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Isaac, achou, Lydia_Pintscher, MunizaA, Aklapper, leila, mrephabricator, 
KinneretG, Astuthiodit_1, YLiou_WMF, karapayneWMDE, Invadibot, Ywats0ns, 
maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, 
GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, 
Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata

2023-06-16 Thread diego
diego added a comment.


  **Weekly updates**
  
  - I'm currently working on the Model Card for this algorithm.
  - @MunizaA  please notify us in this ticket when the annotation tool app is 
ready.
  - We are preparing the code to be shared with @Lydia_Pintscher and (through 
her) with volunteer developers to test the current algorithm on their own 
datasets.

TASK DETAIL
  https://phabricator.wikimedia.org/T333892

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: achou, Lydia_Pintscher, MunizaA, Aklapper, leila, mrephabricator, 
Astuthiodit_1, YLiou_WMF, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, 
ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, 
QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, 
aude, Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata

2023-06-12 Thread diego
diego added a comment.


  - Weekly Updates**
  
  - We have met with Lydia and community developers.  We are going to share our 
code with them and we have also learn about their efforts on automatic content 
patrolling in Wikidata.
  - The evaluation tool code is ready, this week @MunizaA would upload this to 
a public end-point (toolforge or wmfcloud).

TASK DETAIL
  https://phabricator.wikimedia.org/T333892

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: achou, Lydia_Pintscher, MunizaA, Aklapper, leila, mrephabricator, 
Astuthiodit_1, YLiou_WMF, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, 
ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, 
QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, 
aude, Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata

2023-06-02 Thread diego
diego added a comment.


  **Weekly Updates**
  
  - We are still working on the evaluation tool.

TASK DETAIL
  https://phabricator.wikimedia.org/T333892

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: achou, Lydia_Pintscher, MunizaA, Aklapper, leila, mrephabricator, 
Astuthiodit_1, YLiou_WMF, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, 
ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, 
QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, 
aude, Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata

2023-05-26 Thread diego
diego added a comment.


  **Weekly Updates**
  
  - @MunizaA  is working on evaluation tool that would be usable by all the 
Revert Risk Models, including the Wikidata on as well as the LA and 
Multilingual for Wikipedia

TASK DETAIL
  https://phabricator.wikimedia.org/T333892

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: achou, Lydia_Pintscher, MunizaA, Aklapper, leila, mrephabricator, 
Astuthiodit_1, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata

2023-05-14 Thread diego
diego added a comment.


  **Weekly Updates**
  
  - The model card for Multilingual model is available here 
<https://meta.wikimedia.org/wiki/Machine_learning_models/Proposed/Multilingual_revert_risk_model_card>.
  - We are working with Lydia to evaluate the model, and update if needed.

TASK DETAIL
  https://phabricator.wikimedia.org/T333892

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: achou, Lydia_Pintscher, MunizaA, Aklapper, leila, mrephabricator, 
Astuthiodit_1, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata

2023-05-05 Thread diego
diego added a subscriber: achou.
diego added a comment.


  **Weekly Updates**
  
  - The first version of this model is ready to go to LiftWing.
  - @MunizaA  has submitted a merge request 
<https://gitlab.wikimedia.org/repos/research/knowledge_integrity/-/merge_requests/16>.
 Now @achou is reviewing the code.
  - I'll be meeting with @Lydia_Pintscher next week to show the results and 
discuss next steps.
  - We are planning to create and upload the model card next week.

TASK DETAIL
  https://phabricator.wikimedia.org/T333892

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: achou, Lydia_Pintscher, MunizaA, Aklapper, leila, mrephabricator, 
Astuthiodit_1, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata

2023-04-28 Thread diego
diego added a comment.


  **Weekly Updates**
  
  - We are finalizing the feature extraction pipeline code and the code to 
serve the model on LiftWing.

TASK DETAIL
  https://phabricator.wikimedia.org/T333892

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Lydia_Pintscher, MunizaA, Aklapper, leila, mrephabricator, Astuthiodit_1, 
karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, 
Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, 
Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata

2023-04-20 Thread diego
diego added a comment.


  **Weekly Updates**
  
  - We have develop a meta-model. This model has two main components.
- The first one is a Catboost based classifier, designed to assess the 
Revert Risk for claims set and updates.
- The second model is an hybrid approach, designed to evaluate Revert Risk 
on Wikidata Item Descriptions. This model uses mBert 
<https://huggingface.co/bert-base-multilingual-cased>.
- @MunizaA has developed a methodology for creating clean training data for 
the mBert Model
  - @MunizaA  is now working on implementing this model, and the feature 
extraction pipeline by updating the Knowledge Integrity Repo 
<https://gitlab.wikimedia.org/repos/research/knowledge_integrity>.

TASK DETAIL
  https://phabricator.wikimedia.org/T333892

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Lydia_Pintscher, MunizaA, Aklapper, leila, mrephabricator, Astuthiodit_1, 
karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, 
Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, 
Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata

2023-04-14 Thread diego
diego added a comment.


  **Weekly updates**
  
  - @MunizaA has created an efficient pipeline to train HuggingFace 
Transformers, using the GPUs from the stat machines, and data coming from the 
Data Lake.
  - We are experimenting with different LLM such as mBert and Roberta, to 
detect vandalism on Item Descriptions.

TASK DETAIL
  https://phabricator.wikimedia.org/T333892

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: MunizaA, Aklapper, leila, mrephabricator, Astuthiodit_1, karapayneWMDE, 
Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, 
Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, 
rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T333892: Develop a new generation of ML models for Wikidata

2023-04-07 Thread diego
diego added a subscriber: MunizaA.
diego added a comment.


  **Weekly Updates**
  
  - @MunizaA has been testing the feasibility and utility of using Wikidata 
Embeddings, both for Item Quality and Revert Risk. We have studied different 
implementations, and experimenting with the PyTorch BigGraph model 
<https://github.com/facebookresearch/PyTorch-BigGraph>. We have been able to 
train on medium-size subgraphs. While the training on large graphs seems to be 
possible, we are still evaluating the value of such embeddings for the proposed 
tasks.
  - We have tested specific approaches for different types of actions. Eg: One 
language-based model to assess quality of descriptions and labels, and other 
models for claims containing triples (Q_x P_y Q_z). This is improving the 
performance and quality of our results.

TASK DETAIL
  https://phabricator.wikimedia.org/T333892

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: MunizaA, Aklapper, leila, mrephabricator, Astuthiodit_1, karapayneWMDE, 
Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, 
Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, 
rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T328813: Develop a ML-based service to detect vandalism on Wikidata

2023-03-31 Thread diego
diego added a comment.


  **Update**
  
  - I'm testing a Deep Learning approach, to see if offers relevant advantages 
over the current XGBOOST model.

TASK DETAIL
  https://phabricator.wikimedia.org/T328813

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Michael, calbon, achou, MunizaA, Lydia_Pintscher, leila, diego, Aklapper, 
Astuthiodit_1, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T332021: Wikidata Articlequality ORES/ML model needs updating after MUL

2023-03-22 Thread diego
diego added subscribers: Isaac, diego.
diego added a comment.


  @Michael FYI:
  @Isaac has done interesting progress on Wikidata Item Quality automatic 
evaluation T321224 <https://phabricator.wikimedia.org/T321224>. Also, I'm 
leading another work on vandalism detection on Wikidata T328813 
<https://phabricator.wikimedia.org/T328813>.

TASK DETAIL
  https://phabricator.wikimedia.org/T332021

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: diego, Isaac, Aklapper, Lydia_Pintscher, Manuel, Michael, Astuthiodit_1, 
Gethan, karapayneWMDE, Simonmaignan, Invadibot, Theofpa, maantietaja, calbon, 
guergana.tzatchkova, Anerka, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, 
Xinbenlv, Vacio, Capankajsmilyo, GoranSMilovanovic, Fz-29, QZanden, 
LawExplorer, elukey, _jensen, rosalieper, Mkdw, Scott_WUaS, notconfusing, 
Wikidata-bugs, aude, Ricordisamoa, Alchimista, He7d3r, Ladsgroup, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T328813: Develop a ML-based service to detect vandalism on Wikidata

2023-03-10 Thread diego
diego added a comment.


  **Update**
  
  - New features had slightly improved the accuracy (now is 75%), I'm still 
working on improving the model.

TASK DETAIL
  https://phabricator.wikimedia.org/T328813

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: calbon, achou, MunizaA, Lydia_Pintscher, leila, diego, Aklapper, 
Astuthiodit_1, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T328813: Develop a ML-based service to detect vandalism on Wikidata

2023-03-05 Thread diego
diego added a comment.


  **Update**
  
  - Currently I'm working on featuring engineering. The current model has 
around 72% accuracy on balanced data.

TASK DETAIL
  https://phabricator.wikimedia.org/T328813

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: calbon, achou, MunizaA, Lydia_Pintscher, leila, diego, Aklapper, 
Astuthiodit_1, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T328813: Develop a ML-based service to detect vandalism on Wikidata

2023-02-17 Thread diego
diego added a comment.


  **Update**
  
  - Still working on the data evaluation. Currently I'm studying the use of 
tags and user groups and their relation with reverts.

TASK DETAIL
  https://phabricator.wikimedia.org/T328813

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: calbon, achou, MunizaA, Lydia_Pintscher, leila, diego, Aklapper, 
Astuthiodit_1, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T321224: Wikidata Item Quality Model

2022-12-22 Thread diego
diego added a comment.


  I'm trying to implement a link-prediction task on Wikidata, to be used as 
proxy for claims coverage. I'm building on top of Goyal & Ferrara 
<https://arxiv.org/pdf/1705.02801.pdf>'s work. The existing libraries might 
require some tweaks to work on the full Wikidata Graph, but before addressing 
the scalability issues I want to test this approach on a small sample to see 
the suitability of this approach.

TASK DETAIL
  https://phabricator.wikimedia.org/T321224

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Isaac, diego
Cc: diego, Miriam, Isaac, Astuthiodit_1, karapayneWMDE, Invadibot, Ywats0ns, 
maantietaja, ItamarWMDE, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, 
GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, 
Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Lydia_Pintscher, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T307323: WMDE Machine Learning (ORES)

2022-06-17 Thread diego
diego added a subscriber: Lydia_Pintscher.
diego added a comment.


  Hey @DAbad, as part of this proposal 
<https://docs.google.com/document/d/1qAF7nJNAMw3yOwoKP2HuvkpuhwjaWs-BX9dSGRTkJc8/edit?usp=sharing>,
 I'm in conversations with @Lydia_Pintscher and @calbon  to develop new models 
for Wikidata that works directly on Liftwing.
  
  It would great to know in detail the requirements for the models mentioned in 
this ticket, and also whether you have labeled that we can use to train the 
algorithms.

TASK DETAIL
  https://phabricator.wikimedia.org/T307323

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: DAbad, diego
Cc: Lydia_Pintscher, diego, calbon, DAbad, SWakiyama, Aklapper, Astuthiodit_1, 
STH, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, 
Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, 
Scott_WUaS, Wikidata-bugs, aude, Shizhao, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2022-04-08 Thread diego
diego added a comment.


  **Updates**
  
  - We finished this project, results can be found on Meta 
<https://meta.wikimedia.org/wiki/Research:Identifying_Controversial_Content_in_Wikidata>,
  the code and models could be found in Gitlab 
<https://gitlab.wikimedia.org/repos/research/controveriesWikidata>.
  - I'll discuss future work with @Lydia_Pintscher.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Astuthiodit_1, 
karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, 
Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, 
_jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2022-04-08 Thread diego
diego closed this task as "Resolved".
diego updated the task description.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Astuthiodit_1, 
karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, 
Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, 
_jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2022-03-25 Thread diego
diego added a comment.


  **Updates**
  
  - I was comparing the results when adding anonymous edits, until now I 
haven't find major differences with the previous results. I'll continue working 
on this during the next week before my next meeting with Lydia.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Astuthiodit_1, 
karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, 
Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, 
_jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2022-03-18 Thread diego
diego added a comment.


  **Updates**
  
  - I've presented the main results of this work during the Tuesday Research 
Sessions, slides can be find here 
<https://docs.google.com/presentation/d/1JUqUqhlwPwCx6koy5t8oKiYC4flHNEUrpBMUkvt76xE/edit?usp=sharing>.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Astuthiodit_1, 
karapayneWMDE, Invadibot, maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, 
Gq86, GoranSMilovanovic, QZanden, LawExplorer, Avner, _jensen, rosalieper, 
Scott_WUaS, Wikidata-bugs, aude, Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2022-03-06 Thread diego
diego added a comment.


  **Updates**
  
  - We meet with Lydia and discussed the current results.
  - We reviewed the results confirming that most co-edited items corresponds to 
on going events, even when we change the time window to be considered.
  - Now, I'll be studying the relevance/prevalence of anonymous edits on 
popular content.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, karapayneWMDE, Invadibot, 
maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, 
QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, 
aude, Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2022-02-18 Thread diego
diego added a comment.


  **Updates**
  
  - No updates

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, karapayneWMDE, Invadibot, 
maantietaja, Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, 
QZanden, LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, 
aude, Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2022-02-11 Thread diego
diego added a comment.


  **Updates**
  
  - I'm working in identifying collaborative edits on wikidata items not 
related to current events.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2022-02-04 Thread diego
diego added a comment.


  **Updates**
  
  - No updates

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2022-01-21 Thread diego
diego added a comment.


  **Updates**
  
  - We are now focusing in understanding collaborations patterns: when/how more 
than user edits the same item in a given period of time.
- We found that in Wikidata such collaborations are less frequent than in 
other Wikimedia projects.
- We also found that items edited by more than one user are usually related 
to on going events (awards, deaths, releases)
  - I'll present some of these findings:
- On research meeting (Tuesday) in March
- And @Lydia_Pintscher  will propose a date probably in April to present 
these results to the Wikidata folks.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2022-01-16 Thread diego
diego added a comment.


  **Updates**
  
  - I'm organizing the new results to be discussed with the stakeholder.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2022-01-16 Thread diego
diego moved this task from FY2021-22-Research-Oct-Dec to 
FY2021-22-Research-Jan-March on the Research board.
diego edited projects, added Research (FY2021-22-Research-Jan-March); removed 
Research (FY2021-22-Research-Oct-Dec).

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

WORKBOARD
  https://phabricator.wikimedia.org/project/board/45/

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2022-01-07 Thread diego
diego added a comment.


  **Updates**
  
  - I'm focusing on modeling the relationship between topics and 
collaborations/controversies.
- I'm working on graph representation of these components

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2021-12-24 Thread diego
diego added a comment.


  **Updates**
  
  - We have seen that few items are edited by more than one user.
  - We are currently researching about the item and users characteristics 
related to collaborative  work.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2021-12-03 Thread diego
diego added a comment.


  **Updates**
  
  - No updates this week. I'm going to meet with the stakeholder next week.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2021-11-12 Thread diego
diego added a comment.


  **Updates**
  
  - I've been working on classifier to predict reverts.
- The current classifier uses article (item), revision and user information.
- On a balance test set, the actual model gets results over 70% of accuracy
- However, there is a set of caveats to be considered:
  - 'auto-reverts': users can revert themselves, this shouldn't be consider 
as signal of controversy. We need to analyze more this behavior.
  - power-users:  we need to take in account that a small set of users 
produces most of the edits and reverts, this behavior could affect our results. 
We are working on different sampling method to address this issue.
  - The meta page was updated with the results in Q1 and partial results in Q2.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2021-11-12 Thread diego
diego updated the task description.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2021-11-05 Thread diego
diego added a comment.


  **Updates**
  
  - Working on modeling the reverting behavior.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2021-10-29 Thread diego
diego added a comment.


  **Updates**
  
  No updates this week.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2021-10-25 Thread diego
diego added a comment.


  **Updates**
  
  - Preliminary results presented to our stakeholder.
  - Next weeeks we will be focusing a deeper understanding of reverting 
behavior.
  
  **TODO**
  
  - Update meta page (within the next 3 weeks)

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2021-10-08 Thread diego
diego added a comment.


  **Updates**
  
  We presented this work at the TTO'21 conference 
<https://truthandtrustonline.com/>. We received interesting feedback, including 
questions about the definition of controversial content. Some potential 
collaboration for a second round on this research were opened.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2021-10-08 Thread diego
diego moved this task from FY2021-22-Research-July-Sept to 
FY2021-22-Research-Oct-Dec on the Research board.
diego edited projects, added Research (FY2021-22-Research-Oct-Dec); removed 
Research (FY2021-22-Research-July-Sept).

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

WORKBOARD
  https://phabricator.wikimedia.org/project/board/45/

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2021-10-03 Thread diego
diego added a comment.


  **Updates**
  
  - I've started gathering and organizing the different results, to write a 
first report.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2021-09-24 Thread diego
diego added a comment.


  **updates**
  
  I've created a page 
<https://meta.wikimedia.org/wiki/Research:Identifying_Controversial_Content_in_Wikidata>
 on meta about this project. In the following weeks I'll be uploading some of 
the analysis and main results there.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2021-09-17 Thread diego
diego added a comment.


  **Updates**
  
  - I've been crunching data to study the "disputed by"  qualifier. The plan is 
to have some statistics on this and compare with the reverts behavior.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2021-09-10 Thread diego
diego added a comment.


  **Updates**
  
  No updates this week.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2021-09-03 Thread diego
diego added a comment.


  **Updates**
  
  - I've been running analysis on the predictability of reverts on Wikidata, 
including page, user and edit characteristics  such as the property and the 
action summary explained above.
  - Probably not surprising I've found that the user characteristics such as 
the "account age" (the difference between a given edit  and the user account 
creation) is the most related with the revert probability:
  - I've also noted that bots are less likely than humans to reverted, and that 
edits in new articles (items) are more likely to be reverted.
  - I'm now analyzing a set of properties that showed some correlation with 
reverts:
- P 9157
- P 97
- P 3602
- P 646
- P 3782
- P 183
- P 2860
- P 7902
- P 9339
- P 2671
  
  - Next steps is to model interactions between users, and also analyze the 
usage of the "disputed by"  qualifiers.
  
  F34631288: image.png <https://phabricator.wikimedia.org/F34631288>

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2021-08-27 Thread diego
diego added a comment.


  **Updates**
  
  - I'm focusing on reverted revisions.
  - Developed a methodology to characterize Wikidata edits according different 
dimensions, such as the property edited, the edit type (from edit summaries), 
and user characteristics.
  
  (popular edit types)
  F34618187: image.png <https://phabricator.wikimedia.org/F34618187>
  
  - Exploring the differences on reverts done/received by bots and humans.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2021-08-21 Thread diego
diego added a comment.


  No updates this week.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2021-08-03 Thread diego
diego added a comment.


  @Lydia_Pintscher , regarding your question about the number of users 
co-editing a Wikidata page, I found that for all edits to namespace 0,  in July 
2021, considering items that have at least one sitelink:
  
  - 84% of pages were edited just by one user.
  - 14% by two users , and the reminding 2% of pages, by more than 2 users.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2021-08-03 Thread diego
diego added a comment.


  As a very initial  exploration, we analyzed a subset of Wikidata items, 
categorized them by topic, and checked which of them received more **updates**, 
as proxy for conroversiality.
  
  More specifically,
  
  - We selected all the Wikidata items with sitelinks to enwiki.
  - We counted the number edits summaries containing the 
keyword:`wbsetclaim-update`
  - We found that claims related to Software and computing are the ones - 
proportionally - more updated within this subsabe
  
  F34574587: image.png <https://phabricator.wikimedia.org/F34574587>
  
  - We also found that most updated property is P31 
<https://phabricator.wikimedia.org/P31>
  
  F34574595: image.png <https://phabricator.wikimedia.org/F34574595>
  
  These last results are not normalized yet.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2021-08-03 Thread diego
diego triaged this task as "High" priority.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T287946: Identifying controversial content in Wikidata

2021-08-03 Thread diego
diego created this task.
diego added projects: Wikidata, Research (FY2021-22-Research-July-Sept).
Restricted Application added a subscriber: Aklapper.

TASK DESCRIPTION
  The aim of this project is to identify controversial content in Wikidata.
  
  Specifically we will develop the following tasks:
  
  [ ] Create and test different definitions of controversiality in Wikidata,
  [ ] Develop a model to early identify controversial content.

TASK DETAIL
  https://phabricator.wikimedia.org/T287946

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Pablo, leila, Lydia_Pintscher, diego, Aklapper, Invadibot, maantietaja, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T272192: Migrate to new Wikidata Analytics

2021-03-30 Thread diego
diego added a comment.


  I see. I was asking because we wrote these address on published papers, and 
those are immutable. But if is not possible, is not possible.

TASK DETAIL
  https://phabricator.wikimedia.org/T272192

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: GoranSMilovanovic, diego
Cc: diego, WMDE-leszek, Lea_Lacroix_WMDE, Lydia_Pintscher, GoranSMilovanovic, 
Aklapper, Invadibot, maantietaja, Akuckartz, Michael, Nandana, Lahi, Gq86, 
QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Mbch331
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] T272192: Migrate to new Wikidata Analytics

2021-03-30 Thread diego
diego added a comment.


  Would be possible to add redirects from the old urls to the new ones?

TASK DETAIL
  https://phabricator.wikimedia.org/T272192

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: GoranSMilovanovic, diego
Cc: diego, WMDE-leszek, Lea_Lacroix_WMDE, Lydia_Pintscher, GoranSMilovanovic, 
Aklapper, Invadibot, maantietaja, Akuckartz, Michael, Nandana, Lahi, Gq86, 
QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Mbch331
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] T204438: finding statements that need a reference

2021-01-06 Thread diego
diego added a comment.


  https://dl.acm.org/doi/abs/10.1145/3366424.3383571

TASK DETAIL
  https://phabricator.wikimedia.org/T204438

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: diego, Hjfocs, Nandana, GoranSMilovanovic, Aklapper, Lydia_Pintscher, 
Akuckartz, Dinadineke, DannyS712, tabish.shaikh91, Lahi, Gq86, Soteriaspace, 
Jayprakash12345, JakeTheDeveloper, QZanden, merbst, LawExplorer, _jensen, 
rosalieper, Scott_WUaS, Wikidata-bugs, aude, TheDJ, Mbch331
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] T90881: Framework for checking sources on Wikidata (Does the source actually say what we claim it says?)

2020-12-23 Thread diego
diego added a comment.


  Hi all
  
  This problem is called Natural Language Inference (NLI) also known as textual 
entitlement . It is a super hot problem now in the NLP community, but imho 
research is still far away from producing usable tools in the Wikipedia 
context. This also requires a lot of computational resources (GPUs) to train.
  
  Anyhow, I'm exploring if would be possible to create a usable API where you 
could send a claim and a document and the API  will tell the relation between 
those pieces (confirm, reject, no information). I think the algorithm won't 
work well with subtle issues (eg. the references is talking about the main 
topic of the item, but does not content the specific information about the 
claim), but could be able to catch if the document (reference) is completely 
unrelated.
  
  I'll keep you updated.

TASK DETAIL
  https://phabricator.wikimedia.org/T90881

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: maiarocg, diego
Cc: diego, Abbe98, Tamslo, Ricordisamoa, Liuxinyu970226, Lydia_Pintscher, 
Aklapper, Akuckartz, Dinadineke, DannyS712, Nandana, tabish.shaikh91, Lahi, 
Gq86, GoranSMilovanovic, Soteriaspace, Jayprakash12345, JakeTheDeveloper, 
QZanden, merbst, LawExplorer, Culex, _jensen, rosalieper, Scott_WUaS, 
Wikidata-bugs, aude, TheDJ, Mbch331
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] T155560: Linked fact checker

2020-09-24 Thread diego
diego added a comment.


  @leila I see some overlap although this task seems to be broader than the one 
I'm working on. Given that I don't see much documentation nor code about this 
task, I prefer to not take responsibility on this.

TASK DETAIL
  https://phabricator.wikimedia.org/T155560

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: diego, Cirdan, Capankajsmilyo, PokestarFan, Natalia, leila, Cervisiarius, 
Tarrow, Hjfocs, srijan, DarTar, GLCiampaglia, Tgr, Harej, Zppix, Jseddon, 
Basvb, Halfak, Aklapper, Akuckartz, darthmon_wmde, Nandana, Zambujo, Lahi, 
Gq86, GoranSMilovanovic, Fz-29, QZanden, LawExplorer, _jensen, rosalieper, 
Scott_WUaS, Wikidata-bugs, aude, Daniel_Mietchen, jayvdb, Ricordisamoa, He7d3r, 
Mbch331
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T215616: Improve interlingual links across wikis through Wikidata IDs

2019-02-21 Thread diego
diego added a comment.


  I think we are talking about three different things:
  
  i) page_id -> CurrentWikidataItem: this was my original request, and I think 
@JAllemandou 's script solves this issue. Having that table updated would be 
great.
  ii) revision_id-> CurrentWikidataItem: This can be obtained by joining the 
previous table with the revision table. Having that table pre-computed would 
save time and resources on joining, but we can also do the join just when is 
needed. 
  iii)revision_id ->HistoricalWikidataItem: I was not looking for that, 
although it would be very interesting information.

TASK DETAIL
  https://phabricator.wikimedia.org/T215616

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: diego
Cc: Isaac, Tbayer, jcrespo, EBernhardson, Halfak, Nuria, JAllemandou, diego, 
Nandana, Akovalyov, Banyek, AndyTan, Rayssa-, Lahi, Gq86, GoranSMilovanovic, 
QZanden, Marostegui, LawExplorer, Avner, Minhnv-2809, _jensen, Luke081515, 
Wikidata-bugs, aude, Capt_Swing, Dinoguy1000, Mbch331, Jay8g, Krenair, jeremyb
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T215616: Improve interlingual links across wikis through Wikidata IDs

2019-02-19 Thread diego
diego added a comment.
@JAllemandou , yes. Having this by revision would be great!TASK DETAILhttps://phabricator.wikimedia.org/T215616EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: diegoCc: Isaac, Tbayer, jcrespo, EBernhardson, Halfak, Nuria, JAllemandou, diego, Nandana, Akovalyov, Banyek, AndyTan, Rayssa-, Lahi, Gq86, GoranSMilovanovic, QZanden, Marostegui, LawExplorer, Avner, Minhnv-2809, _jensen, Luke081515, Wikidata-bugs, aude, Capt_Swing, Dinoguy1000, Mbch331, Jay8g, Krenair, jeremyb___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T215616: Improve interlingual links across wikis through Wikidata IDs

2019-02-11 Thread diego
diego added a comment.
@Tbayer , great. Thanks.TASK DETAILhttps://phabricator.wikimedia.org/T215616EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: diegoCc: Tbayer, jcrespo, EBernhardson, Halfak, Nuria, JAllemandou, diego, Nandana, Akovalyov, Banyek, AndyTan, Rayssa-, Lahi, Gq86, GoranSMilovanovic, QZanden, Marostegui, LawExplorer, Avner, Minhnv-2809, _jensen, Luke081515, Wikidata-bugs, aude, Capt_Swing, Dinoguy1000, Mbch331, Jay8g, Krenair, jeremyb___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T215616: Improve interlingual links across wikis through Wikidata IDs

2019-02-11 Thread diego
diego added a comment.
@jcrespo, the API works good for query specific pages/entities, not for example to know which pages that existing in X_wiki are missing on the Y_wiki. 
My point here it is that the wikidata identifier  is currently the main identifier for a page/concept, and that this fact is not reflected on the DB structure. I understand that this might be due historical reasons, but it would be good to think in a way that our DBs make easier to link content across wikis.TASK DETAILhttps://phabricator.wikimedia.org/T215616EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: diegoCc: jcrespo, EBernhardson, Halfak, Nuria, JAllemandou, diego, Nandana, Akovalyov, Banyek, AndyTan, Rayssa-, Lahi, Gq86, GoranSMilovanovic, QZanden, Marostegui, LawExplorer, Avner, Minhnv-2809, _jensen, Luke081515, Wikidata-bugs, aude, Capt_Swing, Dinoguy1000, Mbch331, Jay8g, Krenair, jeremyb___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T215616: Improve interlingual links across wikis through Wikidata IDs

2019-02-11 Thread diego
diego added a comment.
@EBernhardson , this looks exactly what I was looking for, initially.  Thank you very much for that.

However, I wont close this task, because wikibase_item is still missing the page_id information. Joining by page_title does not seems very 'healthy'. We should keep discussing how to solve that. ThanksTASK DETAILhttps://phabricator.wikimedia.org/T215616EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: diegoCc: EBernhardson, Halfak, Nuria, JAllemandou, diego, Nandana, Akovalyov, Banyek, AndyTan, Rayssa-, Lahi, Gq86, GoranSMilovanovic, QZanden, Marostegui, LawExplorer, Avner, Minhnv-2809, _jensen, Luke081515, Wikidata-bugs, aude, Capt_Swing, Dinoguy1000, Mbch331, Jay8g, Krenair, jeremyb___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T215616: Improve interlingual links across wikis through Wikidata IDs

2019-02-11 Thread diego
diego added a comment.
Looks good @JAllemandou, thanks.
This is a good workaround, but imho,  we should have an structure or schema that makes this kind of tasks easier, specially for people outside without access to a cluster.TASK DETAILhttps://phabricator.wikimedia.org/T215616EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: diegoCc: Nuria, JAllemandou, diego, Nandana, Akovalyov, Banyek, AndyTan, Rayssa-, Lahi, Gq86, GoranSMilovanovic, QZanden, Marostegui, LawExplorer, Avner, Minhnv-2809, _jensen, Luke081515, Wikidata-bugs, aude, Capt_Swing, Dinoguy1000, Mbch331, Jay8g, Krenair, jeremyb___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Updated] T215616: Improve interlingual links across wikis through Wikidata IDs

2019-02-11 Thread diego
diego added a project: Wikidata.
TASK DETAILhttps://phabricator.wikimedia.org/T215616EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: diegoCc: Nuria, JAllemandou, diego, Nandana, Akovalyov, AndyTan, Lahi, Gq86, GoranSMilovanovic, QZanden, Marostegui, LawExplorer, Avner, Minhnv-2809, _jensen, Luke081515, Wikidata-bugs, aude, Capt_Swing, Mbch331, Jay8g, Krenair, jeremyb___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T182849: Identify unhelpful file names on commons

2019-02-07 Thread diego
diego added a comment.
Hi @chelsyx ,

Check this notebook, apparently the number of white spaces are a pretty good indicator of the filename quality.TASK DETAILhttps://phabricator.wikimedia.org/T182849EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: chelsyx, diegoCc: diego, Base, Liuxinyu970226, thiemowmde, Aklapper, Abit, Ramsey-WMF, mpopov, chelsyx, Nandana, JKSTNK, Lahi, PDrouin-WMF, Gq86, E1presidente, Cparle, Anooprao, SandraF_WMF, GoranSMilovanovic, QZanden, Tramullas, Acer, LawExplorer, Silverfish, _jensen, Susannaanas, Jane023, Wikidata-bugs, matthiasmullie, aude, Ricordisamoa, Wesalius, Lydia_Pintscher, Fabrice_Florin, Raymond, Steinsplitter, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T178249: Parameter for linking a new page to the Wikidata

2018-09-26 Thread diego
diego added a comment.
Hi,

Kateryna is  working on this: https://meta.wikimedia.org/wiki/Research:Matching_Red_Links_with_Wikidata_Items

Please ping or write something in the discussion page if you want to know more about that projecy.TASK DETAILhttps://phabricator.wikimedia.org/T178249EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: diegoCc: diego, IKhitron, SerDIDG, putnik, Aklapper, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs