GoranSMilovanovic added a comment.

  My initial observations - **please comment:**
  
  From the wmf.mediawiki_history 
<https://wikitech.wikimedia.org/wiki/Analytics/Data_Lake/Edits/Mediawiki_history>
 table (Data Lake, Hadoop):
  
    select page_title, event_comment from wmf.mediawiki_history where 
event_entity='revision' and event_type='create' and wiki_db='wikidatawiki' and 
snapshot='2019-11' and event_comment rlike 'wbsetclaim-update' limit 10;
  
  we find:
  
    page_title      event_comment
    Q23895474       /* wbsetclaim-update:2||1 */ [[Property:P150]]: 
[[Q18805608]]
    Q508568 /* wbsetclaim-update:2||1|1 */ [[Property:P106]]: [[Q774306]]
    Q3505042        /* wbsetclaim-update:2||1 */ [[Property:P279]]: 
[[Q16911701]]
    Q152362 /* wbsetclaim-update:2||1 */ [[Property:P18]]: Mihai Răzvan 
Ungureanu 2013-11-23.jpg
    Q3418516        /* wbsetclaim-update:2||1|2 */ [[Property:P1006]]: 072590327
    Q5187   /* wbsetclaim-update:2||1|1 */ [[Property:P150]]: [[Q1026761]]
    Q4531589        /* wbsetclaim-update:2||1|1 */ [[Property:P570]]: 12 
January 2018
    Q4071572        /* wbsetclaim-update:2||1 */ [[Property:P166]]: [[Q791135]]
    Q32236148       /* wbsetclaim-update:2||1 */ [[Property:P625]]: 54°41'36"N, 
129°4'6"E
  
  so yes, the method proposed by @WMDE-leszek is helpful to recognize when the 
value of a statement in some particular entity changes + we can also get to 
`revision_id`, `event_timestamp`, and `event_user_id` from this table.
  
  Proposed approach (corrected):
  (1) fetch a relevant revision (i.e. the one having `wbsetclaim-update` in the 
`event_comment` field),
  (2) use the API to collect the JSON representation of the revised entity by 
`revision-id`,
  (3) look at all (or some, constrained by some time frame?) subsequent 
revisions of the same entity,
  (4) use the API to collect the JSON representations from these subsequent 
revisions;
  (5) compare the JSON representations to see if the change in the value of the 
statement was followed by a change in the reference(s) of the respective 
statement or not.
  
  **Please comment**.

TASK DETAIL
  https://phabricator.wikimedia.org/T240466

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: GoranSMilovanovic
Cc: Aklapper, Addshore, Jan_Dittrich, hoo, rosalieper, noarave, Tarrow, 
Lydia_Pintscher, GoranSMilovanovic, WMDE-leszek, Sarai-WMDE, darthmon_wmde, 
DannyS712, Nandana, Lahi, Gq86, QZanden, LawExplorer, _jensen, Scott_WUaS, 
Wikidata-bugs, aude, Mbch331
_______________________________________________
Wikidata-bugs mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs

Reply via email to