GoranSMilovanovic added a comment.
My initial observations - **please comment:**

From the wmf.mediawiki_history <https://wikitech.wikimedia.org/wiki/Analytics/Data_Lake/Edits/Mediawiki_history> table (Data Lake, Hadoop):

    select page_title, event_comment
    from wmf.mediawiki_history
    where event_entity = 'revision'
      and event_type = 'create'
      and wiki_db = 'wikidatawiki'
      and snapshot = '2019-11'
      and event_comment rlike 'wbsetclaim-update'
    limit 10;

we find:

    page_title  event_comment
    Q23895474   /* wbsetclaim-update:2||1 */ [[Property:P150]]: [[Q18805608]]
    Q508568     /* wbsetclaim-update:2||1|1 */ [[Property:P106]]: [[Q774306]]
    Q3505042    /* wbsetclaim-update:2||1 */ [[Property:P279]]: [[Q16911701]]
    Q152362     /* wbsetclaim-update:2||1 */ [[Property:P18]]: Mihai Răzvan Ungureanu 2013-11-23.jpg
    Q3418516    /* wbsetclaim-update:2||1|2 */ [[Property:P1006]]: 072590327
    Q5187       /* wbsetclaim-update:2||1|1 */ [[Property:P150]]: [[Q1026761]]
    Q4531589    /* wbsetclaim-update:2||1|1 */ [[Property:P570]]: 12 January 2018
    Q4071572    /* wbsetclaim-update:2||1 */ [[Property:P166]]: [[Q791135]]
    Q32236148   /* wbsetclaim-update:2||1 */ [[Property:P625]]: 54°41'36"N, 129°4'6"E

So yes, the method proposed by @WMDE-leszek helps to recognize when the value of a statement in a particular entity changes. We can also get `revision_id`, `event_timestamp`, and `event_user_id` from this table.

Proposed approach (corrected):

(1) fetch a relevant revision (i.e. one having `wbsetclaim-update` in the `event_comment` field);
(2) use the API to collect the JSON representation of the revised entity by `revision_id`;
(3) look at all (or some, constrained by some time frame?) subsequent revisions of the same entity;
(4) use the API to collect the JSON representations from these subsequent revisions;
(5) compare the JSON representations to see whether the change in the value of the statement was followed by a change in the reference(s) of the respective statement or not.

**Please comment**.
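To make steps (1), (2) and (5) more concrete, here is a rough Python sketch, not a tested pipeline. It rests on assumptions that are not part of the observations above: that the entity JSON for a given revision can be fetched from `Special:EntityData` with a `revision` parameter, that the entity JSON follows the usual Wikibase layout (`claims` keyed by property, each statement carrying an `id` and an optional `references` list whose items have a `hash`), and that comparing reference hashes per statement is a good enough notion of "the references changed". All function names are made up for illustration.

```python
import json
import re
import urllib.request

# Assumption: Special:EntityData serves the entity JSON of a given revision.
ENTITY_DATA_URL = (
    "https://www.wikidata.org/wiki/Special:EntityData/{qid}.json?revision={rev}"
)

# Matches comments like: /* wbsetclaim-update:2||1 */ [[Property:P150]]: ...
COMMENT_RE = re.compile(
    r"/\* wbsetclaim-update:[^*]*\*/ \[\[Property:(P\d+)\]\]"
)


def property_from_comment(event_comment):
    """Step (1): return the property ID named in the comment, or None."""
    match = COMMENT_RE.search(event_comment)
    return match.group(1) if match else None


def fetch_entity(qid, revision_id):
    """Steps (2) and (4): download the entity JSON as of one revision.

    Performs a network request; assumes the entity is not a redirect,
    so that it appears under its own ID in the response.
    """
    url = ENTITY_DATA_URL.format(qid=qid, rev=revision_id)
    request = urllib.request.Request(
        url, headers={"User-Agent": "reference-check-sketch/0.1"}
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)["entities"][qid]


def reference_hashes(entity, prop):
    """Map statement ID -> set of reference hashes for one property."""
    return {
        statement["id"]: {ref["hash"] for ref in statement.get("references", [])}
        for statement in entity.get("claims", {}).get(prop, [])
    }


def references_changed(before, after, prop):
    """Step (5): IDs of statements present in both revisions whose
    sets of reference hashes differ."""
    old = reference_hashes(before, prop)
    new = reference_hashes(after, prop)
    return sorted(sid for sid in old.keys() & new.keys() if old[sid] != new[sid])
```

For one row of the query result, one would call `property_from_comment(event_comment)`, fetch the entity at that `revision_id` and at each later revision with `fetch_entity`, and pass the pairs to `references_changed`. Statements that were added or removed between the two revisions are ignored here; whether that is the right choice depends on how we decide to treat step (3).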
TASK DETAIL
  https://phabricator.wikimedia.org/T240466

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: GoranSMilovanovic
Cc: Aklapper, Addshore, Jan_Dittrich, hoo, rosalieper, noarave, Tarrow, Lydia_Pintscher, GoranSMilovanovic, WMDE-leszek, Sarai-WMDE, darthmon_wmde, DannyS712, Nandana, Lahi, Gq86, QZanden, LawExplorer, _jensen, Scott_WUaS, Wikidata-bugs, aude, Mbch331
_______________________________________________
Wikidata-bugs mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
