dcausse added a comment.
I believe that at first we are interested in knowing the number of triples that would be moved out if all item that verifies the condition: `?s wdt:P31 Q13442814` are moved out with all the triples //belonging// to these items. The triples that belongs to an entity (e.g. Q1895685) are the ones visible via https://www.wikidata.org/wiki/Special:EntityData/Q1895685.ttl?flavor=dump with the additional complexity of shared values and references that have to be treated separately because they might be shared by other entities. In here you'll notice that for instance the triple `s:Q1895685-9a482323-4d57-acf2-b6b7-bc36d578bd57 ps:P478 "171"` does not reference ha the QID of the paper but this triple must be counted as well. This makes knowing the triples that belongs to `?s` a bit tricky but we could leverage the structure of the wikibase_rdf table for this: - first count the number of triples that are not shared with other entities using the `context` column, this column (not a data available in WDQS can help to group the triples by entity) - count the number of triples attached to shared references and values, here we should also count the ones that are shared between //S// and //not S// because these will have to be duplicated in both graph Working with the wikibase_rdf table is not entirely obvious so I started a short notebook to demonstrate how to use it, please see https://people.wikimedia.org/~dcausse/wikibase_rdf_scholarly_articles_example.html. I'd be happy to jump in a meeting to better explain this if this helps. TASK DETAIL https://phabricator.wikimedia.org/T342111 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: AndrewTavis_WMDE, dcausse Cc: Lydia_Pintscher, dcausse, Gehel, dr0ptp4kt, AndrewTavis_WMDE, Aklapper, Manuel, Danny_Benjafield_WMDE, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
_______________________________________________ Wikidata-bugs mailing list -- [email protected] To unsubscribe send an email to [email protected]
