dcausse added a comment.

  I believe that at first we are interested in knowing the number of triples 
that would be moved out if all item that verifies the condition: `?s wdt:P31 
Q13442814` are moved out with all the triples //belonging// to these items.
  The triples that belongs to an entity (e.g. Q1895685) are the ones visible 
via https://www.wikidata.org/wiki/Special:EntityData/Q1895685.ttl?flavor=dump 
with the additional complexity of shared values and references that have to be 
treated separately because they might be shared by other entities.
  In here you'll notice that for instance the triple 
`s:Q1895685-9a482323-4d57-acf2-b6b7-bc36d578bd57 ps:P478 "171"` does not 
reference ha the QID of the paper but this triple must be counted as well. 
  This makes knowing the triples that belongs to `?s` a bit tricky but we could 
leverage the structure of the wikibase_rdf table for this:
  
  - first count the number of triples that are not shared with other entities 
using the `context` column, this column (not a data available in WDQS can help 
to group the triples by entity)
  - count the number of triples attached to shared references and values, here 
we should also count the ones that are shared between //S// and //not S// 
because these will have to be duplicated in both graph
  
  Working with the wikibase_rdf table is not entirely obvious so I started a 
short notebook to demonstrate how to use it, please see 
https://people.wikimedia.org/~dcausse/wikibase_rdf_scholarly_articles_example.html.
 I'd be happy to jump in a meeting to better explain this if this helps.

TASK DETAIL
  https://phabricator.wikimedia.org/T342111

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: AndrewTavis_WMDE, dcausse
Cc: Lydia_Pintscher, dcausse, Gehel, dr0ptp4kt, AndrewTavis_WMDE, Aklapper, 
Manuel, Danny_Benjafield_WMDE, Astuthiodit_1, karapayneWMDE, Invadibot, 
maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, 
QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Mbch331
_______________________________________________
Wikidata-bugs mailing list -- [email protected]
To unsubscribe send an email to [email protected]

Reply via email to