Addshore created this task.
Addshore added a subscriber: Addshore.
Addshore added projects: Wikidata, WMDE-Analytics-Engineering.
Herald added subscribers: StudiesWorld, Aklapper.

TASK DESCRIPTION
  Notes from discussion with Lydia!
  
    - **Number of entites by entity type**
      - Can probably be done just counting pages in a namespace
      - Count properties in SPARQL below, (all the rest are items)
      - SELECT (count(?cs) as ?count) WHERE { ?cs a wikibase:Property }  
    - **Number of redirects by entity type**
      - As above redirects are recorded in the page table and we can filter by 
NS
      - look for ?x owl:sameAs ?y in SPARQL for redirects
    - **Number of Item talk pages**
    - **Average blob size of items**
      - Easy db query
    - **Max blob size of items**
      - Easy db query
    - **Number of properties by datatype**
      - Use the wb_property_info table
  
    - **Number of statements by data type**
      - Could probably run a sparql query for each property?
  
    - **Number of ranks by type?**
      - Can be done in SPARQL
      - SELECT (count(distinct(?s)) AS ?scount) WHERE {?s wikibase:rank 
wikibase:PreferredRank}
      - For normal rank we just asume ( statement count - preferd rank - 
deprecated rank = normal rank )
  
    - **Number of uses of Novalue and SomeValue in mainsnaks?**
  
    - **Number of qualifiers**
      - ??????????????????????????????????????????
  
    - **References to Wikipedia**
      - SELECT (count(distinct(?s)) AS ?scount) WHERE {?s prov:wasDerivedFrom 
wdref:004ec6fbee857649acdbdbad4f97b2c8571df97b}
    - **Referenced statements**
      - ??????????????????????????????????????????
  
    - **Number of labels, descriptions & aliases per lang**
      - can be done in the wb_terms table
    - **Number of sitelinks per site**
      - Can be done using wb_items_per_site
  
  **More complex / long? :/**
  
    - **Count grouped by Number of statements per item**
    - **Count grouped by Number of labels, descriptions & aliases per item**
      - Should be possible through the wb_terms table
    - **Count grouped by Number of site links per item**
      - Should be possible through the wb_items_per_site table
    - **Count grouped by Number of qualifiers per statement?**

TASK DETAIL
  https://phabricator.wikimedia.org/T119182

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Addshore
Cc: Aklapper, Addshore, StudiesWorld, Wikidata-bugs, aude, Mbch331



_______________________________________________
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs

Reply via email to