Addshore closed this task as "Resolved".
Addshore claimed this task.
Addshore added a comment.
Restricted Application added a project: User-Addshore.


  Running from /user/joal/wmf/data/wmf/mediawiki/wikidata_parquet/20190204
  
  |                                | Strings       | Unique Strings | One occurrence strings |
  | ------------------------------ | ------------- | -------------- | ---------------------- |
  | Labels, Descriptions & Aliases | 1,996,735,054 | 106,498,268    | 71,574,791             |
  | Labels & Aliases               | 342,328,983   | 89,450,632     | 57,859,429             |
  | Labels                         | 279,406,857   | 77,753,053     | 48,276,975             |
  | Descriptions                   | 1,654,406,071 | 17,180,878     | 13,867,657             |
  | Aliases                        | 62,922,126    | 13,765,857     | 11,584,411             |
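
  As a quick sanity check on the Strings column: the per-type totals should add up to the combined rows, while the Unique Strings column should not, since the same string can appear as a label, a description, and an alias. A minimal sketch of that arithmetic, using only the figures reported in this comment (the variable names are made up for illustration):

```python
# Total string counts per term type, copied from the table in this comment.
strings = {
    "labels": 279_406_857,
    "descriptions": 1_654_406_071,
    "aliases": 62_922_126,
}

# Combined rows as reported in the table.
reported_labels_aliases = 342_328_983
reported_all_terms = 1_996_735_054

# Total counts are additive across term types.
labels_aliases = strings["labels"] + strings["aliases"]
all_terms = labels_aliases + strings["descriptions"]

print(labels_aliases == reported_labels_aliases)
print(all_terms == reported_all_terms)

# Share of all strings that are distinct (unique / total) for the combined row.
unique_share = 106_498_268 / reported_all_terms
print(f"unique share: {unique_share:.4f}")
```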
  
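  Raw results from a notebook with P8168 <https://phabricator.wikimedia.org/P8168> (LAD = Labels, Descriptions & Aliases; LA = Labels & Aliases; L = Labels; D = Descriptions; A = Aliases):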
  
    -------------------LAD----------------------------------
    Total bytes for strings:     55403285421
    Total duplicate bytes for strings:     50665680015
    Useful bytes for strings:      4737605406
    Total strings:      1996735054
    Total unique strings:       106498268
    Total one occurrence strings:        71574791
    -----------------------------------------------------
    ----------------------LA-------------------------------
    Total bytes for strings:      8602346166
    Total duplicate bytes for strings:      4564380973
    Useful bytes for strings:      4037965193
    Total strings:       342328983
    Total unique strings:        89450632
    Total one occurrence strings:        57859429
    -----------------------------------------------------
    ------------------------L-----------------------------
    Total bytes for strings:      7764983650
    Total duplicate bytes for strings:      4023237872
    Useful bytes for strings:      3741745778
    Total strings:       279406857
    Total unique strings:        77753053
    Total one occurrence strings:        48276975
    -----------------------------------------------------
    -------------------------D----------------------------
    Total bytes for strings:     46800939255
    Total duplicate bytes for strings:     46098575317
    Useful bytes for strings:       702363938
    Total strings:      1654406071
    Total unique strings:        17180878
    Total one occurrence strings:        13867657
    -----------------------------------------------------
    ------------------------A-----------------------------
    Total bytes for strings:       837362516
    Total duplicate bytes for strings:       506426114
    Useful bytes for strings:       330936402
    Total strings:        62922126
    Total unique strings:        13765857
    Total one occurrence strings:        11584411
    -----------------------------------------------------
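
  For readers who want to reproduce figures of this shape on a small sample, here is a minimal sketch in plain Python. It is not the notebook from P8168. It assumes that "useful bytes" means the bytes needed to store one copy of each distinct string and that "duplicate bytes" is everything beyond that, which matches the relation total = useful + duplicate seen in every block above. Counting bytes as UTF-8 and the name `string_stats` are also assumptions.

```python
from collections import Counter


def string_stats(strings):
    """Summarise duplication in an iterable of strings.

    Assumption: byte sizes are measured on the UTF-8 encoding, and
    "useful" bytes are the bytes of one copy of each distinct string.
    """
    counts = Counter(strings)

    # Bytes for every occurrence, duplicates included.
    total_bytes = sum(len(s.encode("utf-8")) * n for s, n in counts.items())
    # Bytes for a single copy of each distinct string.
    useful_bytes = sum(len(s.encode("utf-8")) for s in counts)

    return {
        "total_bytes": total_bytes,
        "duplicate_bytes": total_bytes - useful_bytes,
        "useful_bytes": useful_bytes,
        "total_strings": sum(counts.values()),
        "unique_strings": len(counts),
        "one_occurrence_strings": sum(1 for n in counts.values() if n == 1),
    }


if __name__ == "__main__":
    sample = ["cat", "cat", "dog", "é"]
    for name, value in string_stats(sample).items():
        print(f"{name}: {value}")
```

  On the real data the same counts would need a distributed group-by rather than an in-memory Counter; the sketch only illustrates what each reported line measures under the assumptions stated above.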

TASK DETAIL
  https://phabricator.wikimedia.org/T217821

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Addshore
Cc: JAllemandou, Aklapper, Addshore, alaa_wmde, Nandana, Lahi, Gq86, 
GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Wikidata-bugs, 
aude, Mbch331
_______________________________________________
Wikidata-bugs mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
