https://bugs.documentfoundation.org/show_bug.cgi?id=161941

--- Comment #3 from Buovjaga <[email protected]> ---
Created attachment 195300
  --> https://bugs.documentfoundation.org/attachment.cgi?id=195300&action=edit
Font statistics from crashtesting document corpus

Downloaded docx, odg, odp, ods, odt, pptx and xlsx files from bugtrackers and
forums directories of the crashtesting corpus.

Total number of documents I investigated:
$ find . -type f \( -name '*.docx' -o -name '*.odg' -o -name '*.odp' -o -name
'*.ods' -o -name '*.odt' -o -name '*.pptx' -o -name '*.xlsx' \) | wc -l
607790

Xlsx files dominate, their share being 500633.

I unzipped all of them in this style:

find . -name '*.xlsx' | while read file; do
    unzip -o "$file" -d "${file%.*}"
done

Some of the xml files had permission issues, so I had to fix them with

sudo find . -name '*.xml' -exec chmod 644 {} \;

I got the usage stats by using ripgrep and filtering out duplicates with sort
and uniq:

rg --files-with-matches 'Agency FB' . -g '*.xml' | cut --delimiter "/"
--fields=2,3,4 | sort | uniq -u > agency_fb.txt

Agency FB, 246 files
Baskerville Old Face, 303 files
Berlin Sans FB, 319 files
Cooper Black, 105 files
Lucida Calligraphy, 88 files
Lucida Grande, 201 files
Lucida Handwriting, 103 files

I guess that's not nothing, but seems to indicate that the chances of running
into these fonts in the wild is pretty low.

It would be interesting to hear, if the clients of companies have signalled a
need for replacement fonts or what other motivators there could be for shipping
them.

-- 
You are receiving this mail because:
You are the assignee for the bug.

Reply via email to