https://bz.apache.org/bugzilla/show_bug.cgi?id=60952
Bug ID: 60952
Summary: Figure out how to map font to runs/text pieces in Word 6.0 files
Product: POI
Version: 3.16-dev
Hardware: PC
Status: NEW
Severity: enhancement
Priority: P2
Component: HWPF
Assignee: [email protected]
Reporter: [email protected]
Target Milestone: ---
Created attachment 34897
--> https://bz.apache.org/bugzilla/attachment.cgi?id=34897&action=edit
triggering file
In bug 50955, I found that the hack of using the first non-default/non-symbol
font in the font table in Word 6.0 files worked fairly well. There was one
file out of ~1300 for which this failed.
I'm attaching that file. The issue in this file is that cp1257 comes before
cp1251 in the font table, so the hack picks cp1257 when the text is cp1251.
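For illustration, here is a minimal sketch of the first-match heuristic and how font table ordering defeats it. This is not POI's actual code: FontEntry, guessCodePage, and the font names are hypothetical, and skipping the ANSI charset (0) along with the default (1) and symbol (2) charsets is an assumption about what "non-default" means. The numeric charset IDs are the standard Windows values.

```java
import java.util.Arrays;
import java.util.List;

public class FontTableCharsetGuess {

    // Standard Windows charset identifiers.
    static final int ANSI_CHARSET = 0;
    static final int DEFAULT_CHARSET = 1;
    static final int SYMBOL_CHARSET = 2;
    static final int BALTIC_CHARSET = 186;   // cp1257
    static final int RUSSIAN_CHARSET = 204;  // cp1251

    /** Hypothetical stand-in for one font table entry; not a POI class. */
    static final class FontEntry {
        final String name;
        final int charset;

        FontEntry(String name, int charset) {
            this.name = name;
            this.charset = charset;
        }
    }

    /** Maps the two charsets from this report to Java charset names. */
    static String codePageFor(int charset) {
        switch (charset) {
            case BALTIC_CHARSET:
                return "windows-1257";
            case RUSSIAN_CHARSET:
                return "windows-1251";
            default:
                return "windows-1252";
        }
    }

    /**
     * Returns the code page of the first font whose charset is not
     * ANSI, default, or symbol (assumed reading of "non-default"),
     * falling back to windows-1252 when no such font exists.
     */
    static String guessCodePage(List<FontEntry> fontTable) {
        for (FontEntry font : fontTable) {
            boolean skipped = font.charset == ANSI_CHARSET
                    || font.charset == DEFAULT_CHARSET
                    || font.charset == SYMBOL_CHARSET;
            if (!skipped) {
                return codePageFor(font.charset);
            }
        }
        return "windows-1252";
    }

    public static void main(String[] args) {
        // Hypothetical font table ordered like the attached file:
        // the Baltic font is listed before the Cyrillic one.
        List<FontEntry> table = Arrays.asList(
                new FontEntry("Times New Roman", ANSI_CHARSET),
                new FontEntry("Symbol", SYMBOL_CHARSET),
                new FontEntry("Times New Roman Baltic", BALTIC_CHARSET),
                new FontEntry("Times New Roman Cyr", RUSSIAN_CHARSET));
        System.out.println(guessCodePage(table)); // prints "windows-1257"
    }
}
```

With this ordering the sketch returns windows-1257 even though the text is cp1251. Nothing in the font table order alone indicates which entry the text pieces actually use.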
I wasn't able to figure out how to 1) determine that cp1251 should be the
default or 2) map the font encodings to runs/text pieces.
The test file comes from Common Crawl.