Hi, when doing text extraction, we sometimes get the warning "[CmapSubtable] Format 14 cmap table is not supported and will be ignored". The code in org.apache.fontbox.ttf.CmapSubtable mentions that this format is for "Unicode Variation Sequences". A comment in the source code links to a blog entry (which now seems to be available at https://ccjktype.fonts.adobe.com/2013/05/opentype-cmap-table-ramblings.html instead).
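In case it helps to narrow down which fonts trigger the warning, here is a minimal sketch of how the cmap subtable formats of a font file could be listed. It is plain Java and does not use any FontBox API; the class name CmapFormatLister and the method listCmapFormats are made up for illustration. It assumes a single standalone TrueType/OpenType font (no TTC collection, no WOFF wrapper) and does no bounds checking, so it is only meant for quick inspection.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; not part of PDFBox or FontBox.
final class CmapFormatLister {

    /**
     * Returns the format number of every subtable referenced by the
     * encoding records of the font's cmap table, in record order.
     * Duplicates are possible because several encoding records may
     * point at the same subtable.
     */
    static List<Integer> listCmapFormats(byte[] font) {
        // ByteBuffer is big-endian by default, which matches OpenType.
        ByteBuffer buf = ByteBuffer.wrap(font);
        List<Integer> formats = new ArrayList<>();

        // Font header: sfntVersion (4 bytes), numTables (2 bytes), 6 more bytes.
        int numTables = buf.getShort(4) & 0xFFFF;

        for (int i = 0; i < numTables; i++) {
            // Each table record is 16 bytes: tag, checksum, offset, length.
            int record = 12 + 16 * i;
            String tag = new String(font, record, 4, StandardCharsets.US_ASCII);
            if (!"cmap".equals(tag)) {
                continue;
            }
            int cmapOffset = buf.getInt(record + 8);

            // cmap header: version (2 bytes), numTables (2 bytes).
            int numSubtables = buf.getShort(cmapOffset + 2) & 0xFFFF;
            for (int j = 0; j < numSubtables; j++) {
                // Encoding record: platformID, encodingID, 32-bit offset
                // measured from the start of the cmap table.
                int encodingRecord = cmapOffset + 4 + 8 * j;
                int subtableOffset = cmapOffset + buf.getInt(encodingRecord + 4);
                // The first uint16 of every subtable is its format number.
                formats.add(buf.getShort(subtableOffset) & 0xFFFF);
            }
        }
        return formats;
    }
}
```

Feeding it the bytes of an extracted font file (for example via java.nio.file.Files.readAllBytes) and checking whether the returned list contains 14 should show whether that font has a Unicode Variation Sequences subtable.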
I was not able to find anything about format 14 cmap tables in the PDFBox issue tracker. Is support for those kinds of tables planned for future versions of PDFBox?

Kind regards,
Erik Brangs
Deutsche Nationalbibliothek