Hi, Yes, reducing logging is the way. I don't know if there are more.
I'd also be interested in the "unionsq" file, I wonder if this is a false positive. This happens because "uniNNNN" is a valid glyph name. There is unionsqdisplay and unionsqtext too, but not unionsq.
Tilman On 02.08.2023 11:20, Brangs, Erik wrote:
Hi, we're using PDFBox 3.0.0-beta1 to extract text from PDFs. This produces lots of warnings about missing unicode mappings. Is there a programmatic way to suppress those messages or would it be better to configure the logging to do that? If it's better to configure logging, I would try to configure the logging level for PDSimpleFont, PDType0Font, PDFont and GlyphList. Are those all relevant loggers or are there any more? For GlyphList, the most common warning is "Not a number in Unicode character name: unionsq". I also saw a warning "Not a number in Unicode character name: users" but only for one PDF. Mit freundlichen Grüßen Erik Brangs *** Suchen. Finden. Entdecken. Deutsche Nationalbibliothek ***
--------------------------------------------------------------------- To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org For additional commands, e-mail: users-h...@pdfbox.apache.org