Hi,

Yes, reducing logging is the way. I don't know if there are more.

I'd also be interested in the "unionsq" file, I wonder if this is a false positive. This happens because "uniNNNN" is a valid glyph name. There is unionsqdisplay and unionsqtext too, but not unionsq.

Tilman

On 02.08.2023 11:20, Brangs, Erik wrote:
Hi,

we're using PDFBox 3.0.0-beta1 to extract text from PDFs. This produces lots of 
warnings about missing unicode mappings. Is there a programmatic way to 
suppress those messages or would it be better to configure the logging to do 
that?

If it's better to configure logging, I would try to configure the logging level 
for PDSimpleFont, PDType0Font, PDFont and GlyphList. Are those all relevant 
loggers or are there any more?

For GlyphList, the most common warning is "Not a number in Unicode character name: 
unionsq". I also saw a warning "Not a number in Unicode character name: users" but 
only for one PDF.


Mit freundlichen Grüßen
Erik Brangs
*** Suchen. Finden. Entdecken. Deutsche Nationalbibliothek ***



---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: users-h...@pdfbox.apache.org

Reply via email to