Re: Errors on master after upgrading to 4.10.3

2016-02-17 Thread Joseph Hagerty
Ahh, makes sense. I did have a feeling I was barking up the wrong tree since it's an Extraction issue, but I thought I'd throw it out there, anyway. Thanks so much for the information!
On Wed, Feb 17, 2016 at 4:49 PM, Rachel Lynn Underwood <r.lynn.underw...@gmail.com> wrote:
> This is an error

Re: Errors on master after upgrading to 4.10.3

2016-02-17 Thread Rachel Lynn Underwood
This is an error being thrown by Apache PDFBox/Tika. You're seeing it now because Solr 4.x uses a different Tika version than Solr 3.x. It looks like this error is thrown when you parse a PDF with Tika, and a font in that PDF doesn't have a ToUnicode mapping.

Re: Errors on master after upgrading to 4.10.3

2016-02-16 Thread Joseph Hagerty
Does literally nobody else see this error in their logs? I see this error hundreds of times per day, in occasional bursts. Should I file this as a bug?
On Mon, Feb 15, 2016 at 4:56 PM, Joseph Hagerty wrote:
> After migrating from 3.5 to 4.10.3, I'm seeing the following error

Errors on master after upgrading to 4.10.3

2016-02-15 Thread Joseph Hagerty
After migrating from 3.5 to 4.10.3, I'm seeing the following error with alarming regularity in the master's error log:
    2/15/2016, 4:32:22 PM ERROR PDSimpleFont Can't determine the width of the space character using 250 as default
I can't seem to glean much information about this one from the web.
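For anyone wanting to check whether these errors really arrive in bursts, here is a rough Python sketch that tallies the PDSimpleFont errors per hour. It assumes every log line has the same shape as the one quoted above (timestamp, then ERROR, then PDSimpleFont); the function name count_by_hour and all sample lines other than the first are made up for illustration, and a real log may be formatted differently.

```python
import re
from collections import Counter
from datetime import datetime

# Assumed line shape, modeled on the single log line quoted in the thread:
#   2/15/2016, 4:32:22 PM ERROR PDSimpleFont Can't determine the width ...
LINE_RE = re.compile(
    r"^(?P<ts>\d{1,2}/\d{1,2}/\d{4}, \d{1,2}:\d{2}:\d{2} [AP]M)"
    r"\s+ERROR\s+PDSimpleFont\b"
)


def count_by_hour(lines):
    """Count PDSimpleFont ERROR lines, grouped by the hour they were logged."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line.strip())
        if not match:
            continue  # some other logger or level; ignore it
        stamp = datetime.strptime(match.group("ts"), "%m/%d/%Y, %I:%M:%S %p")
        counts[stamp.strftime("%Y-%m-%d %H:00")] += 1
    return counts


if __name__ == "__main__":
    # First line is copied from the thread; the others are hypothetical.
    sample = [
        "2/15/2016, 4:32:22 PM ERROR PDSimpleFont Can't determine the width "
        "of the space character using 250 as default",
        "2/15/2016, 4:32:23 PM ERROR PDSimpleFont Can't determine the width "
        "of the space character using 250 as default",
        "2/15/2016, 5:01:03 PM ERROR PDSimpleFont Can't determine the width "
        "of the space character using 250 as default",
        "2/15/2016, 5:02:00 PM INFO SomeOtherLogger unrelated message",
    ]
    for hour, n in sorted(count_by_hour(sample).items()):
        print(hour, n)
```

Feeding it the lines of the real error log (for example via open(path)) would show whether the counts cluster around particular hours, which might line up with batches of PDF extraction requests.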