I am wondering the best way to debug an error I am getting in Solr. The error is below, but as far as I can tell, pdfbox can not read a font and returns a null pointer which is passed to tika and then to solr. Even though it is only a warning, this appears to terminate the indexing and I get an error that the indexing could not complete.
My question is how do I determine what the name and directory of this file, and is there a way to configure either solr or tika to not terminate the indexing on a null pointer? Or is this a completely different problem? Thanks for any help or advice! 5/23/2014 9:56:09 AM WARN PDFStreamEngine java.lang.NullPointerException 5/23/2014 9:56:09 AM WARN PDFStreamEngine java.lang.NullPointerException 5/23/2014 9:56:09 AM WARN PDFStreamEngine java.io.IOException: Error: Could not find font(COSName{Rx142}) in map={Rx133=org.apache.pdfbox.pdmodel.font.PDTrueTypeFont@f1f3dd, Rx136=org.apache.pdfbox.pdmodel.font.PDTrueTypeFont@c15066, Rx138=org.apache.pdfbox.pdmodel.font.PDTrueTypeFont@1858b31, Rx110=org.apache.pdfbox.pdmodel.font.PDTrueTypeFont@233dfd, Rx02=org.apache.pdfbox.pdmodel.font.PDTrueTypeFont@186de83} 5/23/2014 9:56:09 AM WARN PDFStreamEngine java.lang.NullPointerException 5/23/2014 9:56:09 AM WARN PDFStreamEngine java.lang.NullPointerException 5/23/2014 9:56:09 AM WARN PDFStreamEngine java.lang.NullPointerException 5/23/2014 9:56:09 AM WARN PDFStreamEngine java.lang.NullPointerException 5/23/2014 9:56:09 AM WARN PDFStreamEngine java.lang.NullPointerException 5/23/2014 9:56:09 AM WARN PDFStreamEngine java.io.IOException: Error: Could not find font(COSName{Rx302}) in map={Rx110=org.apache.pdfbox.pdmodel.font.PDTrueTypeFont@233dfd, Rx02=org.apache.pdfbox.pdmodel.font.PDTrueTypeFont@186de83, Rx266=org.apache.pdfbox.pdmodel.font.PDTrueTypeFont@845fc8} 5/23/2014 9:56:09 AM WARN PDFStreamEngine java.lang.NullPointerException 5/23/2014 9:56:09 AM WARN PDFStreamEngine java.lang.NullPointerException 5/23/2014 9:56:09 AM WARN PDFStreamEngine java.io.IOException: Error: Could not find font(COSName{Rx302}) in map={Rx110=org.apache.pdfbox.pdmodel.font.PDTrueTypeFont@233dfd, Rx02=org.apache.pdfbox.pdmodel.font.PDTrueTypeFont@186de83, Rx266=org.apache.pdfbox.pdmodel.font.PDTrueTypeFont@845fc8} 5/23/2014 9:56:09 AM WARN PDFStreamEngine java.lang.NullPointerException 5/23/2014 9:56:09 AM WARN PDFStreamEngine java.lang.NullPointerException 5/23/2014 9:57:23 AM WARN COSDocument Warning: You did not close a PDF Document -- View this message in context: http://lucene.472066.n3.nabble.com/PDFStreamEngine-returning-a-NULL-pointer-error-tp4138722.html Sent from the Solr - User mailing list archive at Nabble.com.