Yep, this is a long-standing issue with several sub-parts, see:

https://issues.apache.org/jira/browse/SOLR-3382

Best
Erick

On Sun, Sep 9, 2012 at 5:04 AM, Patrick Oliver Glauner
<[email protected]> wrote:
> Hello
>
> We use Solr 3.1 and Jetty. I enabled logging in Jetty as described here: 
> http://wiki.apache.org/solr/LoggingInDefaultJettySetup
>
> We are indexing millions of documents and for some fulltexts we get 
> exceptions and therefore logs like this one:
>
> <record>
>
>   <date>2012-09-04T15:55:16</date>
>
>   <millis>1346766916578</millis>
>
>   <sequence>0</sequence>
>
>   <logger>org.apache.solr.core.SolrCore</logger>
>
>   <level>SEVERE</level>
>
>   <class>org.apache.solr.common.SolrException</class>
>
>   <method>log</method>
>
>   <thread>10</thread>
>
>   <message>java.lang.RuntimeException: [was class 
> java.io.CharConversionException] Invalid UTF-8 character 0xd835(a surrogate 
> character)  at c
>
> har #1144, byte #127)
>
>         at 
> com.ctc.wstx.util.ExceptionUtil.throwRuntimeException(ExceptionUtil.java:18)
>
>         at 
> com.ctc.wstx.sr.StreamScanner.throwLazyError(StreamScanner.java:731)
>
>         at 
> com.ctc.wstx.sr.BasicStreamReader.safeFinishToken(BasicStreamReader.java:3657)
>
>         at 
> com.ctc.wstx.sr.BasicStreamReader.getText(BasicStreamReader.java:809)
>
>         at org.apache.solr.handler.XMLLoader.readDoc(XMLLoader.java:287)
>
>         at org.apache.solr.handler.XMLLoader.processUpdate(XMLLoader.java:146)
>
>         at org.apache.solr.handler.XMLLoader.load(XMLLoader.java:77)
>
>         at 
> org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:55)
>
>         at 
> org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:129)
>
>         at org.apache.solr.core.SolrCore.execute(SolrCore.java:1360)
>
>         at 
> org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:356)
>
>         at 
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:252)
>
>         at 
> org.mortbay.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1212)
>
>         at 
> org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:399)
>
>         at 
> org.mortbay.jetty.security.SecurityHandler.handle(SecurityHandler.java:216)
>
>         at 
> org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:182)
>
>         at 
> org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:766)
>
>         at 
> org.mortbay.jetty.webapp.WebAppContext.handle(WebAppContext.java:450)
>
>         at 
> org.mortbay.jetty.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:230)
>
>         at 
> org.mortbay.jetty.handler.HandlerCollection.handle(HandlerCollection.java:114)
>
>         at 
> org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:152)
>
>         at org.mortbay.jetty.Server.handle(Server.java:326)
>
> [...]
>
> </message>
>
> </record>
>
>
> It would be great to also log the sent document id so that we can reindex the 
> respective document.
>
> How can we write the id to the log?
>
>
> Thank you!
>
>
> Best regards
>
> Patrick Glauner
>
> --
> Patrick GLAUNER [[email protected]]
>
> CERN
> Information Technology Department
> CH-1211 Geneva 23

Reply via email to