[ 
https://issues.apache.org/jira/browse/SOLR-2381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13008428#comment-13008428
 ] 

uygar bayar edited comment on SOLR-2381 at 3/18/11 1:30 PM:
------------------------------------------------------------

hi i use 3.x trunk. I insert documents with pecl php. 

-rw-r--r--  1 nutch nutch 540234 Mar 17 12:37 
jetty-6.1.26-patched-JETTY-1340.jar
-rw-r--r--  1 nutch nutch  11358 Mar 17 12:37 jetty-LICENSE.txt
-rw-r--r--  1 nutch nutch   1621 Mar 17 12:37 jetty-NOTICE.txt
-rw-r--r--  1 nutch nutch 177393 Mar 17 12:37 
jetty-util-6.1.26-patched-JETTY-1340.jar

SEVERE: org.apache.solr.common.SolrException: Invalid UTF-8 middle byte 0x63 
(at char #334, byte #127)
        at org.apache.solr.handler.XMLLoader.load(XMLLoader.java:79)
        at 
org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:55)
        at 
org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:129)
        at org.apache.solr.core.SolrCore.execute(SolrCore.java:1360)
        at 
org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:356)
        at 
org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:252)
        at 
org.mortbay.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1212)
        at 
org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:399)
        at 
org.mortbay.jetty.security.SecurityHandler.handle(SecurityHandler.java:216)
        at 
org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:182)
        at 
org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:766)
        at org.mortbay.jetty.webapp.WebAppContext.handle(WebAppContext.java:450)
        at 
org.mortbay.jetty.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:230)
        at 
org.mortbay.jetty.handler.HandlerCollection.handle(HandlerCollection.java:114)
        at 
org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:152)
        at org.mortbay.jetty.Server.handle(Server.java:326)
        at 
org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:542)
        at 
org.mortbay.jetty.HttpConnection$RequestHandler.content(HttpConnection.java:945)
        at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:756)
        at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:218)
        at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:404)
        at 
org.mortbay.jetty.bio.SocketConnector$Connection.run(SocketConnector.java:228)
        at 
org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java:582)
Caused by: com.ctc.wstx.exc.WstxIOException: Invalid UTF-8 middle byte 0x63 (at 
char #334, byte #127)
        at com.ctc.wstx.sr.StreamScanner.throwFromIOE(StreamScanner.java:708)
        at com.ctc.wstx.sr.BasicStreamReader.next(BasicStreamReader.java:1086)
        at org.apache.solr.handler.XMLLoader.readDoc(XMLLoader.java:281)
        at org.apache.solr.handler.XMLLoader.processUpdate(XMLLoader.java:146)
        at org.apache.solr.handler.XMLLoader.load(XMLLoader.java:77)
        ... 22 more
Caused by: java.io.CharConversionException: Invalid UTF-8 middle byte 0x63 (at 
char #334, byte #127)
        at com.ctc.wstx.io.UTF8Reader.reportInvalidOther(UTF8Reader.java:313)
        at com.ctc.wstx.io.UTF8Reader.read(UTF8Reader.java:204)
        at com.ctc.wstx.io.MergedReader.read(MergedReader.java:101)
        at com.ctc.wstx.io.ReaderSource.readInto(ReaderSource.java:84)
        at 
com.ctc.wstx.io.BranchingReaderSource.readInto(BranchingReaderSource.java:57)
        at 
com.ctc.wstx.sr.StreamScanner.loadMoreFromCurrent(StreamScanner.java:1046)
        at 
com.ctc.wstx.sr.StreamScanner.parseLocalName2(StreamScanner.java:1796)
        at com.ctc.wstx.sr.StreamScanner.parseLocalName(StreamScanner.java:1756)
        at 
com.ctc.wstx.sr.BasicStreamReader.handleNsAttrs(BasicStreamReader.java:2981)
        at 
com.ctc.wstx.sr.BasicStreamReader.handleStartElem(BasicStreamReader.java:2936)
        at 
com.ctc.wstx.sr.BasicStreamReader.nextFromTree(BasicStreamReader.java:2848)
        at com.ctc.wstx.sr.BasicStreamReader.next(BasicStreamReader.java:1019)
        ... 25 more

Mar 18, 2011 3:13:42 PM org.apache.solr.core.SolrCore execute
INFO: [] webapp=/solr path=/update/ params={indent=on&wt=xml&version=2.2} 
status=400 QTime=0 


      was (Author: ramires):
    hi i use 3.x trunk.

-rw-r--r--  1 nutch nutch 540234 Mar 17 12:37 
jetty-6.1.26-patched-JETTY-1340.jar
-rw-r--r--  1 nutch nutch  11358 Mar 17 12:37 jetty-LICENSE.txt
-rw-r--r--  1 nutch nutch   1621 Mar 17 12:37 jetty-NOTICE.txt
-rw-r--r--  1 nutch nutch 177393 Mar 17 12:37 
jetty-util-6.1.26-patched-JETTY-1340.jar

SEVERE: org.apache.solr.common.SolrException: Invalid UTF-8 middle byte 0x63 
(at char #334, byte #127)
        at org.apache.solr.handler.XMLLoader.load(XMLLoader.java:79)
        at 
org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:55)
        at 
org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:129)
        at org.apache.solr.core.SolrCore.execute(SolrCore.java:1360)
        at 
org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:356)
        at 
org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:252)
        at 
org.mortbay.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1212)
        at 
org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:399)
        at 
org.mortbay.jetty.security.SecurityHandler.handle(SecurityHandler.java:216)
        at 
org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:182)
        at 
org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:766)
        at org.mortbay.jetty.webapp.WebAppContext.handle(WebAppContext.java:450)
        at 
org.mortbay.jetty.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:230)
        at 
org.mortbay.jetty.handler.HandlerCollection.handle(HandlerCollection.java:114)
        at 
org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:152)
        at org.mortbay.jetty.Server.handle(Server.java:326)
        at 
org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:542)
        at 
org.mortbay.jetty.HttpConnection$RequestHandler.content(HttpConnection.java:945)
        at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:756)
        at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:218)
        at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:404)
        at 
org.mortbay.jetty.bio.SocketConnector$Connection.run(SocketConnector.java:228)
        at 
org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java:582)
Caused by: com.ctc.wstx.exc.WstxIOException: Invalid UTF-8 middle byte 0x63 (at 
char #334, byte #127)
        at com.ctc.wstx.sr.StreamScanner.throwFromIOE(StreamScanner.java:708)
        at com.ctc.wstx.sr.BasicStreamReader.next(BasicStreamReader.java:1086)
        at org.apache.solr.handler.XMLLoader.readDoc(XMLLoader.java:281)
        at org.apache.solr.handler.XMLLoader.processUpdate(XMLLoader.java:146)
        at org.apache.solr.handler.XMLLoader.load(XMLLoader.java:77)
        ... 22 more
Caused by: java.io.CharConversionException: Invalid UTF-8 middle byte 0x63 (at 
char #334, byte #127)
        at com.ctc.wstx.io.UTF8Reader.reportInvalidOther(UTF8Reader.java:313)
        at com.ctc.wstx.io.UTF8Reader.read(UTF8Reader.java:204)
        at com.ctc.wstx.io.MergedReader.read(MergedReader.java:101)
        at com.ctc.wstx.io.ReaderSource.readInto(ReaderSource.java:84)
        at 
com.ctc.wstx.io.BranchingReaderSource.readInto(BranchingReaderSource.java:57)
        at 
com.ctc.wstx.sr.StreamScanner.loadMoreFromCurrent(StreamScanner.java:1046)
        at 
com.ctc.wstx.sr.StreamScanner.parseLocalName2(StreamScanner.java:1796)
        at com.ctc.wstx.sr.StreamScanner.parseLocalName(StreamScanner.java:1756)
        at 
com.ctc.wstx.sr.BasicStreamReader.handleNsAttrs(BasicStreamReader.java:2981)
        at 
com.ctc.wstx.sr.BasicStreamReader.handleStartElem(BasicStreamReader.java:2936)
        at 
com.ctc.wstx.sr.BasicStreamReader.nextFromTree(BasicStreamReader.java:2848)
        at com.ctc.wstx.sr.BasicStreamReader.next(BasicStreamReader.java:1019)
        ... 25 more

Mar 18, 2011 3:13:42 PM org.apache.solr.core.SolrCore execute
INFO: [] webapp=/solr path=/update/ params={indent=on&wt=xml&version=2.2} 
status=400 QTime=0 

  
> The included jetty server does not support UTF-8
> ------------------------------------------------
>
>                 Key: SOLR-2381
>                 URL: https://issues.apache.org/jira/browse/SOLR-2381
>             Project: Solr
>          Issue Type: Bug
>            Reporter: Robert Muir
>            Assignee: Robert Muir
>            Priority: Blocker
>             Fix For: 3.1, 3.2, 4.0
>
>         Attachments: SOLR-2381-3.x+3.1.patch, SOLR-2381.patch, 
> SOLR-2381_take2.patch, SOLR-2381_xmltest.patch, 
> SOLR-ServletOutputWriter.patch, SOLR-ServletOutputWriter.patch, 
> jetty-6.1.26-patched-JETTY-1340.jar, jetty-6.1.26-patched-JETTY-1340.jar, 
> jetty-6.1.26-patched-SOLR-2381.jar, jetty-util-6.1.26-patched-JETTY-1340.jar, 
> jetty-util-6.1.26-patched-JETTY-1340.jar, 
> jetty-util-6.1.26-patched-SOLR-2381.jar, post_utf8enhanced.sh, 
> utf8enhanced.xml
>
>
> Some background here: 
> http://www.lucidimagination.com/search/document/6babe83bd4a98b64/which_unicode_version_is_supported_with_lucene
> Some possible solutions:
> * wait and see if we get resolution on 
> http://jira.codehaus.org/browse/JETTY-1340. To be honest, I am not even sure 
> where jetty is being maintained (there is a separate jetty project at 
> eclipse.org with another bugtracker, but the older releases are at codehaus).
> * include a patched version of jetty with correct utf-8, using that patch.
> * remove jetty and include a different container instead.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to