Re: Payload and exact search - 2
Ok, it works! Thank you very much.

Leonardo

--
View this message in context: http://lucene.472066.n3.nabble.com/Payload-and-exact-search-2-tp3750355p3760477.html
Sent from the Solr - User mailing list archive at Nabble.com.
Re: Payload and exact search - 2
Thanks for your reply. So, if I apply <filter class="solr.DelimitedPayloadTokenFilterFactory" encoder="identity"/> as the first filter in the chain, it should work.

In this new configuration, the first filter in the chain intercepts the payload: it handles and removes the payload info, and the subsequent filters are then applied to the clean text. Is that right?

Leonardo

--
View this message in context: http://lucene.472066.n3.nabble.com/Payload-and-exact-search-2-tp3750355p3758152.html
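
For reference, the reordering asked about here could look like the sketch below. It assumes the rest of the wppayloads fieldType from the original post stays unchanged and only moves solr.DelimitedPayloadTokenFilterFactory to the front of the filter chain. The XML quoting is reconstructed by hand, so treat it as an illustration rather than the exact configuration that was used.

```xml
<!-- Sketch only: same filters as in the original post, with the payload filter moved first -->
<fieldtype name="wppayloads" stored="false" indexed="true" class="solr.TextField">
  <analyzer>
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <!-- Runs first: strips "|box" from each token and stores it as the payload -->
    <filter class="solr.DelimitedPayloadTokenFilterFactory" encoder="identity"/>
    <!-- The remaining filters now only see the plain word -->
    <filter class="solr.WordDelimiterFilterFactory"
            generateWordParts="1" generateNumberParts="1"
            catenateWords="1" catenateNumbers="1" catenateAll="0"
            splitOnCaseChange="1"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldtype>
```

With this order, the payload filter sees the raw word|box token before solr.WordDelimiterFilterFactory can split it on the punctuation in the box coordinates, which would otherwise put extra number tokens between the words. Since this changes index-time analysis, the documents have to be reindexed after the schema is updated.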
Payload and exact search - 2
Hello,

I already posted this question, but for some reason it was attached to a thread with a different topic.

Is it possible to perform an 'exact search' on a payload field?

I have to index text with auxiliary info for each word. In particular, each word is associated with the bounding box that contains it in the original PDF page (it is used for highlighting the search terms in the PDF). I used the payload to store that information.

In schema.xml, the fieldType definition is:

---
<fieldtype name="wppayloads" stored="false" indexed="true" class="solr.TextField">
  <analyzer>
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.WordDelimiterFilterFactory"
            generateWordParts="1" generateNumberParts="1"
            catenateWords="1" catenateNumbers="1" catenateAll="0"
            splitOnCaseChange="1"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.DelimitedPayloadTokenFilterFactory" encoder="identity"/>
  </analyzer>
</fieldtype>
---

while the field definition is:

---
<field name="words" type="wppayloads" indexed="true" stored="true" required="true" multiValued="true"/>
---

When indexing, the field 'words' contains a list of word|box entries, as in the following example:

---
doc_id=example
words={Fonte:|307.62,948.16,324.62,954.25 Comune|326.29,948.16,349.07,954.25 di|350.74,948.16,355.62,954.25 Bologna|358.95,948.16,381.28,954.25}
---

This solution works well except for exact searches. For example, assuming the only indexed document is the 'example' document shown above, the query words:"Comune di Bologna" returns no results.

Does anyone know whether it is possible to perform an 'exact search' on a payload field?

Thanks in advance,
Leonardo

--
View this message in context: http://lucene.472066.n3.nabble.com/Payload-and-exact-search-2-tp3750355p3750355.html
Re: error in indexing
Can someone help me?

Leonardo

--
View this message in context: http://lucene.472066.n3.nabble.com/error-in-indexing-tp3709686p3712495.html
error in indexing
Hi all,

I get a NullPointerException in Solr on Jetty when using CommonsHttpSolrServer.add(SolrInputDocument), and I cannot solve it. Please help me. According to the core stats, the document is not indexed.

The stack trace is:

2-feb-2012 13.05.04 org.apache.solr.common.SolrException log
GRAVE: java.lang.NullPointerException
    at org.apache.lucene.document.Document.getFieldables(Document.java:240)
    at org.apache.solr.update.UpdateHandler.getIndexedId(UpdateHandler.java:113)
    at org.apache.solr.update.DirectUpdateHandler2.addDoc(DirectUpdateHandler2.java:230)
    at org.apache.solr.update.processor.RunUpdateProcessor.processAdd(RunUpdateProcessorFactory.java:61)
    at org.apache.solr.handler.XMLLoader.processUpdate(XMLLoader.java:139)
    at org.apache.solr.handler.XMLLoader.load(XMLLoader.java:69)
    at org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:54)
    at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131)
    at org.apache.solr.core.SolrCore.execute(SolrCore.java:1316)
    at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:338)
    at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:241)
    at org.mortbay.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1089)
    at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:365)
    at org.mortbay.jetty.security.SecurityHandler.handle(SecurityHandler.java:216)
    at org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:181)
    at org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:712)
    at org.mortbay.jetty.webapp.WebAppContext.handle(WebAppContext.java:405)
    at org.mortbay.jetty.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:211)
    at org.mortbay.jetty.handler.HandlerCollection.handle(HandlerCollection.java:114)
    at org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:139)
    at org.mortbay.jetty.Server.handle(Server.java:285)
    at org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:502)
    at org.mortbay.jetty.HttpConnection$RequestHandler.content(HttpConnection.java:835)
    at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:723)
    at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:202)
    at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:378)
    at org.mortbay.jetty.bio.SocketConnector$Connection.run(SocketConnector.java:226)
    at org.mortbay.thread.BoundedThreadPool$PoolThread.run(BoundedThreadPool.java:442)

Leonardo

--
View this message in context: http://lucene.472066.n3.nabble.com/error-in-indexing-tp3709686p3709686.html