Re: SOLR SpeelChecker and german Umlauts
Michael Ludwig schrieb: Kraus, Ralf | pixelhouse GmbH schrieb: When I am searching for ONE word with an german umlaut like kräuterkeckse (the right word is kräuterkekse) the spellchecker gives me two corrections : Spellcheck for kr = kren Spellcheck for uterkeksse = butterkekse WHY is SOLR break this ONE word apart ? Moin Ralf, please read the following threads to understand the issue. In short, you need to specify your query in spellcheck.q as well. DANKE ! :-) Greets -Ralf-
SOLR SpeelChecker and german Umlauts
Hello, I really need some help with the SOLR SpeelChecker and german Umlauts. So far I am really satisfied with the JAROWINKLER algorithm. Now my problem :-) When I am searching for ONE word with an german umlaut like kräuterkeckse (the right word is kräuterkekse) the spellchecker gives me two corrections : Spellcheck for kr = kren Spellcheck for uterkeksse = butterkekse WHY is SOLR break this ONE word apart ? My Config is : spellcheck = 'true'; spellcheck.dictionary = 'jarowinkler' spellcheck.onlyMorePopular = 'true' spellcheck.build = 'false' spellcheck.count = 1 Greets -Ralf-
Re: Big Problem with special characters
Otis Gospodnetic schrieb: Try debugQuery=true and see if the resulting query string makes sense. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch thx for the hint... My problem was the WhitespaceTokenizer :-( After I change back to StandardTokenizer everythign was fine ! Greets -Ralf-
Big Problem with special characters
Hello, first some details about my SOLR installation: schema.xml fieldType name=text_test class=solr.TextField positionIncrementGap=100 analyzer type=index tokenizer class=solr.WhitespaceTokenizerFactory/ filter class=solr.StopFilterFactory ignoreCase=true words=stopwords.txt/ filter class=solr.LengthFilterFactory min=2 max=50 / filter class=solr.LowerCaseFilterFactory/ filter class=solr.RemoveDuplicatesTokenFilterFactory/ filter class=solr.SnowballPorterFilterFactory language=german / /analyzer analyzer type=query tokenizer class=solr.WhitespaceTokenizerFactory/ filter class=solr.StopFilterFactory ignoreCase=true words=stopwords.txt/ filter class=solr.LowerCaseFilterFactory/ filter class=solr.RemoveDuplicatesTokenFilterFactory/ filter class=solr.SnowballPorterFilterFactory language=german / /analyzer /fieldType Search: qf=name^2.0+name2^1.5+name3 wt=phps rows=30 start=0 sort=score+desc fl=*,score q=speed qt=dismax When I have a string like (speed) in name3 or name2 SOLR dont find it at all :-( If I search for (speed) erverything is fine ! Greets -Ralf-
Using Lucene MultiFieldQueryParser with SOLR
Hello, I am searching for a way to use the Lucene MultiFieldQueryParser in my SOLR Installation. Is there a chance to change the solrQueryParser ? In my old Lucene Setting I used to combine many different types of QueryParser in my Querry... Or is there a chance to get MultiFieldQueryParser functions in SOLR ? Greets -Ralf-
Re: Using Lucene MultiFieldQueryParser with SOLR
Marc Sturlese schrieb: Think there's no search handler that uses MultiFieldQueryParser in Solr. But check DismaxRequestHandler, probably will do the job. Yo can specify all the fields where you want to search in and it will build the query using boolean queries. It includes also many more features: http://wiki.apache.org/solr/DisMaxRequestHandler Is there a chance to combine RequestHandler ? I need to use some additional normal boolean and integer querries ! Greets -Ralf-
Re: Using Lucene MultiFieldQueryParser with SOLR
Marc Sturlese schrieb: Think there's no search handler that uses MultiFieldQueryParser in Solr. But check DismaxRequestHandler, probably will do the job. Yo can specify all the fields where you want to search in and it will build the query using boolean queries. It includes also many more features: http://wiki.apache.org/solr/DisMaxRequestHandler THX A LOT ! You really made my day ! Greets -Ralf-
Re: Using Lucene MultiFieldQueryParser with SOLR
Marc Sturlese schrieb: Well dismax has a q.alt parameter where you can specify a query in lucene sintax. The query must be empty to use q.alt: http://.../select?q=q.alt=phone_number:1234567 This would search in the field phone_number independly of what fields you have configured in teh dismax. Now I use the fq parameter in combination with q.alt ... Runs fine yet :-) The fq parameter sets my additional query parameter :-) Greets -Ralf-
Re: Using Lucene MultiFieldQueryParser with SOLR
Marc Sturlese schrieb: The only problem I found with q.alt is that it doesn't allow highlighting (or at least it doesn't showed it for me). If you find out how to do it let me know. I use highlighting only with the normal querry ! My q.alt is *.* But its really sad that the dismax dont support wildcarts :-( Greets -Ralf-
Re: Disable logging in SOLR
Bill Au schrieb: Have you tried setting logging level to OFF from Solr's admin GUI: http://wiki.apache.org/solr/SolrAdminGUI thx 4 the hint ! But after I restart my tomcat its all reseted to default ? :-( Greets -Ralf-
Re: Disable logging in SOLR
Mark Miller schrieb: Kraus, Ralf | pixelhouse GmbH wrote: Hi, is there a way to disable all logging output in SOLR ? I mean the output text like : INFO: [core_de] webapp=/solr path=/update params={wt=json} status=0 QTime=3736 greets -Ralf- You probably do not want to totally disable logging in Solr. More likely, your looking to make Solr less chatty by not logging the INFO level. Solr is a bit chatty by default, mostly I think, because that can be very useful and is often worth the likely very small performance hit of all the extra logging. At the least though, I think you want to leave Severe/Error logging on in most cases, and possibly WARN. Its easy enough to change the logging levels though. Solr 1.3 uses java.util.logging and Solr 1.4 uses SLF4J defaulting to java.util.logging. So you can either change the system level properties file in your JDK folder, or you can use a param at startup: |-Djava.util.logging.config.file=/path/to/my/logging.properties Thats exactly the way I choose yesterday ;-) Thx Greets -Ralf-
Disable logging in SOLR
Hi, is there a way to disable all logging output in SOLR ? I mean the output text like : INFO: [core_de] webapp=/solr path=/update params={wt=json} status=0 QTime=3736 greets -Ralf-
Re: Combination of solr.xml and solrconfig.xml
Hi, question ;-) !DOCTYPE config SYSTEM http://java.sun.com/dtd/web-app_2_3.dtd; [ !ENTITY default_solrconfig SYSTEM /var/lib/tomcat5.5/webapps/solr/default_solrconfig.xml ] Is there a chance to set the home directory using a variable ? For example an unix enviroment variable ? Greets -Ralf- No chance ? Greets -Ralf-
BOSTing FIELDS in SCHEMA.XML
Hi, is there a chance to set boosting directly in my schema.xml ? field name=Name1 type=text indexed=true stored=true multivalued=true omitNorms=false boost=2 / field name=Name2 type=text indexed=true stored=true multivalued=true omitNorms=false boost=1.5f / field name=Name3 type=text indexed=true stored=true multivalued=true omitNorms=false boost=0.5f / field name=Name4 type=text indexed=true stored=true / I want do it directly in the index and not in my query ! Greets -Ralf-
Combination of solr.xml and solrconfig.xml
Hi, I have a little question concerning the combination of solr.xml and solrconfig.xml ! I am running a multicore configuration for doing multiple language index files ... my structure is like \solr (solr.xml) \solr\core_de (solrconfig.xml for german profile) \solr\core_uk (solrconfig.xml for english profile) My problem is that there is many many redundant code in the two solrconfig.xml file and I wonder if I can move this code to solr.xml and only leave the core specific lines in the solrconfig.xml ? Is there any chance to combine solrconfig.xml files ? Or any other suggestions ? Greets, Ralf
Re: Combination of solr.xml and solrconfig.xml
Giovanni De Stefano schrieb: Guten Tag Ralf, try to look here: http://wiki.apache.org/solr/CoreAdmin It says: You can also specify properties in solr.xml which can be used in the solrconfig.xml and schema.xml files. I believe this might help. thx !! I tried this : (My solr.xml :) solr persistent=true sharedLib=lib config updateHandler class=solr.DirectUpdateHandler2 / etc. . . . . /config cores adminPath=/admin/cores core name=core_uk instanceDir=core_uk config dataDir//x/x/dataDir /config /core /cores /solr Doesnt seem to work :-( Greets -Ralf-
Re: Combination of solr.xml and solrconfig.xml
Giovanni De Stefano schrieb: Hello Ralf, I believe you want to do something like: solr persistent=true sharedLib=lib cores adminPath=/admin/cores core name=core0 instanceDir=core0 property name=dataDir value=/data/core0 / /core core name=core1 instanceDir=core1 / /cores /solr Okay I will try to explain it better :-) - my two solrconfig.xml files are around 200 lines long - ONLY one line is the difference between this files : dataDir/www/solr//x/dataDir - I want to transfer this 199 other lines to my solr.xml so I don´t have this lines redundant. Greets -Ralf-
Re: Combination of solr.xml and solrconfig.xml
Giovanni De Stefano schrieb: On the other hand, if your files have 199 lines in *common*, then yes, you could/should move those 199 common lines in solr.xml. YES ! Thats my case But how could I transfer this lines (solrconfig.xml) config dataDir/www/solr//xx/dataDir updateHandler class=solr.DirectUpdateHandler2 / abortOnConfigurationError${solr.abortOnConfigurationError:true}/abortOnConfigurationError indexDefaults useCompoundFilefalse/useCompoundFile mergeFactor2/mergeFactor ramBufferSizeMB32/ramBufferSizeMB maxMergeDocs2147483647/maxMergeDocs maxFieldLength1/maxFieldLength writeLockTimeout1000/writeLockTimeout commitLockTimeout1/commitLockTimeout lockTypesingle/lockType /indexDefaults mainIndex useCompoundFilefalse/useCompoundFile mergeFactor2/mergeFactor ramBufferSizeMB32/ramBufferSizeMB maxMergeDocs2147483647/maxMergeDocs maxFieldLength1/maxFieldLength unlockOnStartupfalse/unlockOnStartup /mainIndex etc. to my solr.xml ??? Greets -Ralf-
Re: Combination of solr.xml and solrconfig.xml
Erik Hatcher schrieb: Unfortunately Solr doesn't offer that kind of reusability (yet). But you could just go old school and use XML entity includes to pull in a fragment of a file, and externalize the common pieces that way. Not as pretty, but oughta work. thx a lot ! This sounds perfect... (yet) Greets -Ralf-
SPELLCHECK Problems
Hi, My SOLRCONFIG.XML requestHandler name=/spellCheckCK class=solr.SearchHandler lst name=defaults /lst arr name=last-components strspellcheck/str /arr /requestHandler searchComponent name=spellcheck class=solr.SpellCheckComponent str name=queryAnalyzerFieldTypetextSpell/str lst name=spellchecker str name=namedefault/str str name=fieldspell/str str name=spellcheckIndexDir./spellchecker1/str str name=buildOnOptimizetrue/str /lst lst name=spellchecker str name=namejarowinkler/str str name=fieldspell/str !-- Use a different Distance Measure -- str name=distanceMeasureorg.apache.lucene.search.spell.JaroWinklerDistance/str str name=spellcheckIndexDir./spellchecker2/str str name=buildOnOptimizetrue/str /lst lst name=spellchecker str name=classnamesolr.FileBasedSpellChecker/str str name=namefile/str str name=sourceLocationdictionary.txt/str str name=characterEncodingUTF-8/str str name=spellcheckIndexDir./spellcheckerFile/str str name=buildOnOptimizetrue/str /lst /searchComponent My Schema.xml field name=RezeptNameSpellCheck type=textSpell indexed=true stored=true multiValued=true/ copyField source=RezeptName dest=RezeptNameSpellCheck / Search: spellcheck=true wt=phps rows=30 start=0 sort=score+desc spellcheck.build=true spellcheck.extendedResults=false spellcheck.count=1 q=sudeln spellcheck.onlyMorePopular=true spellcheck.dictionary=file Now my Problems :-) If I use the file choice with spellcheck.dictionary=file I got very bad suggestions :-( If I use default or Jarowinkler I dont get any suggestions at all :-( Whats the problem ? Greets, Ralf
Re: SPELLCHECK Problems
Grant Ingersoll schrieb: What's your textSpell FieldType look like? Spelling is definitely something that needs tuning, so you might have to play with some of the knobs like accuracy, etc. As for JaroWinkler, and I suppose the default, your field is spell, but based on your configuration, I gather you really want it to be RezeptNameSpellCheck. I am guessing that if you point Luke at your those two spell checking indexes, you're going to find that they are empty. Hey ! Thx a lot ... Thats indeed was my problem :-) Greets, Ralf
Need help with DictionaryCompoundWordTokenFilterFactory
Hi, Now I ran into another problem by using the solr.DictionaryCompoundWordTokenFilterFactory :-( If I search for the german word Spargelcremesuppe which contains Spargel, Creme and Suppe SOLR will find way to many result. Its because SOLR finds EVERY entry with either one of the three words in it :-( Here is my schema.xml fieldType name=text_text class=solr.TextField positionIncrementGap=100 analyzer tokenizer class=solr.WhitespaceTokenizerFactory/ filter class=solr.DictionaryCompoundWordTokenFilterFactory dictionary=dictionary.txt minWordSize=5 minSubwordSize=2 maxSubwordSize=15 onlyLongestMatch=true / filter class=solr.SynonymFilterFactory synonyms=synonyms.txt ignoreCase=true expand=true/ filter class=solr.StopFilterFactory ignoreCase=true words=stopwords.txt/ filter class=solr.LowerCaseFilterFactory/ filter class=solr.RemoveDuplicatesTokenFilterFactory/ filter class=solr.SnowballPorterFilterFactory language=German / /analyzer /fieldType Any help ? Greets, Ralf Kraus
Re: Problem with setting solr.solr.home property
Manupriya schrieb: Hi, Till now I was working with the jetty server bundled with the SOLR distribution. But I want to deploy solr.war to another jetty server. Here I am facing some problem with solr/home. Whenever I start the jetty server, I try to extract the solr.war and edit the web.xml ! Greets -Ralf-
Re: SOLR Problem with special chars
Otis Gospodnetic schrieb: Ralf, Can you paste the part of your schema.xml where you defined the relevant field? Otis Sure ! fieldType name=text_normal class=solr.TextField positionIncrementGap=100 analyzer type=index tokenizer class=solr.WhitespaceTokenizerFactory/ filter class=solr.LowerCaseFilterFactory/ filter class=solr.SnowballPorterFilterFactory language=German / /analyzer analyzer type=query tokenizer class=solr.WhitespaceTokenizerFactory/ filter class=solr.LowerCaseFilterFactory/ filter class=solr.SnowballPorterFilterFactory language=German / /analyzer /fieldType Greets
Re: SOLR Problem with special chars
Otis Gospodnetic schrieb: now it works : fieldType name=text_new class=solr.TextField positionIncrementGap=100 analyzer type=index tokenizer class=solr.WhitespaceTokenizerFactory/ filter class=solr.StopFilterFactory ignoreCase=true words=stopwords.txt/ filter class=solr.WordDelimiterFilterFactory generateWordParts=1 generateNumberParts=1 catenateWords=1 catenateNumbers=1 catenateAll=0 splitOnCaseChange=1/ filter class=solr.LengthFilterFactory min=3 max=50 / filter class=solr.LowerCaseFilterFactory/ filter class=solr.RemoveDuplicatesTokenFilterFactory/ filter class=solr.SnowballPorterFilterFactory language=German / filter class=solr.EnglishPorterFilterFactory protected=protwords.txt / /analyzer analyzer type=query tokenizer class=solr.WhitespaceTokenizerFactory/ filter class=solr.SynonymFilterFactory synonyms=synonyms.txt ignoreCase=true expand=true/ filter class=solr.StopFilterFactory ignoreCase=true words=stopwords.txt/ filter class=solr.WordDelimiterFilterFactory generateWordParts=1 generateNumberParts=1 catenateWords=0 catenateNumbers=0 catenateAll=0 splitOnCaseChange=1/ filter class=solr.LowerCaseFilterFactory/ filter class=solr.RemoveDuplicatesTokenFilterFactory/ filter class=solr.SnowballPorterFilterFactory language=German / filter class=solr.EnglishPorterFilterFactory protected=protwords.txt / /analyzer /fieldType Greets, Ralf
SOLR Problem with special chars
Hello, My string in my DB is like Kellogs, Corn- (Flakes) When I search with Kellogs or Corn or Flakes I cant find the entry in my index :-( Is there something I missing ? Greets, -- Ralf Kraus
Re: Need help with SolrIndexSearcher CoreContainer
Hi, After 5-6 searches I run out of memory :-( Examples: String homeDir = /var/lib/tomcat5.5/webapps/solr; File configFile = new File( homeDir, solr.xml ); CoreContainer myCoreContainer = new CoreContainer( homeDir, configFile ); mySolrCore = myCoreContainer.getCore(core_de); RefCountedSolrIndexSearcher temp_search = mySolrCore.getSearcher(); SolrIndexSearcher searcher = temp_search.get(); No one ever worked directly with CoreContainer and SolrIndexSearcher ? Greets -Ralf-
Need help with SolrIndexSearcher CoreContainer
Hi, I want to use a SolrIndexSearcher for some special searches in my app... I startup my Solr with two cores in it (core_de core_uk). But when I try this then my Solr Server generates a complete new cory instead of using the existing one... After 5-6 searches I run out of memory :-( Examples: String homeDir = /var/lib/tomcat5.5/webapps/solr; File configFile = new File( homeDir, solr.xml ); CoreContainer myCoreContainer = new CoreContainer( homeDir, configFile ); mySolrCore = myCoreContainer.getCore(core_de); RefCountedSolrIndexSearcher temp_search = mySolrCore.getSearcher(); SolrIndexSearcher searcher = temp_search.get(); Greets -Ralf-
Re: Problem with SolrJ and SolrIndexSearcher
Otis Gospodnetic schrieb: Hi Ralf, Your subject mentions SolrJ, but I don't see any SolrJ code in your email. It looks as if you are messing with Solr's own code. thx for you help... Greets
Problem with SolrJ and SolrIndexSearcher
Hi, I want to use a SolrIndexSearcher for some special searches in my app... I startup my Solr with two cores in it (core_de core_uk). But when I try this then my Solr Server generates a complete new cory instead of using the existing one... After 5-6 searches I run out of memory :-( Examples: String homeDir = /var/lib/tomcat5.5/webapps/solr; File configFile = new File( homeDir, solr.xml ); CoreContainer myCoreContainer = new CoreContainer( homeDir, configFile ); mySolrCore = myCoreContainer.getCore(core_de); RefCountedSolrIndexSearcher temp_search = mySolrCore.getSearcher(); SolrIndexSearcher searcher = temp_search.get(); Greets -Ralf-
Re: Performanec Lucene / Solr
Hey, I think it will have the disadvantage of being a lot slower though... How were you handling things with Lucene? You must have used Java then? If you even want to get close to that performance I think you need to use non http embedded solr. I am using this : - I wrote a JAVA JSP file to get an EmbeddedSolrServer - Now I call this JSP file from my PHP script and the JSP makes my search request to SOLR - after that I generate a CSV file out of the JSP and read it from PHP It´s the same way I did it with the prior LUCENE engine I used. But now the peformence is 10% from the prior LUCENE speed :-( Greets -Ralf-
Re: Performanec Lucene / Solr
Hi, Thx a lot for the tip ! But when I try it I got HTTP/1.1 500 null java.lang.NullPointerException at org.apache.solr.common.util.StrUtils.splitSmart(StrUtils.java:37) My Request is : INFO: [core_de] webapp=/solr path=/select/ params={wt=phpsquery=Toolsrecords=30start_record=0} status=500 QTime=1 Exception in SOLR: SCHWERWIEGEND: java.lang.NullPointerException at org.apache.solr.common.util.StrUtils.splitSmart(StrUtils.java:37) at org.apache.solr.search.OldLuceneQParser.parse(LuceneQParserPlugin.java:104) at org.apache.solr.search.QParser.getQuery(QParser.java:88) at org.apache.solr.handler.component.QueryComponent.prepare(QueryComponent.java:82) at org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:148) at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131) at org.apache.solr.core.SolrCore.execute(SolrCore.java:1204) at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:303) at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:232) at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:202) at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:173) at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:213) at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:178) at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:126) at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:105) at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:107) at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:148) at org.apache.coyote.http11.Http11AprProcessor.process(Http11AprProcessor.java:833) at org.apache.coyote.http11.Http11AprProtocol$Http11ConnectionHandler.process(Http11AprProtocol.java:639) at org.apache.tomcat.util.net.AprEndpoint$Worker.run(AprEndpoint.java:1285) at java.lang.Thread.run(Thread.java:595) Greets -Ralf-
Re: Performanec Lucene / Solr
Hi, And rows instead of records, and start instead of start_record. :) Erik You´re my man :-) Greets -Ralf-
Re: Performanec Lucene / Solr
Hi, queryResponseWriter name=phps class=org.apache.solr.request.PHPSerializedResponseWriter/ Then in PHP, hit Solr directly like this: $response = unserialize(file_get_contents($url)); Where $url is something like http://localhost:8983/solr/select?q=*:* No SOLR is 2times faster than LUCENE = Strike ! Hello weekend I am comming :-) Greets -Ralf-
Performanec Lucene / Solr
Hello, I am validating Sorl 1.3 now for about 3 weeks... My goal is to migrate from Lucene to Solr because of the much better plugins and search functions. Right now I am stress testing the performence and sending 2500 search request via JSON protocol and from my PHPUnit testcase. All search reuqest are different so caching don´t do it for me. Right now our old Lucene-JSPs are avout 4 times faster than my SOLR Sollution :-( any chances I can tweak my solrconfig.xml ? Greets -Ralf-
Re: Performanec Lucene / Solr
Mark Miller schrieb: Right now I am stress testing the performence and sending 2500 search request via JSON protocol and from my PHPUnit testcase. All search reuqest are different so caching don´t do it for me. Right now our old Lucene-JSPs are avout 4 times faster than my SOLR Sollution :-( Well, with Lucene it is an API call in the same JVM in the same web application. With Solr, you are making HTTP calls across the network, serializing requests and de-serializing responses. So the comparison is not exactly apples to apples. Look at what Solr offers -- replication, caching, plugins etc. Will you really need to go over 2500 requests per second? Do you need to be concerned with performance above and beyond that? Will it be easier to scale out to more boxes? And have you tried solrj without http? Right now I am using this php classes to send and receiver my requests : - Apache_Solr_Service.php - Responce.php It has the advantage that I don´t need to write extra JSP oder JAVA code... Greets -Ralf-
Re: Performanec Lucene / Solr
Mark Miller schrieb: Kraus, Ralf | pixelhouse GmbH wrote: Mark Miller schrieb: Right now I am stress testing the performence and sending 2500 search request via JSON protocol and from my PHPUnit testcase. All search reuqest are different so caching don´t do it for me. Right now our old Lucene-JSPs are avout 4 times faster than my SOLR Sollution :-( Well, with Lucene it is an API call in the same JVM in the same web application. With Solr, you are making HTTP calls across the network, serializing requests and de-serializing responses. So the comparison is not exactly apples to apples. Look at what Solr offers -- replication, caching, plugins etc. Will you really need to go over 2500 requests per second? Do you need to be concerned with performance above and beyond that? Will it be easier to scale out to more boxes? And have you tried solrj without http? Right now I am using this php classes to send and receiver my requests : - Apache_Solr_Service.php - Responce.php It has the advantage that I don´t need to write extra JSP oder JAVA code... Greets -Ralf- I think it will have the disadvantage of being a lot slower though... How were you handling things with Lucene? You must have used Java then? If you even want to get close to that performance I think you need to use non http embedded solr. Okay okay :-) I am writing a new JSP Handler for my requests as we speak :-) I really hope performence will be better than with {wt=javabin} Greets -Ralf-
Re: Performanec Lucene / Solr
Grant Ingersoll schrieb: Have you gone through http://wiki.apache.org/solr/SolrPerformanceFactors ? Can you explain a little more about your testcase, maybe even share code? I only know a little PHP, but maybe someone else who is better versed might spot something. I just wrote my JSP script for using solrj instead performence is much much better now ! Greets -Ralf-
Override Similarity
Hello, is there a chance to override the Similarity in my search ? In fact I want that all result return a 1 (with the idf methode). Greets -Ralf-
Override Similarity
Hello, is there a chance to override the Similarity in my search ? In fact I want that all result return a 1 (with the idf methode). Greets -Ralf-
Sorting TEXT Field problems :-(
Hello, Querry: {wt=jsonrows=30json.nl=mapstart=0sort=RezeptName+asc} Result : Doppeldecker Eiersalat Curry - Eiersalat Eiersalat Why is my second Curry... after Doppeldecker ??? RezeptName is a normal text field defined as : fieldType name=text class=solr.TextField positionIncrementGap=100 analyzer type=index tokenizer class=solr.WhitespaceTokenizerFactory/ filter class=solr.LowerCaseFilterFactory/ filter class=solr.RemoveDuplicatesTokenFilterFactory/ filter class=solr.SnowballPorterFilterFactory language=German / /analyzer analyzer type=query tokenizer class=solr.WhitespaceTokenizerFactory/ filter class=solr.LowerCaseFilterFactory/ filter class=solr.RemoveDuplicatesTokenFilterFactory/ filter class=solr.SnowballPorterFilterFactory language=German / /analyzer /fieldType Greets -Ralf-
Re: Sorting TEXT Field problems :-(
Thomas Traeger schrieb: Kraus, Ralf | pixelhouse GmbH schrieb: Hello, Querry: {wt=jsonrows=30json.nl=mapstart=0sort=RezeptName+asc} Result : Doppeldecker Eiersalat Curry - Eiersalat Eiersalat Why is my second Curry... after Doppeldecker ??? RezeptName is a normal text field defined as : fieldType name=text class=solr.TextField positionIncrementGap=100 analyzer type=index tokenizer class=solr.WhitespaceTokenizerFactory/ filter class=solr.LowerCaseFilterFactory/ filter class=solr.RemoveDuplicatesTokenFilterFactory/ filter class=solr.SnowballPorterFilterFactory language=German / /analyzer analyzer type=query tokenizer class=solr.WhitespaceTokenizerFactory/ filter class=solr.LowerCaseFilterFactory/ filter class=solr.RemoveDuplicatesTokenFilterFactory/ filter class=solr.SnowballPorterFilterFactory language=German / /analyzer /fieldType Greets -Ralf- Hi, normally you would define at least one special field for sorting: http://wiki.apache.org/solr/CommonQueryParameters#head-9f40612b42721ed9e1979a4a80d68f4f8524e9b4 you have to use a single valued, indexed but untokenized field (or use a tokenizer that produces only one token) You might also look at field alphaOnlySort in the example schema. Tom runs perfekt now ! Thx Greets -Ralf-
Re: Need help with DictionaryCompoundWordTokenFilterFactory
Steven A Rowe schrieb: Oops, variable-name != attribute-name. Thanks Hoss. Steve So dictFile or dictionary ??? Greets -Ralf-
Re: Need help with DictionaryCompoundWordTokenFilterFactory
Chris Hostetter schrieb: : :dictFile=de_DR.xml : : according to the code the param name is dictionary not dictFile. PS: the dictionary file shouldn't be and XML file, it should look just like a stopwords file (one word per line) -Hoss thx ! It finally runs perfect ! Greets -Ralf-
Re: Need help with DictionaryCompoundWordTokenFilterFactory
Thx a lot ! I downloaded a dictionary called de_DR.xml and put it into my conf directory... Then I changed my schema.xml to : class=solr.DictionaryCompoundWordTokenFilterFactory dictFile=./conf/de_DR.xml minWordSize=5 minSubwordSize=2 maxSubwordSize=15 onlyLongestMatch=true but solr can´t find the dictionary file :-( SCHWERWIEGEND: Could not start SOLR. Check solr/home property java.lang.RuntimeException: Error opening null at org.apache.solr.core.SolrResourceLoader.openResource(SolrResourceLoader.java:191) at org.apache.solr.core.SolrResourceLoader.getLines(SolrResourceLoader.java:237) at org.apache.solr.core.SolrResourceLoader.getLines(SolrResourceLoader.java:213) at org.apache.solr.analysis.DictionaryCompoundWordTokenFilterFactory.inform(DictionaryCompoundWordTokenFilterFactory.java:49) at org.apache.solr.core.SolrResourceLoader.inform(SolrResourceLoader.java:322) Any hints ? Greets -Ralf-
Re: Multi-language solr1.3 what would you reckon?
Hannes Carl Meyer schrieb: Hi, is it really neccessary to put it all into one index? You could also use the Solr MultiCore/MultipleIndexes feature and seperate by language. Is there a good webpage with infos about the multiindex-feature ? I know http://wiki.apache.org/solr/MultipleIndexes but there is not enough info :-( Greets -Ralf-
Need help with DictionaryCompoundWordTokenFilterFactory
Hi, I am trying to solve the typical german Donaudampfschiff- problem by using the DictionaryCompoundWordTokenFilter ... Anyone can show me how to configure my schema.xml to use the DictionaryCompoundWordTokenFilterFactory ??? Greets -Ralf-
Re: Need help with more than just one index
Hannes Carl Meyer schrieb: Hi Ralf, since Solr 1.3 it is possible to run multiple cores (indexes) inside a single deployment, please check: http://wiki.apache.org/solr/MultipleIndexes it is not even about seperating indexes but also have different configurations, index and query analyzers etc. Thx a lot Hannes ! Greets -Ralf-
Need help with more than just one index
Hello, I am wondering if there is a chance to use solr with more than just one index ? Is there a chance a could switch to another index if I want to search another context ? for example : searching for books : use index1 (schema1.xml) searching for magazines : use index 2 (schema2.xml) please help me... -- Greets -Ralf-
Re: feeding documents tru API
Cam Bazz schrieb: Hello, I have been looking at the API documentation but I dont know where to look in order to feed documents tru API without using xml files. any ideas? Look for the SolrIndexWriter class... http://lucene.apache.org/solr/api/org/apache/solr/update/SolrIndexWriter.html -- Greets -Ralf-
Problems with WordDelimiterFilterFactory
Hello, I am playing arount with WordDelimiterFilterFactory and run into some problems... When I search with RedElectronicLed I got this entries : Red - Electronic - Led = Perfect ! But I don´t get RedElectronicLed entries in my DB :-( Is there a chance I will get both ? Red - Electronic - Led and redelectronicled ? -- Greetings Ralf Kraus