Getting wrong version number when building Solr 1.4
I checked out the 1.4.0 release from the tag, but when I build, 1.4.1-dev is appended to the build artifacts. Thank you, Nasseam http://bodukai.com
Hierarchical Facet Sorting
We have &facet.sort=true set; however, our hierarchical fields are being sorted alphabetically instead of by count. Here are the field type and field definitions: omitNorms="true" positionIncrementGap="0" indexed="true" stored="false" delimiter="/"/> multiValued="true"/> Any help is much appreciated. Thank you, Nasseam Elkarra http://bodukai.com/solr/ The fastest possible shopping experience. Now with Solr integration.
Solr Demo at SF New Tech Meetup
Hello all, For those of you in the Bay Area, we will be demoing our Bodukai Boutique product at the SF New Tech Meetup on Wednesday, Oct. 14: http://sfnewtech.com/2009/10/05/1014-sf-new-tech-bodukai-yourversion-meehive-and-more/ Bodukai Boutique is the fastest ecommerce search and navigation solution: http://bodukai.com/boutique/ We will be demoing our Solr integration and all are welcome to come. Thank you, Nasseam Elkarra http://bodukai.com/boutique/ The fastest possible shopping experience
Re: Importing CSV file slow/crashes
Hello Yonik, Thank you for looking into this. Your question about whether I'm using stock Solr pointed me in the right direction. I am in fact using a patched version of Solr to get hierarchical facet support (http://issues.apache.org/jira/browse/SOLR-64). I took out the 4 hiefacet fields from the schema and the import was back to normal times of less than a minute. This same configuration worked fine with the 5/1 patched build. Here is the field definition: omitNorms="true" positionIncrementGap="0" indexed="true" stored="false" delimiter="/" /> multiValued="true"/> stored="true" multiValued="true"/> stored="false" multiValued="true"/> stored="false" multiValued="true"/> CSV file snippet: category,category_seo "T-Shirt Mens/Crew Neck/","t-shirt-mens/crew-neck/" Thanks again! Nasseam On Oct 6, 2009, at 3:22 PM, Yonik Seeley wrote: On Tue, Oct 6, 2009 at 1:06 PM, Nasseam Elkarra wrote: I had a dev build of 1.4 from 5/1/2009 and importing a 20K row took less than a minute. Updating to the latest as of yesterday, the import is really slow and I had to cancel it after a half hour. This prevented me from upgrading a few months ago as well. I haven't had any success at replicating this problem. I just tried a 100K row CSV file, consisting of an id and a few text fields. The total size of the file is 79MB. On trunk (today): 22 seconds to index, another 5-7 seconds to commit 5/21 version: 28 seconds to index, another 8 seconds to commit Then I modified the 5/1 schema to closer match the trunk schema (removing defaults, copyfields that could slow things down). Modified 5/1 version: 25 seconds to index, another 8 seconds to commit I only did 2 runs with trunk and 2 with one from 5/1, so the accuracy is probably low... but good enough to see there wasn't a problem in this test. We really need more info to help reproduce this. Are you using stock solr? Do you have any custom plugins, analyzers, token filters, etc? You're going to need to provide something so others can reproduce this. 
-Yonik http://www.lucidimagination.com
Re: Importing CSV file slow/crashes
Hello Erick, Sorry about that. I'm using the CSV update handler. Uploading a local CSV using the stream.file parameter. There are 94 fields and 36 copyFields. Thank you, Nasseam On Oct 6, 2009, at 10:09 AM, Erick Erickson wrote: Well, without some better idea of *how* you're doing the import, it's a little hard to say anything meaningful (hint, hint). Best Erick On Tue, Oct 6, 2009 at 1:06 PM, Nasseam Elkarra wrote: Hello all, I had a dev build of 1.4 from 5/1/2009 and importing a 20K row took less than a minute. Updating to the latest as of yesterday, the import is really slow and I had to cancel it after a half hour. This prevented me from upgrading a few months ago as well. Any ideas as to the cause of this? Thank you, Nasseam Elkarra http://bodukai.com/boutique/ The fastest possible shopping experience.
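For readers unfamiliar with the import method described above, here is a minimal sketch of how a request to the CSV update handler with a stream.file parameter might be assembled. The host, port, and file path are hypothetical placeholders, not values from this thread, and the sketch assumes remote streaming is enabled in solrconfig.xml. It only builds the URL; it does not send anything.

```python
from urllib.parse import urlencode, urlsplit, parse_qs


def csv_update_url(base_url, csv_path, commit=True):
    """Build a URL for Solr's CSV update handler that reads a server-local file.

    base_url and csv_path are hypothetical examples; stream.file and
    stream.contentType are the standard content-stream parameters.
    """
    params = {
        "stream.file": csv_path,
        "stream.contentType": "text/plain;charset=utf-8",
    }
    if commit:
        # Ask Solr to commit once the file has been loaded.
        params["commit"] = "true"
    return base_url.rstrip("/") + "/update/csv?" + urlencode(params)


# Hypothetical host and path, for illustration only.
url = csv_update_url("http://localhost:8983/solr", "/tmp/products.csv")
print(url)
```

The path in stream.file is resolved on the Solr server, not on the client, so the file has to be readable by the servlet container.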
Hierarchical Facet Field Prefix Not Working
Hello all, We are using the patch from SOLR-64 (http://issues.apache.org/jira/browse/SOLR-64) to implement hierarchical facets for categories. We are trying to use the facet.prefix to prevent all categories from coming back. However, f.category.facet.prefix doesn't work. Using facet.prefix works but prevents the other facets from coming back since it is a global option. Are per-facet options supported on hierarchical facet fields? If not, how can I get a specific category and its children without getting the surrounding categories? Any help is much appreciated. Thank you, Nasseam Elkarra http://bodukai.com/boutique/ The fastest possible shopping experience.
Re: Joins or subselects in solr
Thanks for the advice. Glad you like the site! Yes, we are using the new multi-select support to do this. At first, we wrote our own client-side solution to allow for multi-select which was super complex but moved to the built-in solr support as soon as we found it. Excellent feature! -Nasseam Solr-powered Ajax search+nav: http://factbook.bodukai.com/ Powered by Boutique: http://bodukai.com/boutique/ On Jun 14, 2009, at 8:16 AM, Yonik Seeley wrote: On Fri, Jun 12, 2009 at 6:43 PM, Nasseam Elkarra wrote: I am storing items in an index. Each item has a comma separated list of related items. Is it possible to bring back an item and all of its related items in one query? If so how and how would you distinguish between which one is the main item and which are the related. You would currently need a custom query component to do this... not sure if it's worth it vs two requests. Solr-powered Ajax search+nav: http://factbook.bodukai.com/ Cool, nice multi-select! Are you using Solr's new multi-select support to do this? -Yonik http://www.lucidimagination.com
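The two-request approach mentioned above could look roughly like the following sketch: fetch the main item first, then turn its comma-separated list of related items into a second query. The field names "id" and "related" are assumptions for illustration, as is the idea that the list holds document ids; the thread does not say how the field is named or what it contains.

```python
def related_ids(related_field):
    """Split a comma-separated list of related item ids, dropping blanks."""
    return [part.strip() for part in related_field.split(",") if part.strip()]


def related_items_query(main_doc, id_field="id", related_field="related"):
    """Build the query string for the second request.

    main_doc is the document returned by the first request, as a dict.
    Returns None when the item has no related items. The field names are
    hypothetical defaults, and ids are quoted without further escaping.
    """
    ids = related_ids(main_doc.get(related_field, ""))
    if not ids:
        return None
    quoted = " OR ".join('"%s"' % item_id for item_id in ids)
    return "%s:(%s)" % (id_field, quoted)


# Hypothetical document from the first request.
main = {"id": "100", "related": "101, 102,103"}
second_query = related_items_query(main)
print(second_query)
```

Because the main item comes from the first response and the related items from the second, there is no ambiguity about which is which.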
Joins or subselects in solr
Hello, I am storing items in an index. Each item has a comma separated list of related items. Is it possible to bring back an item and all of its related items in one query? If so how and how would you distinguish between which one is the main item and which are the related. Any help is much appreciated. Thanks! Nasseam Solr-powered Ajax search+nav: http://factbook.bodukai.com/ Powered by Boutique: http://bodukai.com/boutique/
Re: Hierarchal Faceting Field Type
Hello Koji, I applied the patch: http://issues.apache.org/jira/browse/SOLR-64 It's working but I have a question about facet.prefix. When we add a facet.prefix it filters the facet results but not the query results. Is there a way to filter both? Thank you, Nasseam On Apr 14, 2009, at 3:32 PM, Koji Sekiguchi wrote: Nasseam Elkarra wrote: Background: Set up a system for hierarchical categories using the following scheme: level one# level one#level two# level one#level two#level three# Trying to find the right combination of field type and query to get the desired results. Saw some previous posts about hierarchical facets which helped in generating the right query but having an issue using the built-in text field, which ignores our delimiter, and the string field, which prevents us from doing a starts-with search. Does anyone have any insight into the field declaration? Any help is appreciated. Thank you. Out of need in my project, I'll get started on work for SOLR-64, expected any day. I'm thinking of introducing a field type for hierarchical facet. Koji
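On the facet.prefix question: facet.prefix only narrows the facet counts, so restricting the matching documents as well generally needs a separate filter query. The sketch below pairs facet.prefix with an fq that uses the prefix query parser. It is an assumption that this parser is available in the build in use and that it behaves as expected against the field type added by the SOLR-64 patch; the field name and prefix value are illustrative, borrowed from examples elsewhere in these threads.

```python
from urllib.parse import urlencode


def prefix_filtered_params(query, field, prefix):
    """Return request parameters that apply one prefix to both the facet
    counts (facet.prefix) and the result set (an fq prefix filter).

    Untested against the SOLR-64 hierarchical field type; treat as a sketch.
    """
    return [
        ("q", query),
        ("facet", "true"),
        ("facet.field", field),
        # Narrows which facet values are returned.
        ("facet.prefix", prefix),
        # Narrows which documents match, using the same raw prefix.
        ("fq", "{!prefix f=%s}%s" % (field, prefix)),
    ]


params = prefix_filtered_params("*:*", "category", "T-Shirt Mens/")
print(urlencode(params))
```

The prefix parser matches against indexed terms without analysis, so the prefix has to be written exactly as the terms appear in the index.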
Re: Stats field with decimal values
I get the error doing stats or facet query on an sfloat. So the following query:

q=*:*&facet=true&facet.mincount=1&facet.sort=true&facet.limit=-1&facet.field=salePrice

Gives the following error:

java.lang.StringIndexOutOfBoundsException: String index out of range: 2
	at java.lang.String.charAt(String.java:687)
	at org.apache.solr.util.NumberUtils.SortableStr2int(NumberUtils.java:129)
	at org.apache.solr.util.NumberUtils.SortableStr2float(NumberUtils.java:83)
	at org.apache.solr.util.NumberUtils.SortableStr2floatStr(NumberUtils.java:89)
	at org.apache.solr.schema.SortableFloatField.indexedToReadable(SortableFloatField.java:62)
	at org.apache.solr.request.SimpleFacets.getFieldCacheCounts(SimpleFacets.java:406)
	at org.apache.solr.request.SimpleFacets.getTermCounts(SimpleFacets.java:252)
	at org.apache.solr.request.SimpleFacets.getFacetFieldCounts(SimpleFacets.java:280)
	at org.apache.solr.request.SimpleFacets.getFacetCounts(SimpleFacets.java:170)
	at org.apache.solr.handler.component.FacetComponent.process(FacetComponent.java:72)
	at org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:195)
	at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131)
	at org.apache.solr.core.SolrCore.execute(SolrCore.java:1333)
	at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:303)
	at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:232)
	at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:235)
	at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206)
	at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:233)
	at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:191)
	at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:128)
	at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:102)
	at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:109)
	at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:286)
	at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:845)
	at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.process(Http11Protocol.java:583)
	at org.apache.tomcat.util.net.JIoEndpoint$Worker.run(JIoEndpoint.java:447)
	at java.lang.Thread.run(Thread.java:637)

Thank you, Nasseam http://factbook.bodukai.com/ Solr-Powered Search+Nav

On Apr 30, 2009, at 12:01 PM, Shalin Shekhar Mangar wrote: On Fri, May 1, 2009 at 12:27 AM, Nasseam Elkarra wrote: Hello, I'm getting an error when trying to create stats on an sfloat field. The field is for price and when there is no decimal it works fine but when there is a decimal (e.g., 24.99) I get an error: java.lang.StringIndexOutOfBoundsException: String index out of range: 2 Can you post the complete stack trace? -- Regards, Shalin Shekhar Mangar.
Stats field with decimal values
Hello, I'm getting an error when trying to create stats on an sfloat field. The field is for price and when there is no decimal it works fine but when there is a decimal (e.g., 24.99) I get an error: java.lang.StringIndexOutOfBoundsException: String index out of range: 2 Changing the field to float works but I need sfloat so that I can do range queries. Any help is appreciated. Thank you, Nasseam http://factbook.bodukai.com/ Solr-Powered Search+Nav
Re: Hierarchal Faceting Field Type
Thank you. We tried your suggestion but we are still getting the following problem: multiValued="true"/> Sample data: level one;level two;level three; level one;level two;level threeB; When we query for: level one;level two;level three;* We are getting back: level one;level two;level threeB; even though the B comes before the semicolon. Any idea why? Thank you, Nasseam Check out our solr-powered Ajax search+nav solution: http://factbook.bodukai.com/ Powered by Boutique: http://bodukai.com/boutique/ On Apr 17, 2009, at 3:10 PM, Chris Hostetter wrote: : level one# : level one#level two# : level one#level two#level three# : : Trying to find the right combination of field type and query to get the : desired results. Saw some previous posts about hierarchal facets which helped : in the generating the right query but having an issue using the built in text : field which ignores our delimiter and the string field which prevents us from : doing a start with search. Does anyone have any insight into the field : declaration? Use TextField, with a PatternTokenizer BTW: if this isn't a thread you've already seen, it's handy to know about... http://www.nabble.com/Hierarchical-Faceting-to20090898.html#a20176326 -Hoss
Hierarchal Faceting Field Type
Background: Set up a system for hierarchical categories using the following scheme: level one# level one#level two# level one#level two#level three# Trying to find the right combination of field type and query to get the desired results. Saw some previous posts about hierarchical facets which helped in generating the right query but having an issue using the built-in text field, which ignores our delimiter, and the string field, which prevents us from doing a starts-with search. Does anyone have any insight into the field declaration? Any help is appreciated. Thank you.
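To make the scheme above concrete, here is a small sketch of how the per-level values might be generated on the client before indexing, together with a plain starts-with check on the raw strings. The function names are hypothetical, and the sketch says nothing about how a particular Solr field type tokenizes these values; it only illustrates the string scheme itself.

```python
def path_tokens(levels, delimiter="#"):
    """Return one delimiter-terminated value per ancestor level of a category."""
    tokens = []
    prefix = ""
    for level in levels:
        prefix += level + delimiter
        tokens.append(prefix)
    return tokens


def starts_with(value, levels, delimiter="#"):
    """Check whether a raw value sits at or below the given category path."""
    return value.startswith(delimiter.join(levels) + delimiter)


tokens = path_tokens(["level one", "level two", "level three"])
for token in tokens:
    print(token)
```

Ending every level with the delimiter is what keeps sibling names apart on untokenized strings: "level one#level two#level three#" is not a prefix of "level one#level two#level threeB#", so a starts-with match on the first does not pick up the second.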
Re: Solr index deletion
The tool says there are no problems. Solr is pointing to the right directory so not sure what is preventing it from returning any results. Any ideas? Here is the output:

Segments file=segments_2 numSegments=1 version=FORMAT_USER_DATA [Lucene 2.9]
  1 of 1: name=_0 docCount=18021
    compound=false
    hasProx=true
    numFiles=9
    size (MB)=8.389
    has deletions [delFileName=_0_1.del]
    test: open reader...OK [18 deleted docs]
    test: fields, norms...OK [35 fields]
    test: terms, freq, prox...OK [60492 terms; 1157700 terms/docs pairs; 1224063 tokens]
    test: stored fields...OK [386828 total field count; avg 21.487 fields per doc]
    test: term vectors...OK [0 total vector count; avg 0 term/freq vector fields per doc]

No problems were detected with this index.

-- Thanks, Nasseam

On Mar 24, 2009, at 1:34 PM, Otis Gospodnetic wrote: There is, it's called CheckIndex and it is a part of Lucene (and Lucene jars that come with Solr, I believe): http://lucene.apache.org/java/2_4_1/api/org/apache/lucene/index/CheckIndex.html Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Nasseam Elkarra To: solr-user@lucene.apache.org Sent: Tuesday, March 24, 2009 4:21:50 PM Subject: Re: Solr index deletion Correction: index was not deleted. The folder is still there with the index files in it but a *:* query returns 0 results. Is there a tool to check the health of an index? Thanks, Nasseam On Mar 24, 2009, at 11:49 AM, Otis Gospodnetic wrote: Somehow that sounds very unlikely. Have you looked at logs? What have you found from Solr there? I am not checking the sources, but I don't think there is any place in Solr where the index directory gets deleted. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message ---- From: Nasseam Elkarra To: solr-user@lucene.apache.org Sent: Tuesday, March 24, 2009 2:35:22 PM Subject: Solr index deletion On a few occasions, our development server crashed and in the process solr deleted the index folder. 
We are suspecting another app on the server caused an OutOfMemoryException on Tomcat causing all apps including solr to crash. So my question is why is solr deleting the index? We are not doing any updates to the index only reading from it so any insight would be appreciated. Thank you, Nasseam
Re: Solr index deletion
Correction: index was not deleted. The folder is still there with the index files in it but a *:* query returns 0 results. Is there a tool to check the health of an index? Thanks, Nasseam On Mar 24, 2009, at 11:49 AM, Otis Gospodnetic wrote: Somehow that sounds very unlikely. Have you looked at logs? What have you found from Solr there? I am not checking the sources, but I don't think there is any place in Solr where the index directory gets deleted. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Nasseam Elkarra To: solr-user@lucene.apache.org Sent: Tuesday, March 24, 2009 2:35:22 PM Subject: Solr index deletion On a few occasions, our development server crashed and in the process solr deleted the index folder. We are suspecting another app on the server caused an OutOfMemoryException on Tomcat causing all apps including solr to crash. So my question is why is solr deleting the index? We are not doing any updates to the index only reading from it so any insight would be appreciated. Thank you, Nasseam
Solr index deletion
On a few occasions, our development server crashed and in the process solr deleted the index folder. We are suspecting another app on the server caused an OutOfMemoryException on Tomcat causing all apps including solr to crash. So my question is why is solr deleting the index? We are not doing any updates to the index only reading from it so any insight would be appreciated. Thank you, Nasseam
Multi-select on more than one facet field
Looking at the example here: http://wiki.apache.org/solr/SimpleFacetParameters#head-4ba81c89b265c3b5992e3292718a0d100f7251ef This being the query for selecting PDF: q=mainquery&fq=status:public&fq={!tag=dt}doctype:pdf&facet=on&facet.field={!ex=dt}doctype How would you write the query to select PDF OR Excel and, assuming there is another facet field named author, also restrict the results to author Mike? Thank you, Nasseam
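One possible way to extend the wiki example to two facet fields is sketched below: each field gets its own tagged filter query, values selected within one field are OR'ed inside that filter, and each facet.field excludes only its own tag. Using the field name as the tag is an arbitrary choice made here for illustration (the wiki example uses "dt"), and the helper does no escaping of special characters in values.

```python
from urllib.parse import urlencode


def multi_select_params(main_query, selections, base_filters=()):
    """Build request parameters for multi-select faceting on several fields.

    selections maps a facet field name to the list of values selected for it.
    Tag names are simply the field names, which is an assumption of this
    sketch and not something Solr requires.
    """
    params = [("q", main_query)]
    params += [("fq", flt) for flt in base_filters]
    params.append(("facet", "on"))
    for field, values in selections.items():
        tag = field
        if values:
            # Values within one field are alternatives, so OR them together.
            joined = " OR ".join('"%s"' % value for value in values)
            params.append(("fq", "{!tag=%s}%s:(%s)" % (tag, field, joined)))
        # Exclude this field's own filter when counting its facet values.
        params.append(("facet.field", "{!ex=%s}%s" % (tag, field)))
    return params


params = multi_select_params(
    "mainquery",
    {"doctype": ["pdf", "excel"], "author": ["Mike"]},
    base_filters=["status:public"],
)
print(urlencode(params))
```

Separate fq parameters are intersected, which gives the AND between doctype and author, while the OR lives inside the single doctype filter.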