Re: solr 1872
looks like it might actually be a zip file. try renaming/unzipping it. cheers, rob On Mon, Jul 30, 2012 at 2:50 PM, Sujatha Arun suja.a...@gmail.com wrote: I am uable to use the rar file from the site https://issues.apache.org/jira/browse/SOLR-1872. When I try to open it,I get the message 'SolrACLSecurity.rar is not RAR archive. Is the file there at this link? Regards Sujatha
Re: Two unrelated questions
for #1, i don't use DIH, but is there any possibility of that column having duplicate keys, with subsequent docs replacing existing ones? and for #2, for some cases you could use a negative filterquery: http://wiki.apache.org/solr/SimpleFacetParameters#Retrieve_docs_with_facets_missing so instead of that fq=-facetField:[* TO *], something like fq=-car_make:Taurus. picking negatives might even make the UI a bit easier. anyway, just some thoughts. cheers, rob On Wed, Sep 21, 2011 at 5:17 PM, Olson, Ron rol...@lbpc.com wrote: Thanks for the reply. As far as #1, my table that I'm indexing via DIH has a PK field, generated by a sequence, so there are records with ID of 1, 2, 3, etc. That same id is the one I use in my unique id field in the document (uniqueKeyID/uniqueID). I've noticed that the table has, say, 10 rows. My index only has 8. I don't know why that is, but I'd like to figure out which records are missing and add them (and hopefully understand why they weren't added in the first place). I was just wondering if there was some way to compare the two as part of a sql query, but on reflection, it does seem like an absurd request, so I apologize; I think what I'll have to do is write a solrj program that gets every ID in the table, then does a search on that ID in the index, and add the ones that are missing. Regarding the second item, yes, it's crazy but I'm not sure what to do; there really are that many options and some searches will be extremely specific, yet broad enough in terms for this to be a problem. -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Wednesday, September 21, 2011 3:55 PM To: solr-user@lucene.apache.org Subject: Re: Two unrelated questions for 1 I don't quite get what you're driving at. Your DIH query assigns the uniqueKey, it's not like it's something auto-generated. Perhaps a concrete example would help. 2 There's a limit you can adjust that defaults to 1024 (maxBooleanClauses in solrconfig.xml). You can bump this very high, but you're right, if anyone actually does something absurd it'll slow *that* query down. But just bumping this query higher won't change performance absent someone actually putting a ton of items in it... Best Erick On Mon, Sep 19, 2011 at 9:12 AM, Olson, Ron rol...@lbpc.com wrote: Hi all- I'm not sure if I should break this out into two separate questions to the list for searching purposes, or if one is more acceptable (don't want to flood). I have two (hopefully) straightforward questions: 1. Is it possible to expose the unique ID of a document to a DIH query? The reason I want to do this is because I use the unique ID of the row in the table as the unique ID of the Lucene document, but I've noticed that the counts of documents doesn't match the count in the table; I'd like to add these rows and was hoping to avoid writing a custom SolrJ app to do it. 2. Is there any limit to the number of conditions in a Boolean search? We're working on a new project where the user can choose either, for example, Ford Vehicles, in which case I can simply search for Ford, but if the user chooses specific makes and models, then I have to say something like Crown Vic OR Focus OR Taurus OR F-150, etc., where they could theoretically choose every model of Ford ever made except one. This could lead to a *very* large query, and was worried both that it was even possible, but also the impact on performance. Thanks, and I apologize if this really should be two separate messages. Ron DISCLAIMER: This electronic message, including any attachments, files or documents, is intended only for the addressee and may contain CONFIDENTIAL, PROPRIETARY or LEGALLY PRIVILEGED information. If you are not the intended recipient, you are hereby notified that any use, disclosure, copying or distribution of this message or any of the information included in or with it is unauthorized and strictly prohibited. If you have received this message in error, please notify the sender immediately by reply e-mail and permanently delete and destroy this message and its attachments, along with any copies thereof. This message does not create any contractual obligation on behalf of the sender or Law Bulletin Publishing Company. Thank you. DISCLAIMER: This electronic message, including any attachments, files or documents, is intended only for the addressee and may contain CONFIDENTIAL, PROPRIETARY or LEGALLY PRIVILEGED information. If you are not the intended recipient, you are hereby notified that any use, disclosure, copying or distribution of this message or any of the information included in or with it is unauthorized and strictly prohibited. If you have received this message in error, please notify the sender immediately by reply e-mail and permanently delete and destroy this message and its attachments, along with any
Re: synonyms.txt: different results on admin and on site..
you should probably post your schema.xml and some parts of your synonyms.txt. it could be differences between your index and query analysis chains, synonym expansion errors, etc, but folks will likely need more details to help you out. cheers, rob On Wed, Sep 7, 2011 at 9:46 PM, deniz denizdurmu...@gmail.com wrote: could it be related with analysis issue about synonyms once again? - Zeki ama calismiyor... Calissa yapar... -- View this message in context: http://lucene.472066.n3.nabble.com/synonyms-txt-different-results-on-admin-and-on-site-tp3318338p3318464.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: Solr warming when using master/slave replication
it's always been my understanding that the caches are discarded, then rebuilt/warmed: http://wiki.apache.org/solr/SolrCaching#Caching_and_Distribution.2BAC8-Replication hth, rob On Mon, Aug 29, 2011 at 5:30 PM, Mike Austin mike.aus...@juggle.com wrote: How does warming work when a collection is being distributed to a slave. I understand that a temp directory is created and it is eventually copied to the live folder, but what happens to the cache that was built in with the old index? Does the cache get rebuilt, can we warm it before it becomes live, or can we keep the old cache? Thanks, Mike
Re: Exact matching on names?
exact can mean a lot of things (do diacritics count?, etc), but in this case, it sounds like you just need to turn off the stemmer you have on this fieldtype (or create a new one that doesn't include the stemmer). hth, rob On Tue, Aug 16, 2011 at 11:20 AM, Olson, Ron rol...@lbpc.com wrote: Hi all- I'm missing something fundamental yet I've been unable to find the definitive answer for exact name matching. I'm indexing names using the standard text field type and my search is for the name clarke. My results include clark, which is incorrect, it needs to match clarke exactly (case insensitive). I tried textType but that doesn't work because I believe it needs to be *really* exact, whereas I'm looking for things like clark oil, bob, frank, and clark, etc. Thanks for any help, Ron DISCLAIMER: This electronic message, including any attachments, files or documents, is intended only for the addressee and may contain CONFIDENTIAL, PROPRIETARY or LEGALLY PRIVILEGED information. If you are not the intended recipient, you are hereby notified that any use, disclosure, copying or distribution of this message or any of the information included in or with it is unauthorized and strictly prohibited. If you have received this message in error, please notify the sender immediately by reply e-mail and permanently delete and destroy this message and its attachments, along with any copies thereof. This message does not create any contractual obligation on behalf of the sender or Law Bulletin Publishing Company. Thank you.
Re: Searching for strings
chip, gonna need more information about your particular analysis chain, content, and example searches to give a better answer, but phrase queries (using quotes) are supported in both the standard and dismax query parsers that being said, lots of things may not match a person's idea of an exact string...stopwords, synonyms, slop, etc. cheers, rob On Mon, Jul 18, 2011 at 5:25 PM, Chip Calhoun ccalh...@aip.org wrote: Is there a way to search for a specific string using Solr, either by putting it in quotes or by some other means? I haven't been able to do this, but I may be missing something. Thanks, Chip
Re: Can I invert the inverted index?
sounds like the Luke request handler will get what you're after: http://wiki.apache.org/solr/LukeRequestHandler http://wiki.apache.org/solr/LukeRequestHandler#id cheers, rob On Tue, Jul 5, 2011 at 3:59 PM, Gabriele Kahlout gabri...@mysimpatico.com wrote: Hello, With an inverted index the term is the key, and the documents are the values. Is it still however possible that given a document id I get the terms indexed for that document? -- Regards, K. Gabriele --- unchanged since 20/9/10 --- P.S. If the subject contains [LON] or the addressee acknowledges the receipt within 48 hours then I don't resend the email. subject(this) ∈ L(LON*) ∨ ∃x. (x ∈ MyInbox ∧ Acknowledges(x, this) ∧ time(x) Now + 48h) ⇒ ¬resend(I, this). If an email is sent by a sender that is not a trusted contact or the email does not contain a valid code then the email is not received. A valid code starts with a hyphen and ends with X. ∀x. x ∈ MyInbox ⇒ from(x) ∈ MySafeSenderList ∨ (∃y. y ∈ subject(x) ∧ y ∈ L(-[a-z]+[0-9]X)).
Re: [Handling] empty fields
i also thought of the lengthFilter stuff, provided it's a text/KeywordTokenizer field: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.LengthFilterFactory cheers, rob On Wed, Jun 15, 2011 at 12:00 PM, Erick Erickson erickerick...@gmail.com wrote: Have you tried setting 'facet.missing=false '? See: http://wiki.apache.org/solr/SimpleFacetParameters#facet.missing Best Erick On Wed, Jun 15, 2011 at 11:52 AM, Adam Estrada estrada.adam.gro...@gmail.com wrote: All, I have a field foo with several thousand blank or non-existing records in it. This is also my faceting field. My question is, how can I deal with this field so that I don't get a blank facet at query time? int name=5000/int vs. int name=Flickr1000/int Adam
Re: indexing numbers
the default schema.xml provided in the Solr distribution is well-documented, and a good place to get started (including numeric fieldTypes): http://wiki.apache.org/solr/SchemaXml Lucid Imagination also provides a nice reference guide: http://www.lucidimagination.com/Downloads/LucidWorks-for-Solr/Reference-Guide hope that helps, rob On Wed, May 25, 2011 at 6:20 PM, antoniosi antonio...@gmail.com wrote: Hi, How does solr index a numeric value? Does it index it as a string or does it keep it as a numeric value? Thanks. -- View this message in context: http://lucene.472066.n3.nabble.com/indexing-numbers-tp2986424p2986424.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: aliasing?
a lot of this stuff is covered in the tutorial, and expanded in the wiki. still the best places to start in figuring out the fundamentals: http://lucene.apache.org/solr/tutorial.html http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.SynonymFilterFactory hth, rc On Mon, May 9, 2011 at 9:09 PM, deniz denizdurmu...@gmail.com wrote: well... if i knew what to do about aliasing, i wouldnt post my question here Grijesh :) My idea is this: for some search queries, I need to provide some synonyms... But it is just an idea... -- View this message in context: http://lucene.472066.n3.nabble.com/aliasing-tp2917733p2921305.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: *:* query with dismax
it does seem a little weird, but q.alt will get what you want: http://wiki.apache.org/solr/DisMaxQParserPlugin#q.alt hth, rc On Fri, May 6, 2011 at 7:41 PM, Jason Chaffee jchaf...@ebates.com wrote: Can you shed some light on what you did to configure it to handle *:*? I have the same issue that I need it to work for faceting, but I do need the dismax abilities as well. -Original Message- From: Mark Mandel [mailto:mark.man...@gmail.com] Sent: Friday, May 06, 2011 4:30 PM To: solr-user@lucene.apache.org Subject: Re: *:* query with dismax This is exactly what should be happening, as the dismax parser doesn't understand regular query syntax (and for good reason too). This tripped me up as well when I first started using dismax. Solution for me was to comfigure the handler to use *:* when the query is empty, so that you can still get back a full result set if you need it, say for faceting. HTH Mark On May 7, 2011 9:22 AM, Jason Chaffee jchaf...@ebates.com wrote: I am using dismax and trying to use q=*:* to return all indexed documents. However, it is always returning 0 found. If I used the default select (not dismax) handler and try q=*:* then it returns all documents. There is nothing in the logs to indicate why this happening. Does anyone have any clues? Thanks, Jason
Re: querying in Java
copyField should do the trick: http://wiki.apache.org/solr/SchemaXml#Copy_Fields A common requirement is to copy or merge all input fields into a single solr field. This can be done as follows:- copyField source=* dest=text/ hth, rob On Fri, Apr 29, 2011 at 2:06 PM, Saler, Jeff jsa...@ball.com wrote: Thanks for the reply. What I want is for the query to search all fields for the specified value. -Original Message- From: Anuj Kumar [mailto:anujs...@gmail.com] Sent: Friday, April 29, 2011 1:51 PM To: solr-user@lucene.apache.org Subject: Re: querying in Java Hi Jeff, In that case, it will query w.r.t default field. What is your default search field in the schema? Regards, Anuj On Fri, Apr 29, 2011 at 11:10 PM, Saler, Jeff jsa...@ball.com wrote: Is there any way to query for data that is in any field, i.e. not using a specific field name? For example, when I use the following statements: SolrQuery query = new SolrQuery(); Query.setQuery(ANALYST:John Schummers); QueryResponse rsp = server.query(query); I get the documents I'm looking for. But I would like to get the same set of documents without using the specific ANALYST field name. I have tried using just Schummers as the query, but no documents are returned. The ANALYST field is an indexed field. This message and any enclosures are intended only for the addressee. Please notify the sender by email if you are not the intended recipient. If you are not the intended recipient, you may not use, copy, disclose, or distribute this message or its contents or enclosures to any other person and any such actions may be unlawful. Ball reserves the right to monitor and review all messages and enclosures sent to or from this email address. This message and any enclosures are intended only for the addressee. Please notify the sender by email if you are not the intended recipient. If you are not the intended recipient, you may not use, copy, disclose, or distribute this message or its contents or enclosures to any other person and any such actions may be unlawful. Ball reserves the right to monitor and review all messages and enclosures sent to or from this email address.
Re: Result order when score is the same
you could just explicitly send multiple sorts...from the tutorial: sort=inStock asc, price desc cheers. On Wed, Apr 13, 2011 at 2:59 PM, kenf_nc ken.fos...@realestate.com wrote: Is sort order when 'score' is the same a Lucene thing? Should I ask on the Lucene forum? -- View this message in context: http://lucene.472066.n3.nabble.com/Result-order-when-score-is-the-same-tp2816127p2817330.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: Facet Query Question
if i'm understanding your question, it sounds like localparams/tagging/exclusion might be what you want: http://wiki.apache.org/solr/SimpleFacetParameters#Multi-Select_Faceting_and_LocalParams hth, rob On Sun, Feb 27, 2011 at 6:41 PM, Tim Christensen tim.christen...@vanns.net wrote: Hi, I am trying to do the following: Where a query might return: Facet 1 A B C Facet 2 X Y Z User selects Facet 1 option A. Normally this paradigm would contract the results as in a refining paradigm. That would be fine and the obvious UI change. But by doing so, Facet 2 option X is no longer available -- again because of the refining. Let's say I still wanted Facet 2 option X to be available to instead of refining, expands the results. Normally, my query might look like: q=queryfq=Facet 1:A (for the first part of my question. What I have done is return two sets of facet results, one for the main query and one for the refined query. That way I can still offer option X. What I don't know how to do is query beyond that. I have tried some ORs and ANDs in my unit tests, but don't think this is the right way. My question is whether there is a way in a single query to bring back all the original facets regardless of any facet refining. If not, give that I return two sets of facets - a refined set and the 'original' querys' facet set, how would I fashion this query? My apologies if this is rookie, I have a few years of Solr under my belt, but can't think outside the refining and then expanding the result set with a facet query that was available in the original query results. Thank you, Tim Christensen
Re: Copying the index from one solr instance to another
just making sure that you're aware of the built-in replication: http://wiki.apache.org/solr/SolrReplication can pull the indexes, along with config files. cheers, rob 2010/12/15 Robert Gründler rob...@dubture.com: Hi again, let's say you have 2 solr Instances, which have both exactly the same configuration (schema, solrconfig, etc). Could it cause any troubles if we import an index from a SQL database on solr instance A, and copy the whole index to the datadir of solr instance B (both solr instances run on different servers) ?. As far as i can tell, this should work and solr instance B should have the exact same index as solr instance A after the copy-process. Do we miss something, or is this workflow safe to go with? -robert
Re: command line to check if Solr is up running
you could look at the ping stuff: http://wiki.apache.org/solr/SolrConfigXml#The_Admin.2BAC8-GUI_Section cheers, rob On Mon, Oct 25, 2010 at 3:56 PM, Xin Li x...@book.com wrote: As we know we can use browser to check if Solr is running by going to http://$hostName:$portNumber/$masterName/admin, say http://localhost:8080/solr1/admin. My questions is: are there any ways to check it using command line? I used curl http://localhost:8080; to check my Tomcat, it worked fine. However, no response if I try curl http://localhost:8080/solr1/admin; (even when my Solr is running). Does anyone know any command line alternatives? Thanks, Xin This electronic mail message contains information that (a) is or may be CONFIDENTIAL, PROPRIETARY IN NATURE, OR OTHERWISE PROTECTED BY LAW FROM DISCLOSURE, and (b) is intended only for the use of the addressee(s) named herein. If you are not an intended recipient, please contact the sender immediately and take the steps necessary to delete the message completely from your computer system. Not Intended as a Substitute for a Writing: Notwithstanding the Uniform Electronic Transaction Act or any other law of similar effect, absent an express statement to the contrary, this e-mail message, its contents, and any attachments hereto are not intended to represent an offer or acceptance to enter into a contract and are not otherwise intended to bind this sender, barnesandnoble.com llc, barnesandnoble.com inc. or any other person or entity.
Re: Search Results optimization
you might find these helpful...similar question came up last week: http://ln-s.net/7WpX http://robotlibrarian.billdueber.com/solr-forcing-items-with-all-query-terms-to-the-top-of-a-solr-search/ not exactly the same, as this case wanted to boost if *every* term matched, but a similar tactic may workhope that helps, rob On Thu, Aug 26, 2010 at 4:02 PM, Hasnain hasn...@hotmail.com wrote: perhaps i wasnt clear in my earlier post if user searches for swingline red stapler hammer hand rigid, then documents that matches max number of words written in query should come first e.g a document with name field as swingline stapler should come later than the document with swingline red stapler any suggestions how to achieve this functionality? -- View this message in context: http://lucene.472066.n3.nabble.com/Search-Results-optimization-tp1129374p1359916.html Sent from the Solr - User mailing list archive at Nabble.com.
spellcheck suggestion ordering and distance measures
poking around at the spellcheck component, and have a couple questions: 1) is there a way to return the distance measure with spellcheck.extendedResults? haven't poked too closely at the source, but it might be useful. 2) i'm not entirely clear on the order in which suggestions are returned. for a search of porgrams against a subset of my data, i get these suggestions...i've added my own levenshtein calculations, provided they're correct: program: 3 pulgram: 3 porras: 2 portrait: 3 relevant section of my solrconfig.xml: str name=queryAnalyzerFieldTypetext/str lst name=spellchecker str name=namedefault/str str name=fieldspell/str str name=spellcheckIndexDir./spellchecker/str /lst granted, i like that 'program' is at the top, and it has the highest frequency in my corpus, but just want to make sure i can reliably interpret these results. cheers, and apologies if i'm just being dense, rob
Re: filter querying working on dynamic int fields but not dynamic string fields?
http://localhost:8983/solr/select?indent=onversion=2.2q=climatefq=awardinstrument_s:Continuing+grant str name=awardinstrument_sContinuing grant /str everything that erik already mentioned, but looks like you also have a trailing space in the document, so even quoting it would require that last space.
Re: case-insensitive string type
from http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters On wildcard and fuzzy searches, no text analysis is performed on the search word. i'd just lowercase the wildcard-ed search term in your client code, before you send it to solr. hth, rob On Wed, Jan 13, 2010 at 2:18 PM, Harsch, Timothy J. (ARC-TI)[PEROT SYSTEMS] timothy.j.har...@nasa.gov wrote: Hi I have a field: field name=srcANYSTRStrCI type=string_ci indexed=true stored=true multiValued=true / With type definition: !-- A Case insensitive version of string type -- fieldType name=string_ci class=solr.StrField sortMissingLast=true omitNorms=true analyzer type=index tokenizer class=solr.KeywordTokenizerFactory/ filter class=solr.LowerCaseFilterFactory / /analyzer analyzer type=query tokenizer class=solr.KeywordTokenizerFactory/ filter class=solr.LowerCaseFilterFactory / /analyzer /fieldType When searching that field I can't get a case-insensitive match. It works as if it is a regular string, for instance I can do a prefix query and so long as the prefix matches the case of the value it works, but if I change the prefix case it doesn't Essentially I am trying to get case-insensitive matching that supports wild cards... Tim Harsch Sr. Software Engineer Dell Perot Systems (650) 604-0374
Re: query ( dynamicField ) in solr
sounds like a job for copyField: http://wiki.apache.org/solr/FAQ#How_do_I_use_copyField_with_wildcards.3F add sku_defaultPriceAll to your fields and then: copyField source=sku_defaultPrice_* dest=sku_defaultPriceAll / ...query on sku_defaultPriceAll. hth, rob On Tue, Dec 22, 2009 at 3:36 PM, Gustavo g.macha...@gmail.com wrote: Hello All, I have been trying to make a query at a dynamicField. for example: i have a product that can have many sku's. So at my xsd i have dynamicField name=sku_externalId_* type=text indexed=true stored=true / dynamicField name=sku_defaultPrice_* type=text indexed=true stored=true / . . . when a make a query: http://localhost:8983/solr/productIdx/select/?q=sku_defaultPrice_2:valueOfThePrice or http://localhost:8983/solr/productIdx/select/?q=sku_defaultPrice_1:valueOfThePriceor sku_defaultPrice_2:valueOfThePrice or sku_defaultPrice_3:valueOfThePrice it works !! but i can have many sku_defaultPrice_* but i have no idea in how to make the query with this kind of field thanks for helping -- -- Atenciosamente, Gustavo de Araujo Lima Machado Telefone: +55 +21 *** Celular: +55+21 99034401 E-Mail: g.macha...@gmail.com.br Residência :
Re: How to get Solr 1.4 to replicate spellcheck directories as well?
i don't think that's currently supported, but sure others will correct me if i'm wrong: http://www.lucidimagination.com/search/document/ac8cf41bdb761069/solr_replication_and_spellcheck_data cheers, rob On Wed, Dec 16, 2009 at 10:08 AM, michael8 mich...@saracatech.com wrote: I'm currently using Solr 1.4 with its built-in solr.ReplicationHandler enabled in solrconfig.xml for a master and slave as follows: requestHandler name=/replication class=solr.ReplicationHandler lst name=master str name=enable${enable.master:false}/str str name=replicateAftercommit/str str name=replicateAfterstartup/str str name=confFilesschema.xml,protwords.txt,spellings.txt,stopwords.txt,synonyms.txt/str /lst lst name=slave str name=enable${enable.slave:false}/str str name=masterUrlhttp://searchhost:8983/solr/items/replication/str str name=pollInterval00:00:60/str /lst /requestHandler Everything in the index is replicated perfectly except that my spellcheck directories are not being replicated. Here is my spellcheck config in solrconfig.xml: searchComponent name=spellcheck class=solr.SpellCheckComponent str name=queryAnalyzerFieldTypetextSpell/str lst name=spellchecker str name=namedefault/str str name=fieldspell/str str name=spellcheckIndexDir./spellchecker1/str str name=buildOnCommitfalse/str /lst lst name=spellchecker str name=namejarowinkler/str str name=fieldspell/str !-- Use a different Distance Measure -- str name=distanceMeasureorg.apache.lucene.search.spell.JaroWinklerDistance/str str name=spellcheckIndexDir./spellchecker2/str str name=buildOnCommitfalse/str /lst lst name=spellchecker str name=classnamesolr.FileBasedSpellChecker/str str name=namefile/str str name=sourceLocationspellings.txt/str str name=characterEncodingUTF-8/str str name=spellcheckIndexDir./spellcheckerFile/str str name=buildOnCommitfalse/str /lst /searchComponent I have set the buildOnCommit to 'false', but instead have a separate cron to build my spellcheck dictionaries on a nightly basis. Is there a way to tell Solr to also replicate the spellcheck files too? Is my setting 'buildOnCommit' to 'false' causing my spellcheck files to not replicate? I would think after the nightly build is triggered and done (via cron) that the spellcheck files would be replicated by that is not the case. Thanks for any help or info. Michael -- View this message in context: http://old.nabble.com/How-to-get-Solr-1.4-to-replicate-spellcheck-directories-as-well--tp26812569p26812569.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: Upgrading from 1.2.0 to 1.3.0
this isn't advice on how to upgrade, but if you/your-project have a bit of time to wait, 1.4 sounds like it's getting close to an official releasefyi. cheers, rob On Tue, May 5, 2009 at 1:05 PM, Francis Yakin fya...@liquid.com wrote: What's the best way to upgrade solr from 1.2.0 to 1.3.0 ? We have the current index that our users search running on 1.2.0 Solr version. We would like to upgrade it to 1.3.0? We have Master/Slaves env. What's the best way to upgrade it without affecting the search? Do we need to do it on master or slaves first? Thanks Francis
Re: Filter query with wildcard, fq=a*
it sounds to me like the field you're using (artistText) is tokenized and lowercased. it might be good to go over the wiki pages again: http://wiki.apache.org/solr/SolrFacetingOverview if you keep having problems, post your schema...cheers, rob On Thu, Apr 30, 2009 at 10:21 AM, Andrew McCombe eupe...@gmail.com wrote: Hi It has now introduced a new issue. I am now getting results for my query but they are not what I wanted. I'm using a Dismax handler and am trying to filter a result set with a filter query: http://localhost:8180/solr/select?indent=onversion=2.2q=i+lovestart=0rows=30fl=*%2Cscoreqt=dismaxwt=standardfq=artistText:e* This is bringing results with 'e' or 'E' as the first letter and with 'e' or 'E' as the first letter after whitespace. e.g, Eric Prydz, Marc Et Claude. Looking at the solr wiki it seems that I am using the fq param to boost the results and not filter them. Is there a way to achieve the required result, i.e, filtering the result set for all artistText beginning with 'E'? Regards Andrew 2009/4/30 Andrew McCombe eupe...@gmail.com Hi I've sorted it. Wildcard term must be lowercase to get results. Thanks Andrew 2009/4/30 Andrew McCombe eupe...@gmail.com Hi I have half a million records indexed and need to filter results on a term by the first letter. For Example the search term is 'I love' and returns a few thousand records. I need to filter these results for all artists beginning with 'A'. I've tried: 'fq=artistText:A*' But then get no results. Can someone point me in the right direction please? Regards Andrew
Re: new faceting algorithm
very similar situation to those already reported. 2.9M bilbiographic records, with authors being the (previous) bottleneck, and the one we're starting to test with the new algorithm. so far, no load tests, but just in single requests i'm seeing the same improvements...phenomenal improvements, btw, with most example queries taking less than 1/100th of the time always very impressed with this project/product, and just thought i'd add a me-too to the list...cheers, and have a great weekend, rob On Mon, Nov 24, 2008 at 11:12 PM, Yonik Seeley [EMAIL PROTECTED] wrote: A new faceting algorithm has been committed to the development version of Solr, and should be available in the next nightly test build (will be dated 11-25). This change should generally improve field faceting where the field has many unique values but relatively few values per document. This new algorithm is now the default for multi-valued fields (including tokenized fields) so you shouldn't have to do anything to enable it. We'd love some feedback on how it works to ensure that it actually is a win for the majority and should be the default. -Yonik
Re: Compiling Solr 1.3.0 + KStem
i've experimented with the KStem stuff in the past, and just pulled a fresh copy of solr from trunk it looks like Hoss' suggestion #1 does the trick, by simply commenting out the super.init call...loaded the example data, tested some analysis, and it seems to work as before. just a confirmation, and thanks, rob On Fri, Nov 28, 2008 at 6:18 PM, Chris Hostetter [EMAIL PROTECTED] wrote: : /usr/local/build/apache-solr-1.3.0/src/java/org/apache/solr/analysis/ : KStemFilterFactory.java:63: : cannot find symbol : [javac] symbol : method : init(org.apache : .solr.core.SolrConfig,java.util.Mapjava.lang.String,java.lang.String) : [javac] location: class org.apache.solr.analysis.BaseTokenFilterFactory : [javac] super.init(solrConfig, args); : [javac] ^ that KStemFilterFactory seems to be trying to use a method that existed for a while on the trunk, but was never released. i'm not familiary with KStemFilterFactory to know why/if it needs a SolrConfig, but a few things you can try... 1) if there are no references to solrConfig anywhere except the init method (and the super.init method it calls) just remove the refrences to it (so the methods just deal with the Map) 2) if there are other refrences to the solrConfig, they *may* just be to take advantage of ResourceLoader methods, so after making the changes above, make KStemFilterFactory implements ResourceLoaderAware and then add a method like this... public void inform(ResourceLoader loader) { // code that used solrConfig should go here, but use loader } ...it will get called after the init(MapString,String) method and let KStemmFilterFactory get access to files on disk. 3) if that doesn't work ... i don't know what else to try (i'd need to get a lot more familiar with KStem to guess) -Hoss
Re: [VOTE] Community Logo Preferences
https://issues.apache.org/jira/secure/attachment/12394282/solr2_maho_impression.png https://issues.apache.org/jira/secure/attachment/12394266/apache_solr_b_red.jpg thanks to everyone who contributed, rob On Sun, Nov 23, 2008 at 11:59 AM, Ryan McKinley [EMAIL PROTECTED] wrote: Please submit your preferences for the solr logo. For full voting details, see: http://wiki.apache.org/solr/LogoContest#Voting The eligible logos are: http://people.apache.org/~ryan/solr-logo-options.html Any and all members of the Solr community are encouraged to reply to this thread and list (up to) 5 ranked choices by listing the Jira attachment URLs. Votes will be assigned a point value based on rank. For each vote, 1st choice has a point value of 5, 5th place has a point value of 1, and all others follow a similar pattern. https://issues.apache.org/jira/secure/attachment/12345/yourfrstchoice.jpg https://issues.apache.org/jira/secure/attachment/34567/yoursecondchoice.jpg ... This poll will be open until Wednesday November 26th, 2008 @ 11:59PM GMT When the poll is complete, the solr committers will tally the community preferences and take a final vote on the logo. A big thanks to everyone would submitted possible logos -- its great to see so many good options.
Re: Simple Searching Question
you're likely not copyField-ing *_facet to text, and we'd need to see what type of field it is to see how it will be analyzed at both search/index time. the default schema.xml file is pretty well documented, so you might want to spend some time looking thru it, and reading the commentslots of good info in there. cheers, rob On Thu, Aug 14, 2008 at 7:17 PM, Jake Conk [EMAIL PROTECTED] wrote: Hi Shalin, foobar_facet is a dynamic field. Its defined in my schema like this: dynamicField name=*_facet type=string indexed=true stored=true/ I have the default search field set to text. Can I use more than one default search field? defaultSearchFieldtext/defaultSearchField Thanks, - Jake On Thu, Aug 14, 2008 at 2:48 PM, Shalin Shekhar Mangar [EMAIL PROTECTED] wrote: Hi Jake, What is the type of the foobar_facet field in your schema.xml ? Did you add foobar_facet as the default search field? On Fri, Aug 15, 2008 at 3:13 AM, Jake Conk [EMAIL PROTECTED] wrote: Hello, I inserted the following documents into Solr: --- add doc field name=id124/field field name=foobar_facetJake Conk/field /doc doc field name=id125/field field name=foobar_facetJake Conk/field /doc /add --- id is the only required integer field. foobar_facet is a dynamic string field. When I try to search for anything with the word Jake in it the following ways I get no results. select?q=Jake select?q=Jake* I thought one of those two should work but the only way I got it to work was by specifying which field Jake is in along with a wild card. select?q=foobar_facet:Jake* 1) Does this mean for each field I would like to search if Jake exists I would have to add each field like I did above to the query? 2) How would I search if I want to find the name Jake anywhere in the string? The documentation (http://lucene.apache.org/java/docs/queryparsersyntax.html) states that I cannot use a wildcard as the first character such as *Jake* Thanks, - Jake -- Regards, Shalin Shekhar Mangar.
Re: schema for literal string, case insensitive
bram, you'll want to look at the KeywordTokenizerFactory (which doesn't actually tokenize), and then use the LowerCaseFilterFactory. the schema in the example has a fieldType called 'alphaOnlySort' that should get you started. cheers, rob On Thu, May 29, 2008 at 6:21 AM, Bram de Jong [EMAIL PROTECTED] wrote: hello all, a while ago I was looking into making the schema for my (rather rich) data set, and I wanted to have a field for username. I would need a case insensitive string for that, but literal (no tokenizing, ...). how can I do this within the Solr schema definition? - bram -- http://freesound.iua.upf.edu http://www.smartelectronix.com http://www.musicdsp.org
LowerCaseFilterFactory and spellchecker
think i'm just doing something wrong... was experimenting with the spellcheck handler with the nightly checkout from 11-28; seems my spellchecking is case-sensitive, even tho i think i'm adding the LowerCaseFilterFactory to both the index and query analyzers. here's a brief rundown of my testing steps. from schema.xml: fieldtype name=spell class=solr.TextField positionIncrementGap=100 analyzer type=index tokenizer class=solr.StandardTokenizerFactory/ filter class=solr.StandardFilterFactory/ filter class=solr.RemoveDuplicatesTokenFilterFactory/ filter class=solr.LowerCaseFilterFactory/ /analyzer analyzer type=query tokenizer class=solr.StandardTokenizerFactory/ filter class=solr.StandardFilterFactory/ filter class=solr.RemoveDuplicatesTokenFilterFactory/ filter class=solr.LowerCaseFilterFactory/ /analyzer /fieldtype field name=title type=text indexed=true stored=true multiValued=true/ field name=spelling type=spell indexed=true stored=stored multiValued=true/ copyField source=title dest=spelling/ from solrconfig.xml: requestHandler name=spellchecker class=solr.SpellCheckerRequestHandler startup=lazy lst name=defaults int name=suggestionCount1/int float name=accuracy0.5/float /lst str name=spellcheckerIndexDirspell/str str name=termSourceFieldspelling/str /requestHandler adding the doc: curl http://localhost:8983/solr/update -H Content-Type: text/xml --data-binary 'adddocfield name=titleThorne/field/doc/add' curl http://localhost:8983/solr/update -H Content-Type: text/xml --data-binary 'optimize /' building the spellchecker: http://localhost:8983/solr/select/?q=Thorneqt=spellcheckercmd=rebuild querying the spellchecker: results from http://localhost:8983/solr/select/?q=Thorneqt=spellchecker ?xml version=1.0 encoding=UTF-8? response lst name=responseHeader int name=status0/int int name=QTime1/int /lst str name=wordsThorne/str str name=existfalse/str arr name=suggestions strthorne/str /arr /response results from http://localhost:8983/solr/select/?q=thorneqt=spellchecker ?xml version=1.0 encoding=UTF-8? response lst name=responseHeader int name=status0/int int name=QTime2/int /lst str name=wordsthorne/str str name=existtrue/str arr name=suggestions/ /response any pointers as to what i'm doing wrong, misinterpreting? i suspect i'm just doing something bone-headed in the analyzer sections... thanks as always, rob casson miami university libraries
Re: LowerCaseFilterFactory and spellchecker
lance, thanks for the quick replylooks like 'thorne' is getting added to the dictionary, as it comes up as a suggestion for 'Thorne' i could certainly just lowercase in my client, but just confirming that i'm not just screwing it up in the firstplace :) thanks again, rc On Nov 28, 2007 8:11 PM, Norskog, Lance [EMAIL PROTECTED] wrote: There are a few parameters for limiting what words are added to the dictionary. You might be trimming out 'thorne'. See this page: http://wiki.apache.org/solr/SpellCheckerRequestHandler -Original Message- From: Rob Casson [mailto:[EMAIL PROTECTED] Sent: Wednesday, November 28, 2007 4:25 PM To: solr-user@lucene.apache.org Subject: LowerCaseFilterFactory and spellchecker think i'm just doing something wrong... was experimenting with the spellcheck handler with the nightly checkout from 11-28; seems my spellchecking is case-sensitive, even tho i think i'm adding the LowerCaseFilterFactory to both the index and query analyzers. here's a brief rundown of my testing steps. from schema.xml: fieldtype name=spell class=solr.TextField positionIncrementGap=100 analyzer type=index tokenizer class=solr.StandardTokenizerFactory/ filter class=solr.StandardFilterFactory/ filter class=solr.RemoveDuplicatesTokenFilterFactory/ filter class=solr.LowerCaseFilterFactory/ /analyzer analyzer type=query tokenizer class=solr.StandardTokenizerFactory/ filter class=solr.StandardFilterFactory/ filter class=solr.RemoveDuplicatesTokenFilterFactory/ filter class=solr.LowerCaseFilterFactory/ /analyzer /fieldtype field name=title type=text indexed=true stored=true multiValued=true/ field name=spelling type=spell indexed=true stored=stored multiValued=true/ copyField source=title dest=spelling/ from solrconfig.xml: requestHandler name=spellchecker class=solr.SpellCheckerRequestHandler startup=lazy lst name=defaults int name=suggestionCount1/int float name=accuracy0.5/float /lst str name=spellcheckerIndexDirspell/str str name=termSourceFieldspelling/str /requestHandler adding the doc: curl http://localhost:8983/solr/update -H Content-Type: text/xml --data-binary 'adddocfield name=titleThorne/field/doc/add' curl http://localhost:8983/solr/update -H Content-Type: text/xml --data-binary 'optimize /' building the spellchecker: http://localhost:8983/solr/select/?q=Thorneqt=spellcheckercmd=rebuild querying the spellchecker: results from http://localhost:8983/solr/select/?q=Thorneqt=spellchecker ?xml version=1.0 encoding=UTF-8? response lst name=responseHeader int name=status0/int int name=QTime1/int /lst str name=wordsThorne/str str name=existfalse/str arr name=suggestions strthorne/str /arr /response results from http://localhost:8983/solr/select/?q=thorneqt=spellchecker ?xml version=1.0 encoding=UTF-8? response lst name=responseHeader int name=status0/int int name=QTime2/int /lst str name=wordsthorne/str str name=existtrue/str arr name=suggestions/ /response any pointers as to what i'm doing wrong, misinterpreting? i suspect i'm just doing something bone-headed in the analyzer sections... thanks as always, rob casson miami university libraries
delete by negative query
i'm having no luck deleting by a negative query indexing the example docs from 1.2, these steps work: curl http://localhost:8983/solr/update --data-binary 'deletequerysolr/query/delete' -H 'Content-type:text/xml; charset=utf-8' curl http://localhost:8983/solr/update --data-binary 'optimize /' -H 'Content-type:text/xml; charset=utf-8' but if i reindex, and change the delete query to a negative, the non-'solr' docs don't get deleted: curl http://localhost:8983/solr/update --data-binary 'deletequery-solr/query/delete' -H 'Content-type:text/xml; charset=utf-8' curl http://localhost:8983/solr/update --data-binary 'optimize /' -H 'Content-type:text/xml; charset=utf-8' good chance i'm missing something obvious tia, r
Re: delete by negative query
piete, thanks for the quick reply. You need to explicitly define the field you are referring to in order to achieve this, otherwise the query parser will assume that the minus character is part of the query and interpret it as field:-solr (where field is the name of the default field set in your schema). Try: curl http://localhost:8983/solr/update --data-binary 'deletequery-field:solr/query/delete' -H 'Content-type:text/xml; charset=utf-8' does this work for you?: curl http://localhost:8983/solr/update --data-binary 'deletequery-name:solr/query/delete' -H 'Content-type:text/xml; charset=utf-8' curl http://localhost:8983/solr/update --data-binary 'optimize /' -H 'Content-type:text/xml; charset=utf-8' then, http://localhost:8983/solr/select/?q=-name:solrversion=2.2start=0rows=10indent=on i still get all of those documents returned parser will assume that the minus character is part of the query and interpret it as field:-solr (where field is the name of the default field set in your schema) my true use case is to deleteByQuery against my default field, but even in this case, with the field specified, and using the exampledocs, i'm not seeing the document get deleted. thanks again...maybe i should just grab a beer, and go to bed...cheers :) r
Re: delete by negative query
the work arround is to include *:* in yoru query ... deletequery*:* -solr/query/delete ... if/when this is fixed in Solr that's esentally what solr will do under the covers. (would you mind opening a bug to track this and mention the work arround for other people who encounter it) will do. thanks again...adding the *:* to any query (qualified or not) seems to be working as expected solr++, r