set-based and other less common approaches to search
Let's say I wanted to ask solr to find me any document that contains at least 100 out of some 300 search terms I give it. Can Solr do this out of the box? If not, what kind of customization would it require? Now let's say I want to further have the option to request that those terms a) must show up within the same column of an excel spreadsheet, or b) are exact matches (i.e. match on search, but not searched), or c) occur in the exact order that I specified, or d) occur contiguously and without any words in between, or e) are made up of non-word elements such as 92228345 or SJA12334. Can solr do any of these out of the box? If not, what of these tasks is relatively easy to do with some custom code, and what is not?
Re: set-based and other less common approaches to search
Thanks. So following up on a) below, could I set up and query Solr, without any customization of code, to match 10 of my given 20 terms, but only if it finds those 10 terms in an xls document under a column that is named MyID or My ID or My I.D.? If so, what would that query look like? On Jul 2, 2013, at 12:38 PM, Otis Gospodnetic wrote: Hi, Solr can do all of these. There are phrase queries, queries where you specify a field, the mm param for min should match, etc. Otis -- Solr ElasticSearch Support -- http://sematext.com/ Performance Monitoring -- http://sematext.com/spm On Tue, Jul 2, 2013 at 12:36 PM, gilawem mewa...@gmail.com wrote: Let's say I wanted to ask solr to find me any document that contains at least 100 out of some 300 search terms I give it. Can Solr do this out of the box? If not, what kind of customization would it require? Now let's say I want to further have the option to request that those terms a) must show up within the same column of an excel spreadsheet, or b) are exact matches (i.e. match on search, but not searched), or c) occur in the exact order that I specified, or d) occur contiguously and without any words in between, or e) are made up of non-word elements such as 92228345 or SJA12334. Can solr do any of these out of the box? If not, what of these tasks is relatively easy to do with some custom code, and what is not?
Re: broken links returned from solr search
Sorry, i thought it was obvious. The links that are broken are the links that are returned in the search results. Using the example in the documentation I mentioned below, to load a word doc via curl http://localhost:8983/solr/update/extract?literal.id=doc1commit=true; -F myfile=@myworddoc.doc the broken link that shows up in the search results is http://localhost:8983/solr/collection1/doc1 so I just need to know where in the solr config to be able to handle requests when the URL points to collection/some_doc On Jun 29, 2013, at 1:08 PM, Erick Erickson wrote: What links? You haven't shown us what link you're clicking on that generates the 404 error. You might want to review: http://wiki.apache.org/solr/UsingMailingLists Best Erick On Fri, Jun 28, 2013 at 2:04 PM, MA LIG mewa...@gmail.com wrote: Hello, I ran the solr example as described in http://lucene.apache.org/solr/4_3_1/tutorial.html and then loaded some doc files to solr as described in http://wiki.apache.org/solr/ExtractingRequestHandler. The commands I used to load the files were of the form curl http://localhost:8983/solr/update/extract?literal.id=doc1commit=true; -F myfile=@test.doc I can successfully see search results in http://localhost:8983/solr/collection1/browse http://192.168.3.72:8983/solr/collection1/browse?q=test . However, when I click on a link, I get a 404 not found error. How can I make these links work properly? Thanks in advance -gw
Re: broken links returned from solr search
OK thanks. So I guess I will set up my own normal webserver and have the solr server a sort of private web-based API (or possibly a front-end that, when a user clicks on a search result link, just redirects the user to my normal web server that has the related file). That's easy enough. If that's not how solr is supposed to be used, please feel free to let me know. Thanks! On Jun 29, 2013, at 3:34 PM, Erick Erickson wrote: There's nothing built into the indexing process that stores URLs allowing you to fetch the document, you have to do that yourself. I'm not sure how the link is getting into the search results, you're assigning doc1 as the ID of the doc, and I think the browse request handler, aka Solaritas is constructing the link as best it can. But that is only demo code, not intended to fetch the document. In a typical app, you'll construct a URL for display that has meaning in _your_ environment, typically some way for the app server to know where the document is and how to fetch it. the browse request handler is showing you how you'd do this, but isn't meant to actually fetch the doc. Best Erick On Sat, Jun 29, 2013 at 1:29 PM, gilawem mewa...@gmail.com wrote: Sorry, i thought it was obvious. The links that are broken are the links that are returned in the search results. Using the example in the documentation I mentioned below, to load a word doc via curl http://localhost:8983/solr/update/extract?literal.id=doc1commit=true; -F myfile=@myworddoc.doc the broken link that shows up in the search results is http://localhost:8983/solr/collection1/doc1 so I just need to know where in the solr config to be able to handle requests when the URL points to collection/some_doc On Jun 29, 2013, at 1:08 PM, Erick Erickson wrote: What links? You haven't shown us what link you're clicking on that generates the 404 error. You might want to review: http://wiki.apache.org/solr/UsingMailingLists Best Erick On Fri, Jun 28, 2013 at 2:04 PM, MA LIG mewa...@gmail.com wrote: Hello, I ran the solr example as described in http://lucene.apache.org/solr/4_3_1/tutorial.html and then loaded some doc files to solr as described in http://wiki.apache.org/solr/ExtractingRequestHandler. The commands I used to load the files were of the form curl http://localhost:8983/solr/update/extract?literal.id=doc1commit=true; -F myfile=@test.doc I can successfully see search results in http://localhost:8983/solr/collection1/browse http://192.168.3.72:8983/solr/collection1/browse?q=test . However, when I click on a link, I get a 404 not found error. How can I make these links work properly? Thanks in advance -gw