set-based and other less common approaches to search

2013-07-02 Thread gilawem
Let's say I wanted to ask solr to find me any document that contains at least 
100 out of some 300 search terms I give it. Can Solr do this out of the box? If 
not, what kind of customization would it require?

Now let's say I want to further have the option to request that those terms a) 
must show up within the same column of an excel spreadsheet, or b) are exact 
matches (i.e. match on search, but not searched), or c) occur in the exact 
order that I specified, or d) occur contiguously and without any words in 
between, or e) are made up of non-word elements such as 92228345 or 
SJA12334.

Can solr do any of these out of the box? If not, what of these tasks is 
relatively easy to do with some custom code, and what is not?

Re: set-based and other less common approaches to search

2013-07-02 Thread gilawem
Thanks. So following up on a) below, could I set up and query Solr, without any 
customization of code, to match 10 of my given 20 terms, but only if it finds 
those 10 terms in an xls document under a column that is named MyID or My 
ID or My I.D.? If so, what would that query look like?

On Jul 2, 2013, at 12:38 PM, Otis Gospodnetic wrote:

 Hi,
 
 Solr can do all of these.  There are phrase queries, queries where you
 specify a field, the mm param for min should match, etc.
 
 Otis
 --
 Solr  ElasticSearch Support -- http://sematext.com/
 Performance Monitoring -- http://sematext.com/spm
 
 
 
 On Tue, Jul 2, 2013 at 12:36 PM, gilawem mewa...@gmail.com wrote:
 Let's say I wanted to ask solr to find me any document that contains at 
 least 100 out of some 300 search terms I give it. Can Solr do this out of 
 the box? If not, what kind of customization would it require?
 
 Now let's say I want to further have the option to request that those terms 
 a) must show up within the same column of an excel spreadsheet, or b) are 
 exact matches (i.e. match on search, but not searched), or c) occur in 
 the exact order that I specified, or d) occur contiguously and without any 
 words in between, or e) are made up of non-word elements such as 92228345 
 or SJA12334.
 
 Can solr do any of these out of the box? If not, what of these tasks is 
 relatively easy to do with some custom code, and what is not?



Re: broken links returned from solr search

2013-06-29 Thread gilawem
Sorry, i thought it was obvious. The links that are broken are the links that 
are returned in the search results. Using the example in the documentation I 
mentioned below, to load a word doc via
curl 
http://localhost:8983/solr/update/extract?literal.id=doc1commit=true; -F 
myfile=@myworddoc.doc

the broken link that shows up in the search results is 
http://localhost:8983/solr/collection1/doc1

so I just need to know where in the solr config to be able to handle requests 
when the URL points to collection/some_doc


On Jun 29, 2013, at 1:08 PM, Erick Erickson wrote:

 What links? You haven't shown us what link you're clicking on
 that generates the 404 error.
 
 You might want to review:
 http://wiki.apache.org/solr/UsingMailingLists
 
 Best
 Erick
 
 
 On Fri, Jun 28, 2013 at 2:04 PM, MA LIG mewa...@gmail.com wrote:
 
 Hello,
 
 I ran the solr example as described in
 http://lucene.apache.org/solr/4_3_1/tutorial.html and then loaded some doc
 files to solr as described in
 http://wiki.apache.org/solr/ExtractingRequestHandler. The commands I used
 to load the files were of the form
 
  curl 
 http://localhost:8983/solr/update/extract?literal.id=doc1commit=true; -F
 myfile=@test.doc
 
 I can successfully see search results in
 http://localhost:8983/solr/collection1/browse
 http://192.168.3.72:8983/solr/collection1/browse?q=test
 .
 
 However, when I click on a link, I get a 404 not found error. How can I
 make these links work properly?
 
 Thanks in advance
 
 -gw
 



Re: broken links returned from solr search

2013-06-29 Thread gilawem
OK thanks. So I guess I will set up my own normal webserver and have the solr 
server a sort of private web-based API (or possibly a front-end that, when a 
user clicks on a search result link, just redirects the user to my normal web 
server that has the related file). That's easy enough. If that's not how solr 
is supposed to be used, please feel free to let me know. Thanks!

On Jun 29, 2013, at 3:34 PM, Erick Erickson wrote:

 There's nothing built into the indexing process that stores URLs allowing
 you to fetch the document, you have to do that yourself. I'm not sure how
 the link is getting into the search results, you're assigning doc1 as the
 ID of the doc, and I think the browse request handler, aka Solaritas is
 constructing the link as best it can. But that is only demo code, not
 intended to fetch the document.
 
 In a typical app, you'll construct a URL for display that has meaning in
 _your_ environment, typically some way for the app server to know where the
 document is and how to fetch it. the browse request handler is showing you
 how you'd do this, but isn't meant to actually fetch the doc.
 
 Best
 Erick
 
 
 On Sat, Jun 29, 2013 at 1:29 PM, gilawem mewa...@gmail.com wrote:
 
 Sorry, i thought it was obvious. The links that are broken are the links
 that are returned in the search results. Using the example in the
 documentation I mentioned below, to load a word doc via
curl 
 http://localhost:8983/solr/update/extract?literal.id=doc1commit=true; -F
 myfile=@myworddoc.doc
 
 the broken link that shows up in the search results is
 http://localhost:8983/solr/collection1/doc1
 
 so I just need to know where in the solr config to be able to handle
 requests when the URL points to collection/some_doc
 
 
 On Jun 29, 2013, at 1:08 PM, Erick Erickson wrote:
 
 What links? You haven't shown us what link you're clicking on
 that generates the 404 error.
 
 You might want to review:
 http://wiki.apache.org/solr/UsingMailingLists
 
 Best
 Erick
 
 
 On Fri, Jun 28, 2013 at 2:04 PM, MA LIG mewa...@gmail.com wrote:
 
 Hello,
 
 I ran the solr example as described in
 http://lucene.apache.org/solr/4_3_1/tutorial.html and then loaded some
 doc
 files to solr as described in
 http://wiki.apache.org/solr/ExtractingRequestHandler. The commands I
 used
 to load the files were of the form
 
 curl 
 http://localhost:8983/solr/update/extract?literal.id=doc1commit=true;
 -F
 myfile=@test.doc
 
 I can successfully see search results in
 http://localhost:8983/solr/collection1/browse
 http://192.168.3.72:8983/solr/collection1/browse?q=test
 .
 
 However, when I click on a link, I get a 404 not found error. How can I
 make these links work properly?
 
 Thanks in advance
 
 -gw