Multi-word searches in multi-valued fields

2011-09-22 Thread Olson, Ron
Hi all- I'm not clear on how to allow a user to search a multi-valued field with multiple words and return only those documents where all the words are together in one value, and not spread over multiple values. If I do a literal search on the company name field for smith trucking (with the

RE: Two unrelated questions

2011-09-21 Thread Olson, Ron
this very high, but you're right, if anyone actually does something absurd it'll slow *that* query down. But just bumping this query higher won't change performance absent someone actually putting a ton of items in it... Best Erick On Mon, Sep 19, 2011 at 9:12 AM, Olson, Ron rol...@lbpc.com wrote

Two unrelated questions

2011-09-19 Thread Olson, Ron
Hi all- I'm not sure if I should break this out into two separate questions to the list for searching purposes, or if one is more acceptable (don't want to flood). I have two (hopefully) straightforward questions: 1. Is it possible to expose the unique ID of a document to a DIH query? The

Add copyTo Field without re-indexing?

2011-09-16 Thread Olson, Ron
Hi all- I have an 11 gig index that I realize I need to add another field to, but not from the actual query using DIH, but via copyTo. Is there any way to re-parse an existing index, adding the new copyTo field, without having to basically start all over again with DIH? Thanks, Ron

Parent delta query, but no child delta query?

2011-09-02 Thread Olson, Ron
Hi all- I'm trying to set up a delta query for a parent entity query that has many sub-queries. The table referenced in the parent query has a last updated field, but none of the children do. The way the data is set up is that when a child table is updated, the last updated field of the parent

RE: Exact matching on names?

2011-08-17 Thread Olson, Ron
with rows=1. -sujit On Tue, 2011-08-16 at 10:20 -0500, Olson, Ron wrote: Hi all- I'm missing something fundamental yet I've been unable to find the definitive answer for exact name matching. I'm indexing names using the standard text field type and my search is for the name clarke. My results

Exact matching on names?

2011-08-16 Thread Olson, Ron
Hi all- I'm missing something fundamental yet I've been unable to find the definitive answer for exact name matching. I'm indexing names using the standard text field type and my search is for the name clarke. My results include clark, which is incorrect, it needs to match clarke exactly (case

Dates off by 1 day?

2011-08-10 Thread Olson, Ron
Hi all- I apologize in advance if this turns out to be a problem between the keyboard and the chair, but I'm confused about why my date field is correct in the index, but wrong in SolrJ. I have a field defined as a date in the index: field name=FILE_DATE type=date indexed=true

RE: Dates off by 1 day?

2011-08-10 Thread Olson, Ron
in different time-zones using Java Util date functions if required. Hope it helps! -param On 8/10/11 11:20 AM, Olson, Ron rol...@lbpc.com wrote: Hi all- I apologize in advance if this turns out to be a problem between the keyboard and the chair, but I'm confused about why my date field is correct

RE: Strategies for sorting by array, when you can't sort by array?

2011-08-04 Thread Olson, Ron
in the future. Thanks to everyone who offered suggestions! Ron -Original Message- From: Jonathan Rochkind [mailto:rochk...@jhu.edu] Sent: Wednesday, August 03, 2011 11:36 AM To: solr-user@lucene.apache.org Cc: Olson, Ron Subject: Re: Strategies for sorting by array, when you can't sort by array

RE: deleting index directory/files

2011-08-04 Thread Olson, Ron
I ran into a problem when I deleted just the index directory; I deleted the entire data directory and it was recreated on the next load. BTW, if you're using the DIH, its default behavior is to remove all records on a full import, so you can save yourself having to remove any actual files.

Strategies for sorting by array, when you can't sort by array?

2011-08-03 Thread Olson, Ron
Hi all- Well, this is a problem. I have a list of names as a multi-valued field and I am searching on this field and need to return the results sorted. I know from searching and reading the documentation (and getting the error) that sorting on a multi-valued field isn't possible. Okay, so,

RE: Strategies for sorting by array, when you can't sort by array?

2011-08-03 Thread Olson, Ron
@lucene.apache.org Cc: Olson, Ron Subject: Re: Strategies for sorting by array, when you can't sort by array? Although you weren't very clear about it, it sounds as if you want the results to be sorted by a name that actually matched the query? In general that is not going to be easy, since

RE: Strategies for sorting by array, when you can't sort by array?

2011-08-03 Thread Olson, Ron
Solr 4's join feature addresses this use case. In the end, I think Solr could be modified to support this, with some work. It would make a good feature request in JIRA. ~ David Smiley On Aug 3, 2011, at 10:39 AM, Olson, Ron wrote: Hi all- Well, this is a problem. I have a list of names

Determine which field term was found?

2011-07-21 Thread Olson, Ron
Hi all- Is there an easy way to find out which field matched a term in an OR query using Solr? I have a document with names in two multi-valued fields and I am searching for Smith, using the query A_NAMES:smith OR B_NAMES:smith. I figure I could loop through both result arrays, but that seems

RE: Determine which field term was found?

2011-07-21 Thread Olson, Ron
at 4:47 PM, Olson, Ron rol...@lbpc.com wrote: Is there an easy way to find out which field matched a term in an OR query using Solr? I have a document with names in two multi-valued fields and I am searching for Smith, using the query A_NAMES:smith OR B_NAMES:smith. I figure I could loop

Unique document count from index?

2011-06-27 Thread Olson, Ron
Hi all- I have a problem that I'm not sure how it can be (if it can be) solved in Solr. I am using Solr 3.2 with patch 2524 installed to provide grouping. I need to return the count of unique records that match a particular query. For an example of what I'm talking about, imagine I have an

DIH and updating specific record

2011-02-22 Thread Olson, Ron
Hi all- I am trying to determine if there is a way to tell Solr to update its index with a specific ID to a record in the database. All the examples and documentation seems to discuss using a last updated date/time field, but in this case modifying the table would not be an option. Instead,

RE: XML Stripping from DIH

2011-02-22 Thread Olson, Ron
Stripping from DIH Ron, http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.HTMLStripCharFilterFactory Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message From: Olson, Ron rol

XML Stripping from DIH

2011-02-18 Thread Olson, Ron
Hi all- I have some XML in a database that I am trying to index and store; I am interested in the various pieces of text, but none of the tags. I've been trying to figure out a way to strip all the tags out, but haven't found anything within Solr to do so; the XML parser seems to want XPath to

RE: Setting up Solr for PDFs on JBoss

2011-01-04 Thread Olson, Ron
for PDFs on JBoss What's your solrconfig.xml look like for setting up the ExtractingReqHandler? -Grant On Jan 3, 2011, at 4:44 PM, Olson, Ron wrote: Hi all- After testing the PDF import functionality in my local copy of Solr 1.4.1 with the included Jetty app server, I tried replicating

Setting up Solr for PDFs on JBoss

2011-01-03 Thread Olson, Ron
Hi all- After testing the PDF import functionality in my local copy of Solr 1.4.1 with the included Jetty app server, I tried replicating it using my copy of Solr running in JBoss 5.10 (which uses Tomcat as its servlet container). When I try to add a PDF, I get an error buried in the stack

Using setStart in solrj

2010-11-04 Thread Olson, Ron
Hi all- First, thanks to all the folks to have helped me so far getting the hang of Solr; I promise to give back when I think my contributions will be useful :) I am at the point where I'm trying to return results back from a search in a war file, using Java with solrj. On the result page of

RE: Testing/packaging question

2010-11-04 Thread Olson, Ron
I believe it should point to the directory above, where conf and lib are located (though I have a multi-core setup). Mine is set to: /usr/local/jboss-5.1.0.GA/server/solr/solr_data/ And in solr_data the solr.xml defines the two cores, but in each core directory, is a conf, data, and lib

RE: Stored or indexed?

2010-11-02 Thread Olson, Ron
Thanks for the great info! I appreciate everybody's help in getting started with Solr, hopefully I'll be able to get my stuff working and move on to more difficult questions. :) -Original Message- From: Elizabeth L. Murnane [mailto:emurn...@architexa.com] Sent: Friday, October 29, 2010

Stored or indexed?

2010-10-27 Thread Olson, Ron
Hi all- I've read through the documentation, but I'm still a little confused about the field/ tag, in terms of the indexed and stored attributes. If I have something marked as indexed=true, why would I ever want stored=false? Are there any good tips-n-tricks anywhere about how to properly set

Confusion about entities and documents

2010-10-22 Thread Olson, Ron
Hi all- I've been checking the online docs about this, but I haven't found a suitable explanation about how entities and sub-entities work within a document. I am loading records from a SQL database and everything seems to be getting flattened in a way I was not expecting. For example, I have

RE: Confusion about entities and documents

2010-10-22 Thread Olson, Ron
Hmm, okay, I guess I wasn't taking the hierarchy-flattening aspect of Solr seriously enough. :) Based on your reply from the other thread, I guess the best solution, as far as I can tell, is to maintain the multiple value lists and take advantage of the fact that the arrays will always be in

Documents and cores

2010-10-19 Thread Olson, Ron
Hi all- I have a newbie design question about documents, especially with SQL databases. I am trying to set up Solr to go against a database that, for example, has items and people. The way I see it, and I don't know if this is right or not (thus the question), is that I see both as separate

Documents and Cores, take 2

2010-10-19 Thread Olson, Ron
Hi all- I have a newbie design question about documents, especially with SQL databases. I am trying to set up Solr to go against a database that, for example, has items and people. The way I see it, and I don't know if this is right or not (thus the question), is that I see both as separate