Newbie: For stopword query - All objects being returned

2008-11-20 Thread Sanjay Suri
Hi, I realize this might be too simple - Can someone tell me where to look? I'm new to Solr and have to fix this for a demo ASAP. If my search query is "the", all 91 objects are returned as search results. I expect 0 results. -- Sanjay Suri Videocrux Inc. http://videocrux.com +91 99102 66626

Re: How can i protect the SOLR Cores?

2008-11-20 Thread Noble Paul നോബിള്‍ नोब्ळ्
Set up an extra filter before SolrDispatchFilter to do authentication. On Thu, Nov 20, 2008 at 12:28 PM, RaghavPrabhu [EMAIL PROTECTED] wrote: Hi all, I'm using multiple cores and all I need to do is to make each core secure. If I am accessing a particular core via URL, it

WordDelimeterFilter and its Factory: access to charTypeTable

2008-11-20 Thread Jerven Bolleman
Hi Solr Community, I was wondering if it is possible to access and modify the charTypeTable of the WordDelimiterFilter. The use case is that I do not want to split on a '*' char, which the filter currently does. If I could modify the charTypeTable I could change the behaviour of the filter. Or

Total newbie question: unknown field

2008-11-20 Thread Elise Dawn Conradi
Hi all! For a school project at the master's program in LIS at Oslo University College, I am trying to index Marc-records to make a faceted browser of digital books. So far, I've transformed the Marc-records to Solr-friendly records. I am now trying to update my index (for the first

Re: TextProfileSigature using deduplication

2008-11-20 Thread Marc Sturlese
Hey there, I found a couple of solutions that work fine for my case (it is not exactly what I was looking for at the beginning, but I could adapt it). First one: always use quantum=1 and minTokenLen=2. Instead of ordering the tokens by frequency, I order them alphabetically; doing this I am a little more

Re: Total newbie question: unknown field

2008-11-20 Thread Elise Dawn Conradi
Hi again! I've figured it out. Hadn't reloaded Solr after updating the schema. Doh. Regards, Elise Quoting Elise Dawn Conradi [EMAIL PROTECTED]: Hi all! For a school project at the master's program in LIS at Oslo University College, I am trying to index Marc-records to make a faceted

Re: TextProfileSigature using deduplication

2008-11-20 Thread Mark Miller
Thanks for sharing Marc, that's very nice to know. I'll take your experience as a starting point for some wiki recommendations. Sounds like we should add a switch to order alpha as well. Marc Sturlese wrote: Hey there, I found a couple of solutions that work fine for my case (it is not exactly what

not string or text fields and shards

2008-11-20 Thread Marc Sturlese
Hey there, I have started working with an index divided in 3 shards. When I did a distributed search I got an error with the fields that were not string or text. I read that the error was due to BinaryResponseWriter and not string/text empty fields. I found the solution in an old thread of this

Slow deleteById with solr 1.3.0

2008-11-20 Thread Uwe Klosa
Hi We have a problem with slow deleteById where one delete can take up to 30 minutes and the thread which initiated the deleteById is waiting for the method to return. The problem is not that the delete takes so much time. The problem is that the application that initiates deletes is halted that

Re: Unique id

2008-11-20 Thread Erik Hatcher
I'd suggest aggregating those three columns into a string that can serve as the Solr uniqueKey field value. Erik On Nov 20, 2008, at 1:10 AM, Raghunandan Rao wrote: Basically, I am working on two views. First one has an ID column. The second view has no unique ID column. What to do
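Erik's suggestion of aggregating columns into one string can be sketched as follows. This is only an illustration: the column names (`region`, `account`, `line_no`), the `|` separator, and the helper name `make_unique_key` are assumptions, not taken from the thread. The separator is escaped so that two different rows cannot collapse to the same key.

```python
def make_unique_key(row, columns, sep="|"):
    """Join the given column values into one string for use as a unique key.

    `row` is a mapping of column name to value; `columns` lists the
    (hypothetical) columns that together identify a record.
    """
    parts = []
    for col in columns:
        value = row[col]
        if value is None:
            raise ValueError(f"column {col!r} is empty; cannot build a stable key")
        # Escape backslashes and the separator so that ("a|b", "c") and
        # ("a", "b|c") produce different keys.
        text = str(value).replace("\\", "\\\\").replace(sep, "\\" + sep)
        parts.append(text)
    return sep.join(parts)


# Hypothetical row from the view that has no single ID column.
row = {"region": "EU", "account": 1042, "line_no": 7}
print(make_unique_key(row, ["region", "account", "line_no"]))
```

A key built this way is a string, so the field it is stored in would presumably need a string type as well.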

Re: Newbie: For stopword query - All objects being returned

2008-11-20 Thread Erik Hatcher
On Nov 20, 2008, at 3:31 AM, Sanjay Suri wrote: Hi , I realize this might be too simple - Can someone tell me where to look? I'm new to solr and have to fix this for a demo asap. If my search query is the, all 91 objects are returned as search results. I expect 0 results. Add

Re: Total newbie question: unknown field

2008-11-20 Thread Erik Hatcher
On Nov 20, 2008, at 6:22 AM, Elise Dawn Conradi wrote: For a school project at the master's program in LIS at Oslo University College, I am trying to index Marc-records to make a faceted browser of digital books. Are you aware of the library projects that leverage Solr? Blacklight,

Re: Upgrade from 1.2 to 1.3 gives 3x slowdown

2008-11-20 Thread Grant Ingersoll
Hi Fergus, Were you overwriting the existing index or did you also clean out the Solr data directory, too? In other words, was it a fresh index, or an existing one? And was that also the case for the 22 minute time? Would it be possible to profile the two instances and see if you notice

Re: Field collapsing (SOLR-236) and Solr 1.3.0 release version

2008-11-20 Thread gurudev
We are about to release Field collapsing in our production site, but the index size is not as big as yours. Definitely collapsing is an added overhead. You can do some load testing and bench mark on some dataset as you would expect on your production project as SOLR-236 is currently available

Re: Field collapsing (SOLR-236) and Solr 1.3.0 release version

2008-11-20 Thread Mark Miller
gurudev wrote: One thing that you can go with is using adjacent field collapsing rather than simple collapsing. As internally SOLR would first sort on the collapse field to use simple collapsing, which is not the case with adjacent collapsing. This is something that I think could be improved in

Re: Tomcat undeploy/shutdown exception

2008-11-20 Thread Mark Miller
It's pretty nuts, 'cause the null check protecting against that appears to have been in well pre 1.3. How the heck does a null get past a null check? Erik Hatcher wrote: 1.3.0 final release. Erik On Nov 20, 2008, at 2:03 AM, Shalin Shekhar Mangar wrote: Eric, which Solr version is that

Re: Upgrade from 1.2 to 1.3 gives 3x slowdown

2008-11-20 Thread Fergus McMenemie
Hello Grant, Were you overwriting the existing index or did you also clean out the Solr data directory, too? In other words, was it a fresh index, or an existing one? And was that also the case for the 22 minute time? No in each case it was a new index. I store the indexes (the data dir)

solr.WordDelimiterFilterFactory

2008-11-20 Thread Daniel Rosher
Hi, I'm trying to index some content that has things like 'java/J2EE' but with solr.WordDelimiterFilterFactory and parameters [generateWordParts=1 generateNumberParts=0 catenateWords=0 catenateNumbers=0 catenateAll=0 splitOnCaseChange=0] this ends up tokenized as 'java','j','2','EE' Does anyone

Re: Multi word Synonym

2008-11-20 Thread gurudev
Just use the query analysis link with appropriate values. It will show how each filter factory and analyzer breaks the terms during the various analysis levels. Especially check the EnglishPorterFilterFactory analysis. Jeff Newburn wrote: I am trying to figure out how the synonym filter processes

Re: TextProfileSigature using deduplication

2008-11-20 Thread Andrzej Bialecki
Mark Miller wrote: Thanks for sharing Marc, that's very nice to know. I'll take your experience as a starting point for some wiki recommendations. Sounds like we should add a switch to order alpha as well. On the general note of near-duplicate detection ... I found this paper in the

Re: Field collapsing (SOLR-236) and Solr 1.3.0 release version

2008-11-20 Thread Ryan McKinley
if only i could magic all these damn pdfs I have into some code :) +1 I want some of that magic too!

RE: WordDelimeterFilter and its Factory: access to charTypeTable

2008-11-20 Thread Feak, Todd
I've found that creating a custom filter and filter factory isn't too burdensome when the filter doesn't quite do what I need. You could grab the source and create your own version. -Todd Feak -Original Message- From: Jerven Bolleman [mailto:[EMAIL PROTECTED] Sent: Thursday, November

RE: WordDelimeterFilter and its Factory: access to charTypeTable

2008-11-20 Thread Jerven Bolleman
On Thu, 2008-11-20 at 07:30 -0800, Feak, Todd wrote: I've found that creating a custom filter and filter factory isn't too burdensome when the filter doesn't quite do what I need. You could grab the source and create your own version. I will have to do so anyway. As a test I used reflection

Solr Core Admin

2008-11-20 Thread Jeff Newburn
I am trying to use the API for the Solr cores. Reload works great but when I try to UNLOAD I get a massive exception in IOException. It seems to unload the module but doesn't remove it from the configuration file. The solr.xml file is full read and write but still errors. Any ideas? Solr.xml

Re: Tomcat undeploy/shutdown exception

2008-11-20 Thread Chris Hostetter
: 1.3.0 final release. That stack trace doesn't jibe with 1.3.0 ... : java.lang.NullPointerException : at org.apache.solr.servlet.SolrDispatchFilter.destroy(SolrDispatchFilter.java:123) : at SolrDispatchFilter.java:123 in 1.3 (and 1.2, and trunk) is in the doFilter

Re: Logging in Solr.

2008-11-20 Thread Erik Holstad
Thanks for the help Ryan! Using the start.jar with 1.3 and added the slf4j jar to the classpath. When it comes to the setting up of the log4j I wonder which method is better. To put the redirect to the log server in the Jetty.xml file or to put a log4j.properties file in the web library, and if

Re: Solr Core Admin

2008-11-20 Thread Jeff Newburn
Ok just FYI solr replaces the file instead of editing. This means that the webserver needs permissions in the directory to delete and create the solr.xml file. Once I fixed that it no longer gave IOException errors. On 11/20/08 8:29 AM, Jeff Newburn [EMAIL PROTECTED] wrote: I am trying to
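The permission issue Jeff describes is typical of any "write a new file, then swap it in" update. The sketch below is not Solr's code; it is a generic illustration, with a hypothetical helper name, of why replacing a file needs write access to the containing directory and not only to the file itself.

```python
import os
import tempfile


def rewrite_by_replacing(path, new_text):
    """Replace the file at `path` with `new_text` via a temporary file."""
    directory = os.path.dirname(os.path.abspath(path))
    # Creating the temporary file needs write permission on the directory;
    # write permission on the old file alone is not enough.
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    try:
        with os.fdopen(fd, "w", encoding="utf-8") as handle:
            handle.write(new_text)
        # Swapping the new file into place also modifies the directory entry.
        os.replace(tmp_path, path)
    except BaseException:
        # Clean up the temporary file if anything went wrong.
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise
```

If the process can write the file but not the directory, the `mkstemp` call fails with a permission error, which would be consistent with the behaviour described above.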

Re: Logging in Solr.

2008-11-20 Thread Ryan McKinley
On Nov 20, 2008, at 11:57 AM, Erik Holstad wrote: Thanks for the help Ryan! Using the start.jar with 1.3 and added the slf4j jar to the classpath. When with 1.3 -- the logging is java.util.logging -- The slf4j advice only applies to 1.4-dev ryan

Re: Logging in Solr.

2008-11-20 Thread Erik Holstad
Ok, thanks Ryan! On Thu, Nov 20, 2008 at 9:03 AM, Ryan McKinley [EMAIL PROTECTED] wrote: On Nov 20, 2008, at 11:57 AM, Erik Holstad wrote: Thanks for the help Ryan! Using the start.jar with 1.3 and added the slf4j jar to the classpath. When with 1.3 -- the logging is java.util.logging

Question about dismax 'mm' - give boost to searches by location

2008-11-20 Thread anuvenk
I use the 'dismax handler' for my phrase matching, and I have the 'mm' set this way: Up to 3 words, match all; up to 4, match 3; up to 4, match 3; so on. It's been working fine, but for certain phrases like 'san diego drunk driving defense attorney', it brings up DUI attorneys for other cities first

Phrase query search with stopwords

2008-11-20 Thread Robert Haschart
Greetings all, I'm having trouble tracking down why a particular query is not working. A user is trying to do a search for alternate_form_title_text:"three films by louis malle" specifically to find the 4 records that contain the phrase "Three films by Louis Malle" in their

RE: No search result behavior (a la Amazon)

2008-11-20 Thread Caligula
Thanks. I understand what Amazon is doing. The original question is how to achieve this with Solr. And to be more specific, how to achieve this within Solr and not involve multiple search queries to Solr. Nguyen, Joe-2 wrote: Seemed like its first search required match all terms. If

Re: Deadlock with DirectUpdateHandler2

2008-11-20 Thread Ryan McKinley
I'm also hitting some threading issues with autocommit -- JConsole does not show deadlock, but it shows some threads 'BLOCKED' on scheduleCommitWithin Perhaps this has something to do with the changes we made for: SOLR-793 I am able to fix this (at least I don't see the blocking with the

Filtering on blank fields

2008-11-20 Thread Manepalli, Kalyan
Hi, I want to fetch only the documents which have a certain field. For this I am using an fq query like this: fq=rev.comments:[* TO *]. The rev.comments field is of type string. The functionality works correctly, but I am seeing a performance degradation. Without the above fq, the

Re: Filtering on blank fields

2008-11-20 Thread Mike Klaas
On 20-Nov-08, at 12:23 PM, Manepalli, Kalyan wrote: Hi, I want to fetch only the documents which have a certain field. For this I am using an fq query like this: fq=rev.comments:[* TO *]. The rev.comments field is of type string. The functionality works correctly, but I am seeing a

Re: filtering on blank OR specific range

2008-11-20 Thread Thijs Vonk
I actually am looking for the same answer. I have worked around it by indexing 'empty' fields with a dummy value, but this isn't an ideal situation. Thijs On 11/19/08 10:38 PM, Geoffrey Young wrote: Lance Norskog wrote: Try: Type:blue OR -Type:[* TO *] You can't have a negative clause

solrQueryParser does not take effect - nightly build

2008-11-20 Thread ashokc
Hi, I have set <solrQueryParser defaultOperator="AND"/> but it is not taking effect. It continues to take it as OR. I am working with the latest nightly build (11/20/2008). For a query like term1 term2, debug shows <str name="parsedquery">content:term1 content:term2</str>. Bug? Thanks - ashok --

RE: Filtering on blank fields

2008-11-20 Thread Manepalli, Kalyan
Hi Mike, Thanks for the suggestion, I will test it out and post the results Thanks, Kalyan Manepalli -Original Message- From: Mike Klaas [mailto:[EMAIL PROTECTED] Sent: Thursday, November 20, 2008 2:38 PM To: solr-user@lucene.apache.org Subject: Re: Filtering on blank fields On

RE: :TODO: we should try to make a DTD for the schema, DONE as XSD instead

2008-11-20 Thread Peter_Lenahan
Hoss, There were a few comments about schema files in Mark Mail between you and Grant a couple of months ago, no big demand for them for the schema.xml file. Before I drop this would you consider taking a look at the XSD file below for the schema.xml and perhaps submit the XSD file to the SVN system? I

Re: solr.WordDelimiterFilterFactory

2008-11-20 Thread Mike Klaas
On 20-Nov-08, at 6:20 AM, Daniel Rosher wrote: Hi, I'm trying to index some content that has things like 'java/J2EE' but with solr.WordDelimiterFilterFactory and parameters [generateWordParts=1 generateNumberParts=0 catenateWords=0 catenateNumbers=0 catenateAll=0 splitOnCaseChange=0] this

Boosting by field contents

2008-11-20 Thread Lance Norskog
Is it possible to boost a document by the contents of a field? Given the query: text field:value I want to return all documents with 'text'. Documents where 'field = value' boosted over documents where 'field = some other value'. This query does it: (text field:value)^100 (text

Re: No search result behavior (a la Amazon)

2008-11-20 Thread Mike Klaas
On 20-Nov-08, at 11:40 AM, Caligula wrote: Thanks. I understand what Amazon is doing. The original question is how to achieve this with Solr. And to be more specific, how to achieve this within Solr and not involve multiple search queries to Solr. There isn't a way. The best way to

Re: Build Solr to run SolrJS

2008-11-20 Thread JCodina
I could not manage to use it yet. My doubts are: - must I download solr from svn - trunk? - then, must I apply the patches of solrjs and velocity and unzip the files? or is this already in trunk? because trunk contains velocity and javascript in contrib. but does not find the

Re: How can i protect the SOLR Cores?

2008-11-20 Thread Norberto Meijome
On Wed, 19 Nov 2008 22:58:52 -0800 (PST) RaghavPrabhu [EMAIL PROTECTED] wrote: I'm using multiple cores and all I need to do is to make each core secure. If I am accessing a particular core via URL, it should ask for and validate the credentials, say username and password, for each core.

RE: Filtering on blank fields

2008-11-20 Thread Lance Norskog
The problem with a zero-length string is that it is also returned by: field:[* TO *]. So you don't know if you're doing this right or not. For those of us who cannot reindex at the drop of a hat, this is a big deal. We went with -1. Lance -Original Message- From: Manepalli, Kalyan

RE: Unique id

2008-11-20 Thread Raghunandan Rao
Thanks Erik. If I convert that to a string then id field defined in schema.xml would fail as I have that as integer. If I change that to string then first view would fail as it is Integer there. What to do in such scenarios? Do I need to define multiple schema.xml or multiple unique key