OOE during indexing

2008-01-21 Thread Marcus Herou
Hi. I get OOE with Solr 1.3 Autowarm seem to be the villain in cojunction with FieldCache somehow. JVM args: -Xmx512m -Xms512m -Xss128k Index size is ~4 Million docs, where I index text and store database primary keys. du /srv/solr/feedItem/data/index/ 1.7G/srv/solr/feedItem/data/index/ To e

RE: copyField limitation

2008-01-21 Thread Lance Norskog
Sorting on a non-integer has space problems. As I understand it, sorting creates an array of integers the size of the number of records in the entire index. Sorting on a non-integer type also creates a separate array of the same size with the field data copied into it. Thus sorting a non-integer f

Re: DisMax and Search Components

2008-01-21 Thread Doug Steigerwald
We don't always want to use the dismax handler in our setup. Doug Yonik Seeley wrote: On Jan 21, 2008 9:06 PM, Doug Steigerwald <[EMAIL PROTECTED]> wrote: We've found a way to work around it. In our search components, we're doing something like: defType = defType == null ? DisMaxQParserP

Re: DisMax and Search Components

2008-01-21 Thread Yonik Seeley
On Jan 21, 2008 9:06 PM, Doug Steigerwald <[EMAIL PROTECTED]> wrote: > We've found a way to work around it. In our search components, we're doing > something like: > >defType = defType == null ? DisMaxQParserPlugin.NAME : defType; Would it be easier to just add it as a default parameter in t

Re: DisMax and Search Components

2008-01-21 Thread Doug Steigerwald
We've found a way to work around it. In our search components, we're doing something like: defType = defType == null ? DisMaxQParserPlugin.NAME : defType; If you add &defType=dismax to the query string, it'll use the DisMaxQParserPlugin. Unfortunately, I haven't been able to figure out an

Re: DisMax and Search Components

2008-01-21 Thread Yonik Seeley
The QueryComponent supports both lucene queryparser syntax and dismax query syntax. The dismax request handler now simply sets defType (the default base query type) to "dismax" -Yonik On Jan 21, 2008 1:23 PM, Doug Steigerwald <[EMAIL PROTECTED]> wrote: > Is there any support for DisMax (or any se

Re: DisMax and Search Components

2008-01-21 Thread Charles Hornberger
On Jan 21, 2008 10:23 AM, Doug Steigerwald <[EMAIL PROTECTED]> wrote: > Is there any support for DisMax (or any search request handlers) in search > components, or is that > something that still needs to be done? It seems like it isn't supported at > the moment. I was curious about this, too ..

Re: Wildcards

2008-01-21 Thread dojolava
Thanks a lot! I checked it, when I search for "g?rden" it works, only "g?rdener" does not... I will try the copyField solution. On Jan 21, 2008 11:23 PM, Yonik Seeley <[EMAIL PROTECTED]> wrote: > On Jan 21, 2008 5:18 PM, dojolava <[EMAIL PROTECTED]> wrote: > > I just started to use solr and I e

Re: Wildcards

2008-01-21 Thread Yonik Seeley
On Jan 21, 2008 5:18 PM, dojolava <[EMAIL PROTECTED]> wrote: > I just started to use solr and I experience strange behaviour when it comes > to wildcards. > > When I use the StandardRequestHandler queries like "eur?p?an" or "eur*an" > work fine. > But "garden?r" or "admini*tion" do not bring any re

RE: illegal characters in xml file to be posted?

2008-01-21 Thread zqzuk
Thanks for the quick advice! pbinkley wrote: > > You should encode those three characters, and it doesn't hurt to encode > the ampersand and double-quote characters too: > http://en.wikipedia.org/wiki/XML#Entity_references > > Peter > > -Original Message- > From: zqzuk [mailto:[EMAIL

Wildcards

2008-01-21 Thread dojolava
Hello, I just started to use solr and I experience strange behaviour when it comes to wildcards. When I use the StandardRequestHandler queries like "eur?p?an" or "eur*an" work fine. But "garden?r" or "admini*tion" do not bring any results (without wildcards there are some of course). All affecte

RE: illegal characters in xml file to be posted?

2008-01-21 Thread Binkley, Peter
You should encode those three characters, and it doesn't hurt to encode the ampersand and double-quote characters too: http://en.wikipedia.org/wiki/XML#Entity_references Peter -Original Message- From: zqzuk [mailto:[EMAIL PROTECTED] Sent: Monday, January 21, 2008 2:24 PM To: solr-user@l

illegal characters in xml file to be posted?

2008-01-21 Thread zqzuk
Hi, I am using the SimplePostTool to post files to solr. I have encoutered some problem with the content of xml files. I noticed that if my xml file has fields whose values contain the character "&" or "<" or ">", the post fails and I get the exception : "javax.xml.stream.XMLStreamException: Pars

Is it possible to have "append" kind update operation?

2008-01-21 Thread zqzuk
Hi, is it possible to have "append" like updates, where if two records of same id's are posted to solr, the contents of the two merges and composes a single record with the id? I am asking because my program works in a multi-thread manner where several threads produces serveral parts of a final re

Re: Missing Content Stream

2008-01-21 Thread Ismail Siddiqui
I am trying solrj to index.. using follwing code String url = "http://localhost:8080/solr";; SolrServer server = new CommonsHttpSolrServer( url ); its giving error that undifined symbol for constructor(string). can somoen tell me why this constructor thrwoing error while in source file i can c

Solr Warm up on Tomcat

2008-01-21 Thread Jae Joo
Hi, Does anyone have experience or solution how to warm up the solr instance on the tomcat automatically? I am using Apache 2 for load balancer and 3 Tomcat machines running Solr. If one of tomcat is needed to shutdown and startup again, the solr should be warm up before serving the request. Tha

Re: solr 1.3

2008-01-21 Thread Mike Klaas
Lance, That is a murky area, legally. Apache requires a considerable amount of auditing and process dedicated to anything called a "release". Nightly svn builds have a special exemption. Creating an svn label "clean for general use" veers slightly in the direction of a "release". If s

RE: solr 1.3

2008-01-21 Thread Lance Norskog
Would somone please consider marking a label on the Subversion repository that says, "This is a clean version"? I only do HTTP requests and have no custom software, so I don't care about internal interfaces changing. Thanks, Lance Norskog -Original Message- From: Mike Klaas [mailto:[EMA

Help - corrupted field in index

2008-01-21 Thread Lance Norskog
I have an 'integer' static field in my schema. Some the index for this field is corrupted. When I search on this field it works. When I use this field to sort against, I get this exception. Does this mean that there is a string in one of my entries? It is possible the field was not required or defa

Re: solr 1.3

2008-01-21 Thread Mike Klaas
On 20-Jan-08, at 5:07 PM, anuvenk wrote: when will this be released? where can i find the list of improvements/enhancements in 1.3 if its been documented already? see http://svn.apache.org/viewvc/lucene/solr/trunk/CHANGES.txt? view=markup We're not sure on a timeframe for release yet. -Mi

Re: Multisearching with Solr

2008-01-21 Thread Erick Erickson
Yep, it's from the SOLR user list. Well, not really. I mistakenly copied my gmail url when I was looking at the relevant post, which *of course* you can't access http://svn.apache.org/repos/asf/lucene/solr/trunk or http://lucene.apache.org/solr/version_control.html Sorry 'bout that. Erick

DisMax and Search Components

2008-01-21 Thread Doug Steigerwald
Is there any support for DisMax (or any search request handlers) in search components, or is that something that still needs to be done? It seems like it isn't supported at the moment. We want to be able to use a field collapsing component (https://issues.apache.org/jira/browse/SOLR-236), but

Re: spellcheckhandler

2008-01-21 Thread anuvenk
I did try with the latest nightly build. The problem still exists. I tested with the example data that comes with solr package. 1)with termsourcefield set to 'word' which is string fieldtype q=iped nano returns 'ipod nano' which is good 2) with termsourcefield set to 'spell' (which is the ca

Re: Multisearching with Solr

2008-01-21 Thread David Pratt
Hi Erick. Thank you for your reply. Unfortunately, I cannot access the link you provided. It this message from the solr-user list? Many thanks. Regards, David Erick Erickson wrote: You can always use the trunk build, but you'll have to check the status of SOLR-303 to be sure it's in the trunk.

Re: Newbie with Java + typo

2008-01-21 Thread Daniel Andersson
On Jan 21, 2008, at 4:53 PM, Michael Kimsal wrote: As a fellow 'non-java' person I feel your pain (well, felt it anyway). A lot depends on your load and the machine, but I successfully ran the stock jetty system on a box last summer for work and didn't have performance problems. Perf

Re: Newbie with Java + typo

2008-01-21 Thread Brian Whitman
On Jan 21, 2008, at 11:13 AM, Daniel Andersson wrote: Well, no. "Immutable Page", and as far as I know (english not being my mother tongue), that means I can't edit the page You need to create an account first.

Re: Newbie with Java + typo

2008-01-21 Thread Daniel Andersson
On Jan 21, 2008, at 5:00 PM, Ryan McKinley wrote: Daniel Andersson wrote: Hi people First the typo on http://wiki.apache.org/solr/mySolr: "Production Typically it's not recommended do have your front end" it should probably be "..recommended To have.." you can edit any of the wiki pages... f

Re: Multisearching with Solr

2008-01-21 Thread Erick Erickson
You can always use the trunk build, but you'll have to check the status of SOLR-303 to be sure it's in the trunk... Here's a thread that discusses this... http://mail.google.com/mail/?zx=wmtcqx3ngeup&shva=1#label/Solr/11799e3704804489 Best Erick On Jan 21, 2008 10:55 AM, David Pratt <[EMAIL PRO

Re: Newbie with Java + typo

2008-01-21 Thread Ryan McKinley
Daniel Andersson wrote: Hi people First the typo on http://wiki.apache.org/solr/mySolr: "Production Typically it's not recommended do have your front end" it should probably be "..recommended To have.." you can edit any of the wiki pages... fixing typos is a great contribution! As a newbi

Multisearching with Solr

2008-01-21 Thread David Pratt
Hi. I am checking out solr after having some experience with lucene using pyLucene. I am looking at the potential of solr to search over a large index divided over multiple servers to collect results, sort of what the parallel multisearcher does in Lucene on its own. From quick scan of archives

Re: Newbie with Java + typo

2008-01-21 Thread Michael Kimsal
Daniel: As a fellow 'non-java' person I feel your pain (well, felt it anyway). A lot depends on your load and the machine, but I successfully ran the stock jetty system on a box last summer for work and didn't have performance problems. The bigger issue was from the other java people complaining

Newbie with Java + typo

2008-01-21 Thread Daniel Andersson
Hi people First the typo on http://wiki.apache.org/solr/mySolr: "Production Typically it's not recommended do have your front end" it should probably be "..recommended To have.." Second, I don't know much about Java, nor about Jetty/Resin/JBoss/ Tomcat. I went through the tutorial and was im

Re: Term vector

2008-01-21 Thread Grant Ingersoll
Term vectors are, to some extent, the opposite of the inverted index. They store term, position and offset (the latter two are optional) on a per document basis, such that you can say "give me the terms, position and offsets for document X". In terms of MLT, they are used to figure out wh

Re: Update the index

2008-01-21 Thread farhanali
updating a document in Solr index does not require any tag just post the document with the same id it will be updated. -- View this message in context: http://www.nabble.com/Update-the-index-tp14991443p14994095.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Storing Related Data - At Different Times

2008-01-21 Thread Gavin
Hi Otis, Thanks. Was thinking along those lines. But having two indexes will hurt my search. 1 . Searching fields that belong only to the personal details should result in 5 resumes begin shown for the guy (if he has 5). But now it will only show 1 link to the personal details and no resu