Re: Storing Related Data - At Different Times

2008-01-21 Thread Gavin
Hi Otis, Thanks. Was thinking along those lines. But having two indexes will hurt my search. 1 . Searching fields that belong only to the personal details should result in 5 resumes begin shown for the guy (if he has 5). But now it will only show 1 link to the personal details and no

Re: Term vector

2008-01-21 Thread Grant Ingersoll
Term vectors are, to some extent, the opposite of the inverted index. They store term, position and offset (the latter two are optional) on a per document basis, such that you can say give me the terms, position and offsets for document X. In terms of MLT, they are used to figure out

Newbie with Java + typo

2008-01-21 Thread Daniel Andersson
Hi people First the typo on http://wiki.apache.org/solr/mySolr: Production Typically it's not recommended do have your front end it should probably be ..recommended To have.. Second, I don't know much about Java, nor about Jetty/Resin/JBoss/ Tomcat. I went through the tutorial and was

Re: Newbie with Java + typo

2008-01-21 Thread Michael Kimsal
Daniel: As a fellow 'non-java' person I feel your pain (well, felt it anyway). A lot depends on your load and the machine, but I successfully ran the stock jetty system on a box last summer for work and didn't have performance problems. The bigger issue was from the other java people

Multisearching with Solr

2008-01-21 Thread David Pratt
Hi. I am checking out solr after having some experience with lucene using pyLucene. I am looking at the potential of solr to search over a large index divided over multiple servers to collect results, sort of what the parallel multisearcher does in Lucene on its own. From quick scan of

Re: Newbie with Java + typo

2008-01-21 Thread Ryan McKinley
Daniel Andersson wrote: Hi people First the typo on http://wiki.apache.org/solr/mySolr: Production Typically it's not recommended do have your front end it should probably be ..recommended To have.. you can edit any of the wiki pages... fixing typos is a great contribution! As a newbie,

Re: Multisearching with Solr

2008-01-21 Thread Erick Erickson
You can always use the trunk build, but you'll have to check the status of SOLR-303 to be sure it's in the trunk... Here's a thread that discusses this... http://mail.google.com/mail/?zx=wmtcqx3ngeupshva=1#label/Solr/11799e3704804489 Best Erick On Jan 21, 2008 10:55 AM, David Pratt [EMAIL

Re: Newbie with Java + typo

2008-01-21 Thread Brian Whitman
On Jan 21, 2008, at 11:13 AM, Daniel Andersson wrote: Well, no. Immutable Page, and as far as I know (english not being my mother tongue), that means I can't edit the page You need to create an account first.

Re: Newbie with Java + typo

2008-01-21 Thread Daniel Andersson
On Jan 21, 2008, at 4:53 PM, Michael Kimsal wrote: As a fellow 'non-java' person I feel your pain (well, felt it anyway). A lot depends on your load and the machine, but I successfully ran the stock jetty system on a box last summer for work and didn't have performance problems.

Re: Multisearching with Solr

2008-01-21 Thread David Pratt
Hi Erick. Thank you for your reply. Unfortunately, I cannot access the link you provided. It this message from the solr-user list? Many thanks. Regards, David Erick Erickson wrote: You can always use the trunk build, but you'll have to check the status of SOLR-303 to be sure it's in the

Re: spellcheckhandler

2008-01-21 Thread anuvenk
I did try with the latest nightly build. The problem still exists. I tested with the example data that comes with solr package. 1)with termsourcefield set to 'word' which is string fieldtype q=iped nano returns 'ipod nano' which is good 2) with termsourcefield set to 'spell' (which is the

DisMax and Search Components

2008-01-21 Thread Doug Steigerwald
Is there any support for DisMax (or any search request handlers) in search components, or is that something that still needs to be done? It seems like it isn't supported at the moment. We want to be able to use a field collapsing component (https://issues.apache.org/jira/browse/SOLR-236), but

Re: Multisearching with Solr

2008-01-21 Thread Erick Erickson
Yep, it's from the SOLR user list. Well, not really. I mistakenly copied my gmail url when I was looking at the relevant post, which *of course* you can't access http://svn.apache.org/repos/asf/lucene/solr/trunk or http://lucene.apache.org/solr/version_control.html Sorry 'bout that. Erick

Re: solr 1.3

2008-01-21 Thread Mike Klaas
On 20-Jan-08, at 5:07 PM, anuvenk wrote: when will this be released? where can i find the list of improvements/enhancements in 1.3 if its been documented already? see http://svn.apache.org/viewvc/lucene/solr/trunk/CHANGES.txt? view=markup We're not sure on a timeframe for release yet.

RE: solr 1.3

2008-01-21 Thread Lance Norskog
Would somone please consider marking a label on the Subversion repository that says, This is a clean version? I only do HTTP requests and have no custom software, so I don't care about internal interfaces changing. Thanks, Lance Norskog -Original Message- From: Mike Klaas

Re: Missing Content Stream

2008-01-21 Thread Ismail Siddiqui
I am trying solrj to index.. using follwing code String url = http://localhost:8080/solr;; SolrServer server = new CommonsHttpSolrServer( url ); its giving error that undifined symbol for constructor(string). can somoen tell me why this constructor thrwoing error while in source file i can

Is it possible to have append kind update operation?

2008-01-21 Thread zqzuk
Hi, is it possible to have append like updates, where if two records of same id's are posted to solr, the contents of the two merges and composes a single record with the id? I am asking because my program works in a multi-thread manner where several threads produces serveral parts of a final

illegal characters in xml file to be posted?

2008-01-21 Thread zqzuk
Hi, I am using the SimplePostTool to post files to solr. I have encoutered some problem with the content of xml files. I noticed that if my xml file has fields whose values contain the character or or , the post fails and I get the exception : javax.xml.stream.XMLStreamException: ParseError at

RE: illegal characters in xml file to be posted?

2008-01-21 Thread Binkley, Peter
You should encode those three characters, and it doesn't hurt to encode the ampersand and double-quote characters too: http://en.wikipedia.org/wiki/XML#Entity_references Peter -Original Message- From: zqzuk [mailto:[EMAIL PROTECTED] Sent: Monday, January 21, 2008 2:24 PM To:

Wildcards

2008-01-21 Thread dojolava
Hello, I just started to use solr and I experience strange behaviour when it comes to wildcards. When I use the StandardRequestHandler queries like eur?p?an or eur*an work fine. But garden?r or admini*tion do not bring any results (without wildcards there are some of course). All affected

RE: illegal characters in xml file to be posted?

2008-01-21 Thread zqzuk
Thanks for the quick advice! pbinkley wrote: You should encode those three characters, and it doesn't hurt to encode the ampersand and double-quote characters too: http://en.wikipedia.org/wiki/XML#Entity_references Peter -Original Message- From: zqzuk [mailto:[EMAIL

Re: Wildcards

2008-01-21 Thread Yonik Seeley
On Jan 21, 2008 5:18 PM, dojolava [EMAIL PROTECTED] wrote: I just started to use solr and I experience strange behaviour when it comes to wildcards. When I use the StandardRequestHandler queries like eur?p?an or eur*an work fine. But garden?r or admini*tion do not bring any results (without

Re: Wildcards

2008-01-21 Thread dojolava
Thanks a lot! I checked it, when I search for g?rden it works, only g?rdener does not... I will try the copyField solution. On Jan 21, 2008 11:23 PM, Yonik Seeley [EMAIL PROTECTED] wrote: On Jan 21, 2008 5:18 PM, dojolava [EMAIL PROTECTED] wrote: I just started to use solr and I experience

Re: DisMax and Search Components

2008-01-21 Thread Charles Hornberger
On Jan 21, 2008 10:23 AM, Doug Steigerwald [EMAIL PROTECTED] wrote: Is there any support for DisMax (or any search request handlers) in search components, or is that something that still needs to be done? It seems like it isn't supported at the moment. I was curious about this, too ... If

Re: DisMax and Search Components

2008-01-21 Thread Yonik Seeley
The QueryComponent supports both lucene queryparser syntax and dismax query syntax. The dismax request handler now simply sets defType (the default base query type) to dismax -Yonik On Jan 21, 2008 1:23 PM, Doug Steigerwald [EMAIL PROTECTED] wrote: Is there any support for DisMax (or any search

Re: DisMax and Search Components

2008-01-21 Thread Doug Steigerwald
We've found a way to work around it. In our search components, we're doing something like: defType = defType == null ? DisMaxQParserPlugin.NAME : defType; If you add defType=dismax to the query string, it'll use the DisMaxQParserPlugin. Unfortunately, I haven't been able to figure out an

Re: DisMax and Search Components

2008-01-21 Thread Yonik Seeley
On Jan 21, 2008 9:06 PM, Doug Steigerwald [EMAIL PROTECTED] wrote: We've found a way to work around it. In our search components, we're doing something like: defType = defType == null ? DisMaxQParserPlugin.NAME : defType; Would it be easier to just add it as a default parameter in the

Re: DisMax and Search Components

2008-01-21 Thread Doug Steigerwald
We don't always want to use the dismax handler in our setup. Doug Yonik Seeley wrote: On Jan 21, 2008 9:06 PM, Doug Steigerwald [EMAIL PROTECTED] wrote: We've found a way to work around it. In our search components, we're doing something like: defType = defType == null ?

RE: copyField limitation

2008-01-21 Thread Lance Norskog
Sorting on a non-integer has space problems. As I understand it, sorting creates an array of integers the size of the number of records in the entire index. Sorting on a non-integer type also creates a separate array of the same size with the field data copied into it. Thus sorting a non-integer

OOE during indexing

2008-01-21 Thread Marcus Herou
Hi. I get OOE with Solr 1.3 Autowarm seem to be the villain in cojunction with FieldCache somehow. JVM args: -Xmx512m -Xms512m -Xss128k Index size is ~4 Million docs, where I index text and store database primary keys. du /srv/solr/feedItem/data/index/ 1.7G/srv/solr/feedItem/data/index/ To