Re: Clarification on facets

2008-08-18 Thread Gene Campbell
Thank you for the response. Always nice to have someone willing to validate your thinking! Of course, if anyone has any ideas on how to get the number of times a term is repeated in a document, I'm all ears. cheers gene On Tue, Aug 19, 2008 at 1:42 PM, Norberto Meijome <[EMAIL PROTECTED]> wro

Re: Deadlock in lucene?

2008-08-18 Thread Yonik Seeley
It's not a deadlock (just a synchronization bottleneck), but it is a known issue in Lucene and there has been some progress in improving the situation. -Yonik On Mon, Aug 18, 2008 at 10:55 PM, Matthew Runo <[EMAIL PROTECTED]> wrote: > Hello folks! > > I was just wondering if anyone else has seen

Deadlock in lucene?

2008-08-18 Thread Matthew Runo
Hello folks! I was just wondering if anyone else has seen this issue under heavy load. We had some servers set to very high thread limits (12 core servers with 32 gigs of ram), and found several threads would end up in this state Name: http-8080-891 State: BLOCKED on [EMAIL PROTECTED]

RE: .wsdl for example....

2008-08-18 Thread Lance Norskog
Various Java web service libraries come with 'wsdl2java' and 'java2wsdl' programs. You just run 'java2wsdl' on the Java soap description. -Original Message- From: Ryan McKinley [mailto:[EMAIL PROTECTED] Sent: Monday, August 18, 2008 6:53 PM To: solr-user@lucene.apache.org Subject: Re: .w

Re: .wsdl for example....

2008-08-18 Thread Ryan McKinley
check SolrSharp http://wiki.apache.org/solr/SolrSharp On Aug 18, 2008, at 9:23 PM, Norberto Meijome wrote: On Mon, 18 Aug 2008 19:08:24 -0300 "Alexander Ramos Jardim" <[EMAIL PROTECTED]> wrote: Do you wanna a full web service for SOLR example? How a .wsdl will help you? Why don't you use t

Re: Clarification on facets

2008-08-18 Thread Norberto Meijome
On Tue, 19 Aug 2008 10:18:12 +1200 "Gene Campbell" <[EMAIL PROTECTED]> wrote: > Is this interpreted as meaning, there are 10 documents that will match > with 'car' in the title, and likewise 6 'boat' and 2 'bike'? Correct. > If so, is there any way to get counts for the *number times* a value >
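To illustrate the distinction confirmed in this reply, here is a minimal Python sketch that contrasts a facet-style count (how many documents contain a term) with a raw occurrence count (how many times the term appears overall). The toy corpus and the function names are assumptions made up for illustration; this is not how Solr computes facets internally.

```python
from collections import Counter

# Hypothetical toy corpus: each document is a list of title tokens.
docs = [
    ["car", "car", "boat"],
    ["car", "bike"],
    ["boat"],
]

def facet_counts(documents):
    """Count the documents containing each term (what a facet count reports)."""
    counts = Counter()
    for tokens in documents:
        counts.update(set(tokens))  # each document counts at most once per term
    return counts

def occurrence_counts(documents):
    """Count every occurrence of each term across all documents."""
    counts = Counter()
    for tokens in documents:
        counts.update(tokens)
    return counts
```

In this toy data 'car' appears in two documents but three times in total, so the two functions disagree for that term, which is exactly the gap the original question asks about.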

Re: .wsdl for example....

2008-08-18 Thread Norberto Meijome
On Tue, 19 Aug 2008 11:23:48 +1000 Norberto Meijome <[EMAIL PROTECTED]> wrote: > On Mon, 18 Aug 2008 19:08:24 -0300 > "Alexander Ramos Jardim" <[EMAIL PROTECTED]> wrote: > > > Do you wanna a full web service for SOLR example? How a .wsdl will help you? > > Why don't you use the HTTP interface SOL

Re: .wsdl for example....

2008-08-18 Thread Norberto Meijome
On Mon, 18 Aug 2008 19:08:24 -0300 "Alexander Ramos Jardim" <[EMAIL PROTECTED]> wrote: > Do you wanna a full web service for SOLR example? How a .wsdl will help you? > Why don't you use the HTTP interface SOLR provides? > > Anyways, if you need to develop a web service (SOAP compliant) to access

Re: hello, a question about solr.

2008-08-18 Thread finy finy
thanks for your help. could you give me your gmail talk address or msn? 2008/8/19, Norberto Meijome <[EMAIL PROTECTED]>: > > On Mon, 18 Aug 2008 23:07:19 +0800 > "finy finy" <[EMAIL PROTECTED]> wrote: > > > because i use chinese character, for example "ibm笔记本电脑" > > solr will parse it

Re: hello, a question about solr.

2008-08-18 Thread Norberto Meijome
On Mon, 18 Aug 2008 23:07:19 +0800 "finy finy" <[EMAIL PROTECTED]> wrote: > because i use chinese character, for example "ibm笔记本电脑" > solr will parse it into a term "ibm" and a phrase "笔记本 电脑" > can i use solr to query with a term "ibm" and a term "笔记本" and a term > "电脑

Re: Solr won't start under jetty on RHEL5.2

2008-08-18 Thread Jon Drukman
Jon Drukman wrote: I just migrated my solr instance to a new server, running RHEL5.2. I installed java from yum but I suspect it's different from the one I used to use. Turns out my instincts were correct. The version from yum does not work. I installed the official sun jdk and now it star

Solr won't start under jetty on RHEL5.2

2008-08-18 Thread Jon Drukman
I just migrated my solr instance to a new server, running RHEL5.2. I installed java from yum but I suspect it's different from the one I used to use. Anyway, my Solr no longer works. 2008-08-18 18:01:12.079::INFO: Logging to STDERR via org.mortbay.log.StdErrLog 2008-08-18 18:01:12.229::INF

Re: Restrict Wildcards

2008-08-18 Thread Erlend Hamnaberg
I will try this tomorrow. Thanks for the suggestion. - Erlend On Mon, Aug 18, 2008 at 5:01 PM, Otis Gospodnetic < [EMAIL PROTECTED]> wrote: > Erlend, > > This doesn't work with "string"? Maybe something there is removing > numbers. Have you tried with an example without numbers? > e.g. fooaaa

Clarification on facets

2008-08-18 Thread Gene Campbell
I have to check I understand this right If I have the following response from a search like this http://&facet.field=title&facet.limit=-1&facet.mincount=1 10 6 2 Is this interpreted as meaning, there are 10 documents that will match with 'car' in the
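For reference, a request carrying the facet parameters quoted above could be assembled roughly as follows. This is only a sketch: the base URL, the `q` value and the `facet=true` switch are assumptions, since the host and query part of the original URL are elided in the post.

```python
from urllib.parse import urlencode

def facet_query_url(base_url, query, field, limit=-1, mincount=1):
    """Build a select URL that asks for facet counts on one field."""
    params = [
        ("q", query),                   # assumed main query, not shown in the post
        ("facet", "true"),              # assumed: faceting has to be switched on
        ("facet.field", field),
        ("facet.limit", limit),         # -1 asks for all facet values
        ("facet.mincount", mincount),   # 1 hides values with a zero count
    ]
    return base_url + "?" + urlencode(params)

# Hypothetical host and query, for illustration only.
url = facet_query_url("http://localhost:8983/solr/select", "title:car", "title")
```

Using `urlencode` keeps the colon in the query escaped, so the parameters survive being pasted into a browser or passed to an HTTP client.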

Re: .wsdl for example....

2008-08-18 Thread Alexander Ramos Jardim
Do you wanna a full web service for SOLR example? How a .wsdl will help you? Why don't you use the HTTP interface SOLR provides? Anyways, if you need to develop a web service (SOAP compliant) to access SOLR, just remember to use an embedded core on your webservice. 2008/8/18 Norberto Meijome <[EM

Re: Localisation, faceting

2008-08-18 Thread Pierre Auslaender
Excellent point about the saved queries. Thanks! So I could sniff the locale (from the HTML page or the Java application,...) and infer the "query language", or try to do automatic "guessing" of the language based on the operator names (if they don't collide with indexed terms). This brings up

Re: Order of returned fields

2008-08-18 Thread Alexander Ramos Jardim
Hey Pierre, I don't know if my case helps you, but what I do to keep relational information is to put the related data all in the same field. Let me give you an example: I have a product index. Each product has a list of manufacturer properties, like dimensions, color, connections supported (usb

Re: Localisation, faceting

2008-08-18 Thread Walter Underwood
I would do it in the client, even if it meant parsing the query, modifying it, then unparsing it. This is exactly like changing "To:" to "Zu:" in a mail header. Show that in the client, but make it standard before it goes onto the network. If queries at the Solr/Lucene level are standard, then us
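The client-side approach described here can be sketched in a few lines of Python: localised operator names are mapped back to the standard ones before the query goes onto the network. The German operator names and the function name are assumptions chosen for illustration; quoted phrases are left untouched so that operator words inside a phrase are not rewritten.

```python
import re

# Hypothetical localised operator names (German), assumed for illustration.
LOCAL_OPERATORS = {"UND": "AND", "ODER": "OR", "NICHT": "NOT"}

# A token is either a double-quoted phrase or a run of non-space characters.
_TOKEN = re.compile(r'"[^"]*"|\S+')

def standardise_query(query, operators=LOCAL_OPERATORS):
    """Replace localised boolean operators with the standard ones."""
    tokens = _TOKEN.findall(query)
    return " ".join(operators.get(token, token) for token in tokens)
```

Note that this sketch collapses runs of whitespace and would also rewrite an indexed term that happens to equal an operator name, which is the collision risk raised elsewhere in this thread.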

Re: Localisation, faceting

2008-08-18 Thread Pierre Auslaender
Would that be of any interest to the SOLR / Lucene community, given the trend to globalisation / regionalisation ? My base is Switzerland - 4 official national tongues, none of them English. If one were to localise the boolean operators, would that have to be at the Lucene level, or could that

Re: Order of returned fields

2008-08-18 Thread Pierre Auslaender
Order matters in my application because I'm indexing structured data - actually, a domain object model (a bit like with Hibernate Search), only I'm adding parents to children, instead of children to parents. So say I have Cities and People, with a 1-N relationship between City and People. I'm i

Re: partialResults, distributed search & SOLR-50

2008-08-18 Thread Ian Connor
I have been using HAProxy on different ports (same IP). It seems to work but have not tested it in production yet. On Mon, Aug 18, 2008 at 12:37 PM, Yonik Seeley <[EMAIL PROTECTED]> wrote: > On Mon, Aug 18, 2008 at 12:34 PM, Brian Whitman > <[EMAIL PROTECTED]> wrote: >> On Aug 18, 2008, at 12:31 P

Re: .wsdl for example....

2008-08-18 Thread Erik Hatcher
On Aug 18, 2008, at 11:27 AM, Norberto Meijome wrote: does anyone have a .wsdl definition for the example bundled with SOLR? WSDL? surely you jest. Erik

RE: Synonyms with spaces not working

2008-08-18 Thread Steven A Rowe
Hi Matthew, On 08/18/2008 at 1:39 PM, Matthew Runo wrote: > > > [...] > ignoreCase="true" expand="true"/> > [...] I can see from SOLR-702 that most of your synonym rules have a single term/phrase on the right-hand side. The SynonymFilterFactory section of the AnalyzersTok

Re: "Auto commit error" and java.io.FileNotFoundException

2008-08-18 Thread Yonik Seeley
On Mon, Aug 18, 2008 at 6:05 AM, Michael McCandless <[EMAIL PROTECTED]> wrote: > The output from CheckIndex shows quite a few missing files! Is there any > possibility that two instances of Solr were somehow sharing the same index > directory? To eliminate that possibility, the lock factory shoul

Re: "Auto commit error" and java.io.FileNotFoundException

2008-08-18 Thread Yonik Seeley
On Mon, Aug 18, 2008 at 1:12 PM, Michael McCandless <[EMAIL PROTECTED]> wrote: > > Alas, I think this won't actually turn on IndexWriter's infoStream. > > I think you may need to modify the SolrIndexWriter.java sources, in the init > method, to add a call to setInfoStream(...). > > Can any Solr dev

Re: "Auto commit error" and java.io.FileNotFoundException

2008-08-18 Thread Fuad Efendi
Lucene v.2.1 has a bug with autocommit...

Re: Synonyms with spaces not working

2008-08-18 Thread Otis Gospodnetic
Matthew, there is a good page about synonyms on the Wiki that covers the multi-word synonyms stuff. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Matthew Runo <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org > Sent: Monday, Aug

Synonyms with spaces not working

2008-08-18 Thread Matthew Runo
Hello folks! Sorry to ask such a basic question but synonyms might be the end of me.. I suspect that there is something fundamentally wrong with the field type I've set up.. positionIncrementGap="100"> ignoreCase="true" expand="true"/> words="stopwords.txt"/> protected="protwords.txt

Re: "Auto commit error" and java.io.FileNotFoundException

2008-08-18 Thread Michael McCandless
Alas, I think this won't actually turn on IndexWriter's infoStream. I think you may need to modify the SolrIndexWriter.java sources, in the init method, to add a call to setInfoStream(...). Can any Solr developers confirm this? Mike Chris Harris wrote: I'm assuming that one way to do thi

Re: "Auto commit error" and java.io.FileNotFoundException

2008-08-18 Thread Chris Harris
I'm assuming that one way to do this would be to set the logging level to "FINEST" in the "logging" page in the solr admin tool, and then to make sure my logging.properties file is also set to record the FINEST logging level. Let me know if that won't enable the sort of debugging info you are talkin

Re: partialResults, distributed search & SOLR-5

2008-08-18 Thread Ian Connor
My interest now is beyond the initial problem and would love if someone could explain how you get from a QueryRequest being created to using the code in SolrIndexSearcher. On Mon, Aug 18, 2008 at 12:34 PM, Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > Right. And a LB that is configured to, say, m

Re: partialResults, distributed search & SOLR-50

2008-08-18 Thread Yonik Seeley
On Mon, Aug 18, 2008 at 12:34 PM, Brian Whitman <[EMAIL PROTECTED]> wrote: > On Aug 18, 2008, at 12:31 PM, Yonik Seeley wrote: > >> On Mon, Aug 18, 2008 at 12:16 PM, Otis Gospodnetic >> <[EMAIL PROTECTED]> wrote: >>> >>> Yes, as far as I know, what Brian said is correct. Also, as far as I >>> know

Re: Administrative questions

2008-08-18 Thread Otis Gospodnetic
Thanks! I put that up on http://wiki.apache.org/solr/Daemontools , so if you want to add/change anything, you can do so at any time (anyone can edit or create wiki pages). Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Jon Drukman <[EMAI

Re: partialResults, distributed search & SOLR-50

2008-08-18 Thread Otis Gospodnetic
Right. And a LB that is configured to, say, make use of Solr's ping response to determine if Solr healthy? Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Yonik Seeley <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org > Sent: Monday,
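As a rough illustration of the idea floated here, the Python sketch below pings each shard and builds a `shards` parameter from only those that answer. It assumes each shard is written as `host:port/solr` and exposes a ping handler at `/admin/ping`; the function names are hypothetical, and a real load balancer would do this check itself rather than in client code.

```python
from urllib.error import URLError
from urllib.request import urlopen

def is_healthy(shard, timeout=2.0, opener=urlopen):
    """Return True if the shard's ping handler answers with HTTP 200."""
    url = "http://" + shard + "/admin/ping"  # assumed ping path
    try:
        with opener(url, timeout=timeout) as response:
            return response.status == 200
    except (URLError, OSError):
        # Connection refused, timeout, or an HTTP error status.
        return False

def live_shards_param(shards, check=is_healthy):
    """Join the shards that pass the health check into a shards parameter."""
    return ",".join(shard for shard in shards if check(shard))
```

The `check` and `opener` arguments exist only so the logic can be exercised without a network; dropping a dead shard this way avoids the connection error but also silently narrows the result set.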

Re: partialResults, distributed search & SOLR-50

2008-08-18 Thread Brian Whitman
On Aug 18, 2008, at 12:31 PM, Yonik Seeley wrote: On Mon, Aug 18, 2008 at 12:16 PM, Otis Gospodnetic <[EMAIL PROTECTED]> wrote: Yes, as far as I know, what Brian said is correct. Also, as far as I know, there is nothing that gracefully handles problematic Solr instances during distributed s

Re: partialResults, distributed search & SOLR-50

2008-08-18 Thread Yonik Seeley
On Mon, Aug 18, 2008 at 12:16 PM, Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > Yes, as far as I know, what Brian said is correct. Also, as far as I know, > there is nothing that gracefully handles problematic Solr instances during > distributed search. Right... we punted that issue to a load b

Re: Boosting fields by default

2008-08-18 Thread Rakesh Godhani
Sweet, cool, thanks -Rakesh On 8/18/08 11:31 AM, "Shalin Shekhar Mangar" <[EMAIL PROTECTED]> wrote: > On Mon, Aug 18, 2008 at 7:12 PM, Rakesh Godhani <[EMAIL PROTECTED]> wrote: > >> >> Hi, I'm using the data import mechanism to pull data into my index. If I >> want to boost a certain field f

Re: partialResults, distributed search & SOLR-502

2008-08-18 Thread Ian Connor
When I put logging into SolrIndexSearcher just to see if we get there, I don't see any messages. However, I do see logging without a problem in QueryRequest and above. My issue is that I just cannot understand how SolrIndexSearcher comes into play here. On Mon, Aug 18, 2008 at 11:57 AM, Brian Whit

Re: partialResults, distributed search & SOLR-502

2008-08-18 Thread Otis Gospodnetic
Yes, as far as I know, what Brian said is correct. Also, as far as I know, there is nothing that gracefully handles problematic Solr instances during distributed search. Solr 1.4 request? Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From:

Re: Solr Logo thought

2008-08-18 Thread Otis Gospodnetic
I like it, even its asymmetry. :) Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Lukáš Vlček <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org > Sent: Sunday, August 17, 2008 7:02:25 PM > Subject: Re: Solr Logo thought > > Hi, > > My

Re: Localisation, faceting

2008-08-18 Thread Otis Gospodnetic
Hi, Regarding Boolean operator localization -- there was a person who submitted patches for the same functionality, but for Lucene's QueryParser. This was a few years ago. I think his patch was never applied. Perhaps that helps. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nut

Re: partialResults, distributed search & SOLR-502

2008-08-18 Thread Brian Whitman
On Aug 18, 2008, at 11:51 AM, Ian Connor wrote: On Mon, Aug 18, 2008 at 9:31 AM, Ian Connor <[EMAIL PROTECTED]> wrote: I don't think this patch is working yet. If I take a shard out of rotation (even just one out of four), I get an error: org.apache.solr.client.solrj.SolrServerException: java.

Re: partialResults, distributed search & SOLR-502

2008-08-18 Thread Ian Connor
Hi, I have traced this as far as I can figure. It does seem as though the patch is in the trunk. I can see that timeAllowed is certainly being set and the lucene class TimeLimitedCollector is being used when the param is there. However, I have tried to trace RequestHandlerBase from this stack thr

Re: Boosting fields by default

2008-08-18 Thread Shalin Shekhar Mangar
On Mon, Aug 18, 2008 at 7:12 PM, Rakesh Godhani <[EMAIL PROTECTED]> wrote: > > Hi, I'm using the data import mechanism to pull data into my index. If I > want to boost a certain field for all docs, (e.g. the title over the body) > what is the best way to do that? I was expecting to change someth

.wsdl for example....

2008-08-18 Thread Norberto Meijome
hi :) does anyone have a .wsdl definition for the example bundled with SOLR? if nobody has it, would it be useful to have one ? cheers, B _ {Beto|Norberto|Numard} Meijome Intelligence: Finding an error in a Knuth text. Stupidity: Cashing that $2.56 check you got. I spe

SimpleFacets: Performance Boost for Tokenized Fields

2008-08-18 Thread Fuad Efendi
Hello: Term Vectors could be much faster than Intersectings with FilterCache. Exception: when size of DocSet is close (more than 50%) to the total count of documents in the index. When it works (100 times faster than current; very specific scenario): - use stored Term Vectors; - 10,000,000 d
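The trade-off described in this post can be sketched with two toy counting strategies in Python. The data structures and function names below are assumptions for illustration, not Solr's SimpleFacets code: the first does one set intersection per indexed term, the second walks only the matching documents and tallies the terms stored with each one.

```python
from collections import Counter

def facet_by_intersection(term_to_docs, result_docs):
    """One set intersection per indexed term (the filter-cache style)."""
    counts = {}
    for term, doc_ids in term_to_docs.items():
        matches = len(doc_ids & result_docs)
        if matches:
            counts[term] = matches
    return counts

def facet_by_term_vectors(doc_to_terms, result_docs):
    """Walk only the matching documents and tally their stored terms."""
    counts = Counter()
    for doc_id in result_docs:
        counts.update(doc_to_terms.get(doc_id, ()))
    return dict(counts)
```

The first function's cost grows with the number of distinct terms, the second with the size of the result set, which is consistent with the exception noted above for result sets that cover a large share of the index.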

Re: Order of returned fields

2008-08-18 Thread Erik Hatcher
Yes, this is normal behavior. Does order matter in your application? Could you explain why? Order is maintained with multiple values of the same field name, though - which is important. Erik On Aug 17, 2008, at 6:38 PM, Pierre Auslaender wrote: Hello, After a Solr query, I alwa

Re: hello, a question about solr.

2008-08-18 Thread finy finy
because i use chinese character, for example "ibm笔记本电脑" solr will parse it into a term "ibm" and a phrase "笔记本 电脑" can i use solr to query with a term "ibm" and a term "笔记本" and a term "电脑"? 2008/8/18, Norberto Meijome <[EMAIL PROTECTED]>: > > On Mon, 18 Aug 2008 15:33:02 +0800 > "finy finy" <[E

Re: Restrict Wildcards

2008-08-18 Thread Otis Gospodnetic
Erlend, This doesn't work with "string"? Maybe something there is removing numbers. Have you tried with an example without numbers? e.g. fooaaa and foobbb. Does foo* match them both? If it does, then perhaps you can create a custom field type and use KeywordTokenizer in it. Example schema.

Boosting fields by default

2008-08-18 Thread Rakesh Godhani
Hi, I'm using the data import mechanism to pull data into my index. If I want to boost a certain field for all docs, (e.g. the title over the body) what is the best way to do that? I was expecting to change something in schema.xml but I don't see any info on boosting there. Thanks in advance -R

Re: hello, a question about solr.

2008-08-18 Thread Norberto Meijome
On Mon, 18 Aug 2008 15:33:02 +0800 "finy finy" <[EMAIL PROTECTED]> wrote: > the name field is text,which is analysed, i use the query > "name:ibmT63notebook" why do you search with no spaces? is this free text entered by a user, or is it part of a link which you control ? PS: please dont top-po

Re: partialResults, distributed search & SOLR-502

2008-08-18 Thread Ian Connor
I don't think this patch is working yet. If I take a shard out of rotation (even just one out of four), I get an error: org.apache.solr.client.solrj.SolrServerException: java.net.ConnectException: Connection refused org.apache.solr.common.SolrException: org.apache.solr.client.solrj.SolrServerExce

Re: IndexOutOfBoundsException

2008-08-18 Thread Michael McCandless
OK gotchya. Please keep us posted one way or another... Mike Ian Connor wrote: Hi Mike, I am currently ruling out some bad memory modules. Knowing that this is an index corruption makes memory corruption more likely. If replacing RAM does not fix the problem (which I need to do anyway due t

RE: solr doc

2008-08-18 Thread dudes dudes
thanks :) > Date: Mon, 18 Aug 2008 17:54:20 +0530 > From: [EMAIL PROTECTED] > To: solr-user@lucene.apache.org > Subject: Re: solr doc > > Take a look at http://wiki.apache.org/solr/CollectionDistribution > > On Mon, Aug 18, 2008 at 4:44 PM, dudes dudes w

Re: solr doc

2008-08-18 Thread Shalin Shekhar Mangar
Take a look at http://wiki.apache.org/solr/CollectionDistribution On Mon, Aug 18, 2008 at 4:44 PM, dudes dudes <[EMAIL PROTECTED]> wrote: > > Hello all, > > I'm looking for a doc that full-fill the following situation? > > How can two solr servers synchronised with each other ? And if one of them

Re: solr doc

2008-08-18 Thread Noble Paul നോബിള്‍ नोब्ळ्
Keep a slave handy as the second master, and if the real master goes down let the second one take over. On Mon, Aug 18, 2008 at 4:44 PM, dudes dudes <[EMAIL PROTECTED]> wrote: > > Hello all, > > I'm looking for a doc that full-fill the following situation? > > How can two solr servers synchronised w

Restrict Wildcards

2008-08-18 Thread Erlend Hamnaberg
Hi list. Is it possible to create a field type in solr that does not match with wildcard queries? I want it to only match the complete string, so if I have indexed "foo123" and "foo234" i dont want "foo*" to match any of these. This does not work with just using the predefined "string" type. An

solr doc

2008-08-18 Thread dudes dudes
Hello all, I'm looking for a doc that covers the following situation: how can two solr servers be synchronised with each other? And if one of them goes down for whatever reason, how can the other one take over... does solr have anything like master/slave takeover? any docs or suggestions are th

Re: IndexOutOfBoundsException

2008-08-18 Thread Ian Connor
Hi Mike, I am currently ruling out some bad memory modules. Knowing that this is an index corruption makes memory corruption more likely. If replacing RAM does not fix the problem (which I need to do anyway due to segmentation faults), I will package up the crash into a reproducible scenario. On

Re: "Auto commit error" and java.io.FileNotFoundException

2008-08-18 Thread Michael McCandless
The output from CheckIndex shows quite a few missing files! Is there any possibility that two instances of Solr were somehow sharing the same index directory? It looks like you are using the 2.3 version of the Lucene jar (not the trunk version). Which version of Solr are you using? Si

Re: IndexOutOfBoundsException

2008-08-18 Thread Michael McCandless
Hi Ian, I sent this to java-user, but maybe you didn't see it, so let's try again on solr-user: It looks like your stored fields file (_X.fdt) is corrupt. Are you using multiple threads to add docs? Can you try switching to SerialMergeScheduler to verify it's reproducible? When you hi

Re: Jetty Multicore installation doesn't work

2008-08-18 Thread Shalin Shekhar Mangar
It seems that you are trying to use a Solr 1.3 feature (multiple cores) with a Solr 1.2 war file. If you want to use multiple core, you must use a nightly build of Solr and take a look at the CoreAdmin page (formerly known as MultiCore) http://wiki.apache.org/solr/CoreAdmin On Mon, Aug 18, 2008

Jetty Multicore installation doesn't work

2008-08-18 Thread parthad76
Hi, I tried to run the multicore installation of Jetty after downloading it. It's throwing the following error and I am not sure why. I added the multicore.xml file in solr.home but that too doesn't work. Can someone please help? INFO: Solr home set to 'multicore/' 2008-08-18 14:18:31.796::WARN: f

Re: Can facet numbers be constrained to one result doc or a group of result docs?

2008-08-18 Thread Gene Campbell
Awesome, thanks for the quick reply!!! gene On Mon, Aug 18, 2008 at 5:33 PM, Shalin Shekhar Mangar <[EMAIL PROTECTED]> wrote: > The computed frequency will be for the current set of results (the result of > q and fq parameters). Setting facet.mincount=1 will give you only those > terms which are

Re: hello, a question about solr.

2008-08-18 Thread finy finy
the name field is text,which is analysed, i use the query "name:ibmT63notebook" 2008/8/18, Shalin Shekhar Mangar <[EMAIL PROTECTED]>: > > Hi, > > What is the type of the field "name"? > Does a query like "name:ibm OR name:T63 OR name:notebook" work for you? > > On Mon, Aug 18, 2008 at 10:43 AM, fi
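For comparison with the single-token query above, the form suggested in the quoted reply (one clause per term, joined with OR) can be generated with a small Python helper. The helper name is hypothetical, and it assumes the terms contain no characters that need escaping in the query syntax.

```python
def or_query(field, terms):
    """Build a query with one field:term clause per term, joined with OR."""
    return " OR ".join(f"{field}:{term}" for term in terms)

# The terms from the quoted suggestion.
query = or_query("name", ["ibm", "T63", "notebook"])
```

Splitting the user's input into terms before building the query sidesteps the question of how the analyser will break up a string typed without spaces.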