Re: sorting on aggregate averages

2008-04-02 Thread Umar Shah
Thanks! I'll have a look at that. On Wed, Apr 2, 2008 at 6:25 AM, Chris Hostetter [EMAIL PROTECTED] wrote: : I am computing a sorted rank list and returning a slice (for pagination) but : have to recompute the result for each request, although the actual q : parameter and fq would be

Search exact terms

2008-04-02 Thread Tim Mahy
Hi all, is there a Solr wide setting that with which I can achieve the following : if I now search for q=onderwij, I also receive documents with results of onderwijs etc.. this is ofcourse the behavior that is described but if I search on onderwij, I still get the onderwijs hits, I use for

problem with ShowFileRequestHandler

2008-04-02 Thread 李银松
Edward.Zhang had commit the problem before I want to programmatically retrieve the schema and the config from the ShowFileRequestHandler. I encounter some trouble. There are CJK characters in the xml files as follows: !-- Field to use to determine and enforce document uniqueness.

Re: Search exact terms

2008-04-02 Thread Umar Shah
If you want this behavior then the field type should not be 'text'. for default fieldtype=text there are many filters applied before the values are indexed, this includes stemming (reducing the word to root word, removing s in ur case. try using fieldtype=string instead. this will match strictly

Wildcard search + case insensitive

2008-04-02 Thread Tim Mahy
Hi all, I use this type definition in my schema.xml : fieldtype name=exactText class=solr.TextField positionIncrementGap=100 analyzer type=index tokenizer class=solr.WhitespaceTokenizerFactory/ filter class=solr.StopFilterFactory ignoreCase=true words=stopwords.txt/

java.io.FileNotFoundException?

2008-04-02 Thread Doug Steigerwald
We just started hitting a FileNotFoundException for no real apparent reason for both our regular index and our spellchecker index, and only a few minute after we restarted Solr. I did some searching and didn't find much that helped. We started to do some load testing, and after about 10

Re: How to use Solr in java program

2008-04-02 Thread khirb7
hossman wrote: : I recommend using Solr as a webservice, even if your client is Java. but : there are options for embedding Solr directly into your applications using : thank you hossman for your response,I have another question : I have writen : a small java program using

RE: Wildcard search + case insensitive

2008-04-02 Thread Tim Mahy
Hi all, I already found the answer to my question on the following blog : http://michaelkimsal.com/blog/2007/04/solr-case-sensitivty/ greetings, Tim -Oorspronkelijk bericht- Van: Tim Mahy [mailto:[EMAIL PROTECTED] Verzonden: wo 2-4-2008 13:19 Aan: solr-user@lucene.apache.org

Re: Multiple unique field?

2008-04-02 Thread Ryan McKinley
Thank you for your reply In other word, can I set 2 unique key field? directly in solr: no In your own code, yes -- either in the client or in custom plugin. ryan

Help with XmlPullParserException

2008-04-02 Thread Phillip Farber
Hello all, I'm indexing a body of OCR and encountered this exception. Apparently it's some kind of XML parser error. Out of thousands of documents, which I create with significant processing to make sure they are XML compliant, only this one appears to have a problem. But can anyone tell

Re: problem with ShowFileRequestHandler

2008-04-02 Thread Ryan McKinley
On Apr 2, 2008, at 5:03 AM, 李银松 wrote: Edward.Zhang had commit the problem before I want to programmatically retrieve the schema and the config from the ShowFileRequestHandler. I encounter some trouble. There are CJK characters in the xml files as follows: !-- Field to use to

Re: Search exact terms

2008-04-02 Thread Ryan McKinley
search is based on the fields you index and how you index them. If you index using the text field -- with stemming etc, you will have to search with the same criteria. If you want exact search, consider the string type. If you want both, you can use the copyField to copy the same content

Re: java.io.FileNotFoundException?

2008-04-02 Thread Otis Gospodnetic
Hi Doug, Sounds fishy, especially increasing/decreasing mergeFactor to funny values (try changing your OS setting instead). My guess is this is happening only with the 2 indices that are being modified and I'll guess that the FNFE is due to a bad/incomplete rsync from the master. Do

Brazilian Portuguese synonyms

2008-04-02 Thread Rogerio Pereira
Hi guys! Lucas, I would like know more about your work with support of brazilian portguese synonyms in solr. Thanks for any help. -- Yours truly (Atenciosamente), Rogério (_rogerio_) http://faces.eti.br Faça a diferença! Ajude o seu país a crescer, não retenha conhecimento, distribua e

Re: java.io.FileNotFoundException?

2008-04-02 Thread Doug Steigerwald
The user that runs our apps is configured to allow 65536 open files in limits.conf. Shouldn't even come close to that number. Solr is the only app we have running on these machines as our app user. We hit the same type of issue when we had our mergeFactor set to 40 for all of our indexes. We

Re: Help with XmlPullParserException

2008-04-02 Thread Phillip Farber
I just looked at this again and I think the problem is that the message is referring to the garbage string of characters 2#1a where #1a looks like a decimal numeric character reference but the letter 'a' is a hex digit. I'll have to go back to my OCR cleanup routine ... Thanks for reading.

Re: Brazilian Portuguese synonyms

2008-04-02 Thread Lucas F. A. Teixeira
Synonyms support? Actually, we just have a big list of portuguese synonyms. I was talking about a portuguese steemer. Interested? Anything, just mail me @ [EMAIL PROTECTED] []s, Lucas Rogerio Pereira wrote: Hi guys! Lucas, I would like know more about your work with support of brazilian

Re: Wildcard search + case insensitive

2008-04-02 Thread Matthew Runo
Hmm. I'd like the ability to turn on or off in the config case sensitivity... I'm looking forward to this patch. Thanks! Matthew Runo Software Developer Zappos.com 702.943.7833 On Apr 2, 2008, at 5:48 AM, Tim Mahy wrote: Hi all, I already found the answer to my question on the following

Re: Brazilian Portuguese synonyms

2008-04-02 Thread Rogerio Pereira
Yes! 2008/4/2, Lucas F. A. Teixeira [EMAIL PROTECTED]: Synonyms support? Actually, we just have a big list of portuguese synonyms. I was talking about a portuguese steemer. Interested? Anything, just mail me @ [EMAIL PROTECTED] []s, Lucas Rogerio Pereira wrote: Hi guys!

numDocs and maxDoc

2008-04-02 Thread Vinci
Hi, I am trying to update the index by 2 stage posting: part of the index will be posted in stage 1 by 1.xml, then after a meanwhiles the left of the index of the entry will be posted by 2.xml. Assume both 1.xml and 2.xml have 3 document and id is used as unique field, what I see in the admin

Re: Indexing a word in url

2008-04-02 Thread Simon Rosenthal
I also couldn't get the exact results I wanted for indexing URL components using WordDelimeterFilter or patternTokenizer, so resorted to adding a new field ('pathparts'), plus a few lines of code to generate the tokens in our content preprocessor which submits documents to SOLR for indexing.

Re: numDocs and maxDoc

2008-04-02 Thread Mike Klaas
On 2-Apr-08, at 11:29 AM, Vinci wrote: Hi, I am trying to update the index by 2 stage posting: part of the index will be posted in stage 1 by 1.xml, then after a meanwhiles the left of the index of the entry will be posted by 2.xml. Assume both 1.xml and 2.xml have 3 document and id is

Re: Wildcard search + case insensitive

2008-04-02 Thread Chris Hostetter
: Hmm. I'd like the ability to turn on or off in the config case sensitivity... : I'm looking forward to this patch. FYI: here's the relevant issue... http://issues.apache.org/jira/browse/SOLR-218 NOTE: no one has ever contributed any patches to address this problem. (although yonik

Re: numDocs and maxDoc

2008-04-02 Thread Chris Hostetter
: I am trying to update the index by 2 stage posting: part of the index will : be posted in stage 1 by 1.xml, then after a meanwhiles the left of the index : of the entry will be posted by 2.xml. Assume both 1.xml and 2.xml have 3 : document and id is used as unique field, what I see in the admin

RE: Search exact terms

2008-04-02 Thread Norskog, Lance
This is confusing advice to a beginner. A string field will not find a word in the middle of a sentence. To get normal searches without this confusions, copy the 'text' type and make a variant without the Stemmer. The problem is that you are using an English language stemmer for what appears to

dataimport handler multiple databases

2008-04-02 Thread Ismail Siddiqui
Hi I have a situaion where I am using dataimport handler with development db and going to use it with production database in production environment I have entry in solr-config.xml like this requestHandler name=/dataimport class=org.apache.solr.handler.dataimport.DataImportHandler lst

searching like RDBMS way

2008-04-02 Thread Sunil . Sarje
This is very general requirement and I am sure somebody might have thought about the solution. Sample scenario to explain my question --- There is a many-to-many relationship between 2 entities - Sales Person Client One sales person can work for many clients. One Client may be served by many

Re: problem with ShowFileRequestHandler

2008-04-02 Thread 李银松
Thanks Ryan 2008/4/2, Ryan McKinley [EMAIL PROTECTED]: On Apr 2, 2008, at 5:03 AM, 李银松 wrote: Edward.Zhang had commit the problem before I want to programmatically retrieve the schema and the config from the ShowFileRequestHandler. I encounter some trouble. There are CJK

Re: searching like RDBMS way

2008-04-02 Thread Norberto Meijome
On Wed, 2 Apr 2008 15:31:43 -0500 [EMAIL PROTECTED] wrote: This is very general requirement and I am sure somebody might have thought about the solution. Hi Sunil, - please don't hijack the thread :) - why don't you use the right tool for the problem? from what you said, a RDBMS sounds like

Re: dataimport handler multiple databases

2008-04-02 Thread Noble Paul നോബിള്‍ नोब्ळ्
each entity has an optional attribute called dataSource. If you have multiple dataSources give them a name and use the name is dataSource .So you solrconfig must look like requestHandler name=/dataimport class=org.apache.solr.handler.dataimport.DataImportHandler lst name=defaults str

Re: numDocs and maxDoc

2008-04-02 Thread Vinci
Hi, Thanks hossman, this is exactly what I want to do. Final question: so I need to merge the field by myself first? (Actually my original plan is to do 2 consecutive postingso merging is possible) Thank you, Vinci hossman wrote: : I am trying to update the index by 2 stage posting:

Re: Multiple unique field?

2008-04-02 Thread Vinci
Hi, Thank you for your reply. When I set 2 unique key field, it looks like Solr only accept the first definition in schema.xml...question: so once the unique Key defined, it can't be overrided? Thank you, Vinci ryantxu wrote: Thank you for your reply In other word, can I set 2 unique key