Re: Mulitple facet - fq

2010-10-21 Thread Yavuz Selim YILMAZ
Thnx guys. -- Yavuz Selim YILMAZ 2010/10/20 Tim Gilbert tim.gilb...@morningstar.com Sorry, what Pradeep said, not Prasad. My apologies Pradeep. -Original Message- From: Tim Gilbert Sent: Wednesday, October 20, 2010 12:18 PM To: 'solr-user@lucene.apache.org' Subject: RE:

Re: RAM increase

2010-10-21 Thread Gora Mohanty
On Thu, Oct 21, 2010 at 10:46 AM, satya swaroop satya.yada...@gmail.com wrote: Hi all,              I increased my RAM size to 8GB and i want 4GB of it to be used for solr itself. can anyone tell me the way to allocate the RAM for the solr. [...] You will need to set up the allocation of

Re:why sorl is slower than lucene so much?

2010-10-21 Thread kafka0102
I found the problem's cause.It's the DocSetCollector. my fitler query result's size is about 300,so the DocSetCollector.getDocSet() is OpenBitSet. And 300 OpenBitSet.fastSet(doc) op is too slow. So I used SolrIndexSearcher's TopFieldDocs search(Query query, Filter filter, int n,

Using a custom repository to store solr index files

2010-10-21 Thread Tharindu Mathew
Hi everyone, I was looking at using the Embedded Solr server through SolrJ and I have a couple of concerns. I'd like to use a custom repository to store my index. Is there a way I can define this. Is there a data output interface I can implement for this purpose? Or can this be done in some

A bug in ComplexPhraseQuery ?

2010-10-21 Thread jmr
Hi, We have installed ComplexPhraseQuery and since that we can see strange behaviour in proximity search. We have the 2 following queries: (text:(protein digest~50)) (text:(digest protein~50)) Without ComplexPhraseQuery, both queries are returning 6 documents matching. With ComplexPhraseQuery,

Re: RAM increase

2010-10-21 Thread Jean-Sebastien Vachon
You will also need to switch to a 64 bits JVM You might have to add the `-d64` flag as well as the `-Xms` and `-Xmx` - Original Message - From: Gora Mohanty g...@mimirtech.com To: solr-user@lucene.apache.org Sent: Thursday, October 21, 2010 2:34 AM Subject: Re: RAM increase On Thu,

Re: Using a custom repository to store solr index files

2010-10-21 Thread Upayavira
On Thu, 21 Oct 2010 14:42 +0530, Tharindu Mathew mcclou...@gmail.com wrote: Hi everyone, I was looking at using the Embedded Solr server through SolrJ and I have a couple of concerns. I'd like to use a custom repository to store my index. Is there a way I can define this. Is there a

MoreLikeThis explanation?

2010-10-21 Thread darren
Hi, Does the latest Solr provide an explanation for results returned by MLT? I want to get the interesting terms for each result that overlap with the source document. This set of terms will vary from result to result possibly. Thanks! Darren

Re: Import From MYSQL database

2010-10-21 Thread virtas
You need to look into actual logs of the system. There you will see more details why import failed. check tomcat or jetty logs -- View this message in context: http://lucene.472066.n3.nabble.com/Import-From-MYSQL-database-tp1738753p1745246.html Sent from the Solr - User mailing list archive

Re: MoreLikeThis explanation?

2010-10-21 Thread Koji Sekiguchi
(10/10/21 20:33), dar...@ontrenet.com wrote: Hi, Does the latest Solr provide an explanation for results returned by MLT? No, but there is an open issue: https://issues.apache.org/jira/browse/SOLR-860 Koji -- http://www.rondhuit.com/en/

FieldCache

2010-10-21 Thread Mathias Walter
Hi, does a field which should be cached needs to be indexed? I have a binary field which is just stored. Retrieving it via FieldCache.DEFAULT.getTerms returns empty ByteRefs. Then I found the following post: http://www.mail-archive.com/d...@lucene.apache.org/msg05403.html How can I use the

Re: why sorl is slower than lucene so much?

2010-10-21 Thread Yonik Seeley
2010/10/21 kafka0102 kafka0...@163.com: I found the problem's cause.It's the DocSetCollector. my fitler query result's size is about 300,so the DocSetCollector.getDocSet() is OpenBitSet. And 300 OpenBitSet.fastSet(doc) op is too slow. As I said in my other response to you, that's a

Re: MoreLikeThis explanation?

2010-10-21 Thread Darren Govoni
Thank you! On Thu, 2010-10-21 at 23:03 +0900, Koji Sekiguchi wrote: (10/10/21 20:33), dar...@ontrenet.com wrote: Hi, Does the latest Solr provide an explanation for results returned by MLT? No, but there is an open issue: https://issues.apache.org/jira/browse/SOLR-860 Koji

Re: RAM increase

2010-10-21 Thread Jonathan Rochkind
Jean-Sebastien Vachon wrote: You will also need to switch to a 64 bits JVM You might have to add the `-d64` flag as well as the `-Xms` and `-Xmx` I've actually had no luck googling what's up with the -d64. Can you point me to any documentation on what effect it has, and on particular on

Re: RAM increase

2010-10-21 Thread Dennis Gearon
Everything ovger ~3.7 3.7GB RAM (2^32, use your calculator) needs 64 bit addressing. Dennis Gearon Signature Warning It is always a good idea to learn from your own mistakes. It is usually a better idea to learn from others’ mistakes, so you do not have to make them yourself.

DistributedSearchDesign and multiple requests

2010-10-21 Thread Jeff Wartes
I'm using Solr 1.4. My observations and this page http://wiki.apache.org/solr/DistributedSearchDesign#line-254 indicate that the general strategy for Distributed Search is something like: 1. Query the shards with the user's query and fl=unique_field,score 2. Re-query (maybe a

RE: RAM increase

2010-10-21 Thread Steven A Rowe
Memory limits info: http://www.oracle.com/technetwork/java/hotspotfaq-138619.html#gc_heap_32bit -d64 usage info: http://stackoverflow.com/questions/1443677/what-impact-if-any-does-the-d64-swtich-have-on-sun-jvm-resident-memory-usage Steve -Original Message- From: Dennis Gearon

how well does multicore scale?

2010-10-21 Thread mike anderson
I'm exploring the possibility of using cores as a solution to bookmark folders in my solr application. This would mean I'll need tens of thousands of cores... does this seem reasonable? I have plenty of CPUs available for scaling, but I wonder about the memory overhead of adding cores (aside from

[solrmarc-tech] JVM XX:+UseCompressedOops

2010-10-21 Thread Jonathan Rochkind
Is anyone using the newish JVM XX:+UseCompressedOops with Solr? Do you have reason to believe it's helpful? Is there any way it can be harmful? I am hoping it reduces my memory consumption somewhat. An old thread with someone asking the same question, but with no answers:

multiple cores, solr.xml and replication

2010-10-21 Thread didier deshommes
Hi there, I noticed that the java-based replication does not make replication of multiple core automatic. For example, if I have a master with 7 cores, any slave I set up has to explicitly know about each of the 7 cores to be able to replicate them. This information is stored in solr.xml, and

Re: how well does multicore scale?

2010-10-21 Thread Jonathan Rochkind
No, it does not seem reasonable. Why do you think you need a seperate core for every user? mike anderson wrote: I'm exploring the possibility of using cores as a solution to bookmark folders in my solr application. This would mean I'll need tens of thousands of cores... does this seem

Re: multiple cores, solr.xml and replication

2010-10-21 Thread Shawn Heisey
On 10/21/2010 1:42 PM, didier deshommes wrote: I noticed that the java-based replication does not make replication of multiple core automatic. For example, if I have a master with 7 cores, any slave I set up has to explicitly know about each of the 7 cores to be able to replicate them. This

Re: multiple cores, solr.xml and replication

2010-10-21 Thread didier deshommes
On Thu, Oct 21, 2010 at 3:00 PM, Shawn Heisey s...@elyograg.org wrote: On 10/21/2010 1:42 PM, didier deshommes wrote: I noticed that the java-based replication does not make replication of multiple core automatic. For example, if I have a master with 7 cores, any slave I set up has to

OutOfMemory and auto-commit

2010-10-21 Thread Jonathan Rochkind
If I do _not_ have any auto-commit enabled, and add 500k documents and commit at end, no problem. If I instead set auto-commit maxDocs to 10 (pretty large number), and try to add 500k docs, with autocommits theoretically happening every 100k... I run into an OutOfMemory error. Can

Re: A bug in ComplexPhraseQuery ?

2010-10-21 Thread Ahmet Arslan
--- On Thu, 10/21/10, jmr jmpala...@free.fr wrote: From: jmr jmpala...@free.fr Subject: A bug in ComplexPhraseQuery ? To: solr-user@lucene.apache.org Date: Thursday, October 21, 2010, 12:53 PM Hi, We have installed ComplexPhraseQuery and since that we can see strange behaviour in

Re: Multiple Similarity

2010-10-21 Thread Ahmet Arslan
Is it possible to define different Similarity classes for different fields? No. See http://search-lucene.com/m/g9cVf23EQO11/ We have a use case where we are interested in avoid term frequency (tf) when our fields are multiValued. May be omitTermFreqAndPositions=true?

Re: multiple cores, solr.xml and replication

2010-10-21 Thread Shawn Heisey
On 10/21/2010 2:14 PM, didier deshommes wrote: I use this configuration too but doesn't this assume that solr.xml is the same in master and slave? what happens when master creates a new core? That's a very good question, one that I can't answer. I don't dynamically create new cores. If you

Re: different results depending on result format

2010-10-21 Thread Mike Sokolov
quick follow-up: I also notice that the query from solrj gets version=1, whereas the admin webapp puts version=2.2 on the query string, although this param doesn't seem to change the xml results at all. Does this indicate an older version of solrj perhaps? -Mike On 10/21/2010 04:47 PM, Mike

Solr sorting problem

2010-10-21 Thread Moazzam Khan
Hey guys, I have a list of people indexed in Solr. I am trying to sort by their first names but I keep getting results that are not alphabetically sorted (I see the names starting with W before the names starting with A). I have a feeling that the results are first being sorted by relevancy then

Re: Solr sorting problem

2010-10-21 Thread Jayendra Patil
need additional information . Sorting is easy in Solr just by passing the sort parameter However, when it comes to text sorting it depends on how you analyse and tokenize your fields Sorting does not work on fields with multiple tokens.

Strange file name after installing solr

2010-10-21 Thread Bac Hoang
apache-solr-1.4.1Hello folks, I'm very new user to solr. Please help What I have in hand: 1) apache-solr-1.4.1; 2) Geronimo After installing solr.war using Geronimo administration GUI, I got a strange file, under the

Re: OutOfMemory and auto-commit

2010-10-21 Thread Lance Norskog
Yes. Indexing activity suspends until the commit finishes, then starts. Having both queries and indexing on the same Solr will have this memory problem. Lance On Thu, Oct 21, 2010 at 1:16 PM, Jonathan Rochkind rochk...@jhu.edu wrote: If I do _not_ have any auto-commit enabled, and add 500k

Re: how can i use solrj binary format for indexing?

2010-10-21 Thread Jason, Kim
Hi Gora, I really appreciate. Your reply was a great help to me. :) I hope everything is fine with you. Regards, Jason Gora Mohanty-3 wrote: On Mon, Oct 18, 2010 at 8:22 PM, Jason, Kim hialo...@gmail.com wrote: Sorry for the delay in replying. Was caught up in various things this

Re: Using a custom repository to store solr index files

2010-10-21 Thread Tharindu Mathew
Thanks for your answer Upayavira. Appreciate it. I want to do this because of a clustering requirement. When clustering takes place in the product I'm working on the custom repository we use replicates accordingly and makes data available to all nodes. But if this is available on the file system

Re: how well does multicore scale?

2010-10-21 Thread Tharindu Mathew
Hi Mike, I've also considered using a separate cores in a multi tenant application, ie a separate core for each tenant/domain. But the cores do not suit that purpose. If you check out documentation no real API support exists for this so it can be done dynamically through SolrJ. And all use cases

Re: A bug in ComplexPhraseQuery ?

2010-10-21 Thread jmr
iorixxx wrote: ComplexPhraseQuery is ordered phrase query where default Lucene's PhraseQuery is unordered. With ComplexPhrase order or terms are important. Thanks for your answer. With this request: (text:(protein digest~50)) || (text:(digest protein~50)) I get my 6 documents. In my