Re: Solr replication, HAproxy and data management

2010-12-13 Thread Paolo Castagna
Paolo Castagna wrote: Hi, we are using Solr v1.4.x with multi-cores and a master/slaves configuration. We also use HAProxy [1] to load balance search requests amongst slaves. Finally, we use MapReduce to create new Solr indexes. I'd like to share with you what I am doing when I need to: 1

Solr replication, HAproxy and data management

2010-12-09 Thread Paolo Castagna
Hi, we are using Solr v1.4.x with multi-cores and a master/slaves configuration. We also use HAProxy [1] to load balance search requests amongst slaves. Finally, we use MapReduce to create new Solr indexes. I'd like to share with you what I am doing when I need to: 1. add a new index 2.

Re: EmbeddedSolrServer with one core and schema.xml loaded via ClassLoader, is it possible?

2010-10-25 Thread Paolo Castagna
? Is there a better way? Paolo Paolo Castagna wrote: Hi, I am trying to use EmbeddedSolrServer with just one core and I'd like to load solrconfig.xml, schema.xml and other configuration files from a jar via getResourceAsStream(...). I've tried to use SolrResourceLoader, but all my attempts failed

EmbeddedSolrServer with one core and schema.xml loaded via ClassLoader, is it possible?

2010-10-20 Thread Paolo Castagna
Hi, I am trying to use EmbeddedSolrServer with just one core and I'd like to load solrconfig.xml, schema.xml and other configuration files from a jar via getResourceAsStream(...). I've tried to use SolrResourceLoader, but all my attempts failed with a RuntimeException: Can't find resource [...].

Faceting and omitNorms=true

2010-10-12 Thread Paolo Castagna
Hi, I am not completely sure on what's the recommended setting for fields used for faceting regarding omitNorms and potitionIncrementGap. Should I used omitNorms=true? What about positionIncrementGap? At the moment I have this in my schema.xml: dynamicField name=*_facet type=keyword

Re: Faceting and omitNorms=true

2010-10-12 Thread Paolo Castagna
Paolo Castagna wrote: Hi, I am not completely sure on what's the recommended setting for fields used for faceting regarding omitNorms and potitionIncrementGap. Should I used omitNorms=true? What about positionIncrementGap? At the moment I have this in my schema.xml: dynamicField name=*_facet

facet.method: enum vs. fc

2010-10-11 Thread Paolo Castagna
Hi, I am using Solr v1.4 and I am not sure which facet.method I should use. What should I use if I do not know in advance if the number of values for a given field will be high or low? What are the pros/cons of using facet.method=enum vs. facet.method=fc? When should I use enum vs. fc? I have

Re: facet.method: enum vs. fc

2010-10-11 Thread Paolo Castagna
to need to finely tune your speed. HTH Erick On Mon, Oct 11, 2010 at 11:30 AM, Paolo Castagna castagna.li...@googlemail.com wrote: Hi, I am using Solr v1.4 and I am not sure which facet.method I should use. What should I use if I do not know in advance if the number of values for a given field

SolrQuery and escaping special characters

2010-06-18 Thread Paolo Castagna
Hi, I am using Solr v1.4 and SolrJ on the client side. I am not sure how SolrJ behaves regarding escaping special characters [1] in a query string. SolrJ does URL encoding of the query string it sends to Solr. Do I need to escape special characters [1] when I construct a SolrQuery object or

Re: Can I use per field analyzers and dynamic fields?

2010-05-13 Thread Paolo Castagna
Chris Hostetter wrote: : However, I'd like to hear a comment on the approach of doing the parsing : using Lucene and then constructing a SolrQuery from a Lucene Query: I believe you are asking about doing this in the client code? using the Lucene QueryParser to parse a string using an

Re: Can I use per field analyzers and dynamic fields?

2010-05-09 Thread Paolo Castagna
Hi, thank you for your reply. What you suggested is a good idea and I am probably going to follow it. However, I'd like to hear a comment on the approach of doing the parsing using Lucene and then constructing a SolrQuery from a Lucene Query: QueryParser parser = new QueryParser(, analyzer);

Can I use per field analyzers and dynamic fields?

2010-05-05 Thread Paolo Castagna
Hi, I have an existing Lucene application which I want to port to Solr. A scenario I need to support requires me to use dynamic fields with Solr, since users can add new fields at runtime. At the same time, the existing Lucene application is using a PerFieldAnalyzerWrapper in order to use

Re: Can I use per field analyzers and dynamic fields?

2010-05-05 Thread Paolo Castagna
type, so you'll want to use different dynamic field patterns for different types of fields.        Erik On May 5, 2010, at 9:14 AM, Paolo Castagna wrote: Hi, I have an existing Lucene application which I want to port to Solr. A scenario I need to support requires me to use dynamic fields

Re: LucidWorks Solr

2010-04-18 Thread Paolo Castagna
Thanks for asking, I am interested as well in reading the response to your questions. Paolo Andy wrote: Just wanted to know if anyone has used LucidWorks Solr. - How do you compare it to the standard Apache Solr? - the non-blocking IO of LucidWorks Solr -- is that for networking IO or disk

Re: Benchmarking Solr

2010-04-12 Thread Paolo Castagna
Hi, I do not have an answer to your questions. But, I have the same issue/problem you have. It would be good if Solr community would agree and share their approach for benchmarking Solr. Indeed, it would be good to have a benchmark for information retrieval systems. AFIK there isn't one. :-/

Re: Benchmarking Solr

2010-04-12 Thread Paolo Castagna
Paolo Castagna wrote: I do not have an answer to your questions. But, I have the same issue/problem you have. Some related threads: - http://markmail.org/message/pns4dtfvt54mu3vs - http://markmail.org/message/7on6lvabsosvj7bc - http://markmail.org/message/ftz7tkd7ekhnk4bc - http

Re: Benchmarking Solr

2010-04-12 Thread Paolo Castagna
Shawn Heisey wrote: Anyone got a recommendation about where to put it on the wiki? There are already two related pages: - http://wiki.apache.org/solr/SolrPerformanceFactors - http://wiki.apache.org/solr/SolrPerformanceData Why not to create a new page? -

Re: Searching Lucene Indexes with Solr

2010-04-07 Thread Paolo Castagna
Erick Erickson wrote: It is possible but you have to take care to match Solr's schema with the structure of documents in the Lucene index. The correct field names and query-analyzers should be configured in schema.xml Is it possible to use Solr v1.4 together with a legacy Lucene (v2.1.0 and/or

Is it possible to have Lucene and Solr (or two Solr instances) pointing at the same index directory?

2010-04-07 Thread Paolo Castagna
Hi, (I know that this is probably not recommended and not a common scenario, but...) Is it possible to have an application using Lucene and a separate (i.e. different JVM) instance of Solr both pointing at the same index and read/write to the index from both applications? I am trying

Re: solr best practice to submit many documents

2010-04-07 Thread Paolo Castagna
Hi Brian, I had similar questions when I begun to try and evaluate Solr. If you use Java and SolrJ you might find these useful: - http://wiki.apache.org/solr/Solrj#Streaming_documents_for_an_update -

Backup/restore strategies for Solr cores and legacy Lucene applications

2010-04-06 Thread Paolo Castagna
Hi, I have an existing web application which is using Lucene (v2.1.0 and/or v2.4.x) and which I'd like to gradually migrate to Solr. I am already using multiple cores, master/slave replication and SolrJ to re-implement current functionalities. One use case I have is: backup/restore indexes. I