Re: DataImport

2008-06-11 Thread Shalin Shekhar Mangar
Hi Mihails, The solr home is a directory which contains the conf/ and data/ folders. The conf folder contains solrconfig.xml, schema.xml and other such configuration files. The data/ folder contains the index files. Other than adding the war file to tomcat, you also need to designate a certain

Re: DataImport

2008-06-11 Thread Mihails Agafonovs
I'm stuck... I now have /tomcat5.5/webapps/solr (exploded solr.war), /tomcat5.5/webapps/solr/solr-example/. I've ran export JAVA_OPTS=$JAVA_OPTS-Dsolr.solr.home=/usr/share/tomcat5.5/webapps/solr/example/solr/ to make /example/solr/ as a home directory. What am I doing wrong? Quoting Shalin

Re: DataImportHandler : How to mix XPathEntityProcessor and TemplateTransformer

2008-06-11 Thread Nicolas Pastorino
Thanks a million for your time and help. It indeed works smoothly now. I also, by the way, had to apply the patch attached to the following message : http://www.nabble.com/Re%3A-How-to-describe-2-entities-in-dataConfig- for-the-DataImporter--p17577610.html in order to have the

Re: DataImportHandler : How to mix XPathEntityProcessor and TemplateTransformer

2008-06-11 Thread Noble Paul നോബിള്‍ नोब्ळ्
We are cutting a a patch which incorporates all the recent bug fixes, so that you guys do not have to apply patches over patches --Noble On Wed, Jun 11, 2008 at 3:49 PM, Nicolas Pastorino [EMAIL PROTECTED] wrote: Thanks a million for your time and help. It indeed works smoothly now. I also,

Re: Problem with add a XML

2008-06-11 Thread Grant Ingersoll
ESET NOD32 Antivirus, Signaturdatenbank- Version 3175 (20080611) __ E-Mail wurde geprüft mit ESET NOD32 Antivirus. http://www.eset.com -- Grant Ingersoll http://www.lucidimagination.com Lucene Helpful Hints: http://wiki.apache.org/lucene-java/BasicsOfPerformance

Re: searching only within allowed documents

2008-06-11 Thread Geoffrey Young
Solr allows you to specify filters in separate parameters that are applied to the main query, but cached separately. q=the user queryfq=folder:f13fq=folder:f24 I've been wanting more explanation around this for a while, so maybe now is a good time to ask :) the cached separately verbiage

Re: DataImport

2008-06-11 Thread Shalin Shekhar Mangar
Ok, let's start again from scratch with a clean Tomcat installation. 1. Download example-solr-home.jar from the wiki and extract it to a local folder for example to /home/your_username/ 2. You will now see a folder called example-solr-home where you extracted the jar file in the above step 3.

Re: DataImport

2008-06-11 Thread Mihails Agafonovs
Exception in Lucene Index Updater. Anyway, for some reasons I'm able to start Solr only using its own Jetty. Everything else works fine on my Tomcat, except Solr. Quoting Shalin Shekhar Mangar : Ok, let's start again from scratch with a clean Tomcat installation. 1. Download

range query highlighting

2008-06-11 Thread Stefan Oestreicher
Hi, I'm using solr built from trunk and highlighting for range queries doesn't work. If I search for 2008 everything works as expected but if I search for [2000 TO 2008] nothing gets highlighted. The field I'm searching on is a TextField and I've confirmed that the query and index analyzers are

RE: [jira] Updated: (SOLR-469) Data Import RequestHandler

2008-06-11 Thread Julio Castillo
Shalin, Thanks for consolidating the patch. Any idea, when the dB Import request handler will be part of the nightly build? Thanks again ** julio -Original Message- From: Shalin Shekhar Mangar (JIRA) [mailto:[EMAIL PROTECTED] Sent: Wednesday, June 11, 2008 8:43 AM To: [EMAIL

Re: [jira] Updated: (SOLR-469) Data Import RequestHandler

2008-06-11 Thread Shalin Shekhar Mangar
Hi Julio, That was fast! I just uploaded a patch :) Actually, it is waiting on SOLR-563 ( http://issues.apache.org/jira/browse/SOLR-563) which deals with modifying the build scripts to create a contrib project area in Solr. I'm planning to work on that this week. Once that is done, it would be

Re: range query highlighting

2008-06-11 Thread Yonik Seeley
It's a known deficiency... ConstantScoreRangeQuery and ConstantScorePrefixQuery which Solr uses rewrite to a ConstantScoreQuery and don't expose the terms they match. Performance-wise it seems like a bad idea if the number of terms matched is large (esp when used in a MultiSearcher or later in

CSV output

2008-06-11 Thread Marshall Weir
Hi, Does SOLR have .csv output? I can find references to .csv input, but not output. Thank you, Marshall

Re: CSV output

2008-06-11 Thread Otis Gospodnetic
Hi Marshall, I don't think there is a CSV Writer, but here are some pointers for writing one: $ ff \*Writer\*java | grep -v Test | grep request ./src/java/org/apache/solr/request/PHPResponseWriter.java ./src/java/org/apache/solr/request/XSLTResponseWriter.java

Re: CSV output

2008-06-11 Thread Walter Underwood
I recommend using the OpenCSV package. Works fine, Apache 2.0 license. http://opencsv.sourceforge.net/ wunder On 6/11/08 10:00 AM, Otis Gospodnetic [EMAIL PROTECTED] wrote: Hi Marshall, I don't think there is a CSV Writer, but here are some pointers for writing one: $ ff \*Writer\*java

Question about fieldNorm

2008-06-11 Thread Brendan Grainger
Hi, I've just changed the stemming algorithm slightly and am running a few tests against the old stemmer versus the new stemmer. I did a query for 'hanger' and using the old stemmer I get the following scoring for a document with the title: Converter Hanger Assembly Replacement 6.4242806

Searching for words with accented characters.

2008-06-11 Thread Robert Haschart
We are using Solr as the search engine for our public access library catalog. In testing I did a search for a French movie that I know is in the catalog named: Kirikou et la sorcière and nothing was returned. If I search for just the work Kirikou several results are returned, and the

Re: CSV output

2008-06-11 Thread Brendan Grainger
When I was asked for something similar I quickly cobbled together a stylesheet (I'm no xsl expert so it's probably pretty bad). Invoked like this: http://localhost:8982/solr/select?q=testingfl=id,title_t,scorewt=xslttr=csv.xslrows=10 YMMV, but feel free to use it if it helps, I've attached

Re: Question about fieldNorm

2008-06-11 Thread Yonik Seeley
That is strange... did you re-index or change the index? If so, you might want to verify that docid=3454 still corresponds to the same document you queried earlier. -Yonik On Wed, Jun 11, 2008 at 1:09 PM, Brendan Grainger [EMAIL PROTECTED] wrote: I've just changed the stemming algorithm

Re: Question about fieldNorm

2008-06-11 Thread Brendan Grainger
Hi Yonik, Yes I did rebuild the index and they are the same document (just verified). The only thing that changed was the stemmer, but that makes no sense to me. Also, if the equation for the fieldNorm is: fieldBoost * lengthNorm = fieldBoost * 1 /sqrt(numTermsForField) Then that would

Re: Ignore fields in XML response

2008-06-11 Thread Shalin Shekhar Mangar
Sure, use the fl parameter to specify the fields that you want (comma-separated) On Wed, Jun 11, 2008 at 11:31 PM, Yves Zoundi [EMAIL PROTECTED] wrote: Hi guys, Is it possible to remove some fields from the XML response? I have a field which can contains a huge amount of data

Re: Ignore fields in XML response

2008-06-11 Thread Erik Hatcher
Yves - you can control which fields are returned from a search using the fl (field list) parameter. fl=* provides all fields except score. fl=id,title,score provides only those selected fields, etc. Erik On Jun 11, 2008, at 2:01 PM, Yves Zoundi wrote: Hi guys, Is

Re: Question about fieldNorm

2008-06-11 Thread Brendan Grainger
Hi Yonik, I just realized that the stemmer does make a difference because of synonyms. So on indexing using the new stemmer converter hanger assembly replacement gets expanded to: converter hanger assembly assemble replacement so there are 5 terms which gets a length norm of 0.4472136

RE: Ignore fields in XML response

2008-06-11 Thread Yves Zoundi
Thank you guys! -Message d'origine- De : Erik Hatcher [mailto:[EMAIL PROTECTED] Envoyé : 11 juin 2008 14:07 À : solr-user@lucene.apache.org Objet : Re: Ignore fields in XML response Yves - you can control which fields are returned from a search using the fl (field list) parameter.

Re: Question about fieldNorm

2008-06-11 Thread Yonik Seeley
Field norms have limited precision (it's encoded as an 8 bit float) so you are probably seeing rounding. -Yonik On Wed, Jun 11, 2008 at 2:13 PM, Brendan Grainger [EMAIL PROTECTED] wrote: Hi Yonik, I just realized that the stemmer does make a difference because of synonyms. So on indexing

Re: Question about fieldNorm

2008-06-11 Thread Brendan Grainger
Thanks so much, that explains it. Brendan On Jun 11, 2008, at 4:00 PM, Yonik Seeley wrote: Field norms have limited precision (it's encoded as an 8 bit float) so you are probably seeing rounding. -Yonik On Wed, Jun 11, 2008 at 2:13 PM, Brendan Grainger [EMAIL PROTECTED] wrote: Hi Yonik, I

Re: Searching for words with accented characters.

2008-06-11 Thread solrtom
Hi Robert, Did you rebuild the index after changing your config? The index time analyzer is only applied when a document is indexed, changing it has no effect on already indexed documents. Tom Robert Haschart wrote: We are using Solr as the search engine for our public access library

synonym token types and ranking

2008-06-11 Thread Uri Boness
Hi, I've noticed that currently the SynonymFilter replaces the original token with the configured tokens list (which includes the original matched token) and each one of these tokens is of type word. Wouldn't it make more sense to only mark the original token as type word and the the other

Strategy for presenting fresh data

2008-06-11 Thread James Brady
Hi, The product I'm working on requires new documents to be searchable very quickly (inside 60 seconds is my goal). The corpus is also going to grow very large, although it is perfectly partitionable by user. The approach I tried first was to have write-only masters and read- only slaves

Re: synonym token types and ranking

2008-06-11 Thread Otis Gospodnetic
Hi Uri, Yes, I think that would make sense (word vs. synonym token types). Custom boosting/weighting of original token vs. synonym token(s) also makes sense. Is this something you can provide a patch for? Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original

Re: Strategy for presenting fresh data

2008-06-11 Thread Otis Gospodnetic
Hi James, Yes, this makes sense. I've recommended doing the same to others before. It would be good to have this be a part of Solr. There is one person (named Jason) working on adding more real-time search support to both Lucene and Solr. Otis -- Sematext -- http://sematext.com/ -- Lucene

Re: searching only within allowed documents

2008-06-11 Thread climbingrose
It depends on your query. The second query is better if you know that fieldb:bar filtered query will be reused often since it will be cached separately from the query. The first query occuppies one cache entry while the second one occuppies two cache entries, one in queryCache and one in

Re: searching only within allowed documents

2008-06-11 Thread climbingrose
Just correct myself, in the last setence, the first query is better if fieldb:bar isn't reused often On Thu, Jun 12, 2008 at 2:02 PM, climbingrose [EMAIL PROTECTED] wrote: It depends on your query. The second query is better if you know that fieldb:bar filtered query will be reused often since

Re: Strategy for presenting fresh data

2008-06-11 Thread rohit arora
Hi, I am new to Solr Lucene I have only one defaule core i am working on creating multiple core. Can you help me in this matter. with regards nbsp;Rohit Arora --- On Thu, 6/12/08, James Brady lt;[EMAIL PROTECTED]gt; wrote: From: James Brady lt;[EMAIL PROTECTED]gt; Subject: Strategy for

DataImportHandler questions ..

2008-06-11 Thread Neville Burnell
Hi, I'm playing with the Solr Data Import Handler, and everything looks great so far! Hopefully we will be able to replace our homegrown ODBC indexing service [using camping+ferret] with Solr! The wiki page mentions scheduling full imports and delta imports but I couldn't find any further