Using Solr to build a product matcher, with learning to rank

2018-03-28 Thread Xavier Schepler
regards, Xavier Schepler

Tomcat6 and Log4j

2011-02-10 Thread Xavier Schepler
Hi, I added “slf4j-log4j12-1.5.5.jar” and “log4j-1.2.15.jar” to $CATALINA_HOME/webapps/solr/WEB-INF/lib, then deleted the library “slf4j-jdk14-1.5.5.jar” from $CATALINA_HOME/webapps/solr/WEB-INF/lib, then created a directory $CATALINA_HOME/webapps/solr/WEB-INF/classes and created

Re: Tomcat6 and Log4j

2011-02-10 Thread Xavier SCHEPLER
with -Dlog4j.configuration=$CATALINA_HOME/webapps/solr/WEB-INF/classes/log4j.properties -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com On 10. feb. 2011, at 09.41, Xavier Schepler wrote: Hi, I added “slf4j-log4j12-1.5.5.jar” and “log4j-1.2.15.jar” to $CATALINA_HOME/webapps

Re: Tomcat6 and Log4j

2011-02-10 Thread Xavier SCHEPLER
I added it to /etc/default/tomcat6. What happened is that the same error message appeared twice in /var/log/tomcat6/catalina.out, as if the same file had been loaded twice.

Re: Tomcat6 and Log4j

2011-02-10 Thread Xavier SCHEPLER
Yes, thanks. This works fine:
log4j.rootLogger=INFO, SOLR
log4j.appender.SOLR=org.apache.log4j.DailyRollingFileAppender
log4j.appender.SOLR.file=/home/quetelet_bdq/logs/bdq.log
log4j.appender.SOLR.datePattern='.'-MM-dd
log4j.appender.SOLR.layout=org.apache.log4j.PatternLayout

Re: Local param tag voodoo ?

2011-01-20 Thread Xavier SCHEPLER
the hints in your last message, I suspect nested queries _might_ be helpful to you. On 1/19/2011 3:46 AM, Xavier SCHEPLER wrote: OK, I was already at this point. My faceting system uses exactly what is described on this page. I read about it in the Solr 1.4 book. Otherwise I wouldn't ask

Re: Local param tag voodoo ?

2011-01-20 Thread Xavier SCHEPLER
Since there seems to be no voodoo available, I did it on the client side. I send a first request to get the facets and a second to get the documents and their highlighting. It works well but requires more processing. From: Xavier SCHEPLER xavier.schep

Re: Local param tag voodoo ?

2011-01-19 Thread Xavier SCHEPLER
You're right, the second query didn't result in an error, but it didn't give the expected result either. I'm going to have a look at the link you gave me. Thanks! From: Markus Jelsma markus.jel...@openindex.io Sent: Tue Jan 18 21:31:52 CET 2011 To:

Re: Local param tag voodoo ?

2011-01-19 Thread Xavier SCHEPLER
OK, I was already at this point. My faceting system uses exactly what is described on this page. I read about it in the Solr 1.4 book. Otherwise I wouldn't ask. The problem is that filter queries don't affect the relevance score of the results, so I want the terms in the main query.

Local param tag voodoo ?

2011-01-18 Thread Xavier Schepler
Hey, here are my needs: - a query that has tagged and untagged contents - facets that ignore the tagged contents. I tried: q=({!tag=toExclude} ignored) taken into account, and q={tag=toExclude v='ignored'} take into account. Both resulted in an error. Is this possible or do I have to try another

Solr boolean operators

2011-01-13 Thread Xavier Schepler
Hi, with the Lucene query syntax, is : a AND (a OR b) equivalent to : a (absorption) ?

Re: Solr boolean operators

2011-01-13 Thread Xavier SCHEPLER
by your query ... it's the same. In terms of the score of the results, no: if you are using the first query, the documents that match both the a and the b terms will score higher than the ones matching just the a term. On Thu, Jan 13, 2011 at 3:29 PM, Xavier Schepler
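
The matching half of that answer can be checked with a small truth table. This is only a sketch of the boolean logic, written in Python; the function names are illustrative and it says nothing about how Lucene scores the two queries.

```python
from itertools import product


def matches_long(a: bool, b: bool) -> bool:
    """Boolean matching for the query: a AND (a OR b)."""
    return a and (a or b)


def matches_short(a: bool, b: bool) -> bool:
    """Boolean matching for the query: a."""
    return a


# Compare both forms over every combination of "document contains a / b".
absorption_holds = all(
    matches_long(a, b) == matches_short(a, b)
    for a, b in product([False, True], repeat=2)
)
print(absorption_holds)
```

Both forms select the same documents, which is the absorption law; the ranking difference described in the reply comes from the extra optional clause and is not modelled here.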

Re: No response from Solr on complex request after several days

2010-10-29 Thread Xavier Schepler
to do it, but you can automatically restart it once a day. On Thu, Oct 28, 2010 at 2:20 AM, Xavier Schepler xavier.schep...@sciences-po.fr wrote: Hi, We are in a beta testing phase, with several users a day. After several days of waiting, the solr server didn't respond to requests

No response from Solr on complex request after several days

2010-10-28 Thread Xavier Schepler
Hi, We are in a beta testing phase, with several users a day. After several days of waiting, the solr server didn't respond to requests that require a lot of processing time. I'm using Solr inside Tomcat. This is the request that had no response from the server :

No response from Solr on complex request (real issue explained)

2010-10-28 Thread Xavier Schepler
Hi, We are in a beta testing phase, with several users a day. After several days of running well, the solr server stopped responding to requests that require a lot of processing time, like this one :

More like this and terms positions

2010-10-04 Thread Xavier Schepler
Hi, does the More Like This search use term position information in the score formula?

Re: More like this and terms positions

2010-10-04 Thread Xavier Schepler
On 04/10/2010 16:40, Robert Muir wrote: On Mon, Oct 4, 2010 at 10:16 AM, Xavier Schepler xavier.schep...@sciences-po.fr wrote: Hi, does the More Like This search use term position information in the score formula? no, it would be nice if it did use them though (based upon

Phrase search + multi-word index time expanded synonym

2010-09-08 Thread Xavier Schepler
the results. How can I have both queries working? I'm thinking about different configurations but I haven't found any solution so far. Thanks for reading, Xavier Schepler

Re: Phrase search + multi-word index time expanded synonym

2010-09-08 Thread Xavier Schepler
On 08/09/2010 12:21, Grijesh.singh wrote: Look at analysis.jsp with verbose debug output to see what happens during analysis at index time and at search time with your data. You can also use debugQuery=on to see what the parsed query actually is. - Grijesh I've found a first solution by myself,

spellcheck distance measure algorithms error ?

2010-09-03 Thread Xavier Schepler
Hi, when I take the two letters from the middle of a word and swap them, e.g. jospin = jopsin, I don't get any suggestion from the spellchecker component. I tried the default algorithm and the Jaro Winkler Distance, with a

Re: spellcheck distance measure algorithms error ?

2010-09-03 Thread Xavier Schepler
On 03/09/2010 15:31, Grant Ingersoll wrote: On Sep 3, 2010, at 9:14 AM, Xavier Schepler wrote: On 03/09/2010 14:47, Grant Ingersoll wrote: On Sep 3, 2010, at 6:02 AM, Xavier Schepler wrote: no, jopsin isn't in the index. I tried this with other words and I had the same
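
For readers following this thread, it may help to see how a plain edit distance treats a swap of two adjacent letters. The sketch below is a textbook Levenshtein implementation in Python, not the code used by the Solr spellchecker, and it does not diagnose the problem reported above; it only illustrates that a transposition counts as two single-character edits when swaps are not a primitive operation.

```python
def levenshtein(s: str, t: str) -> int:
    """Classic dynamic-programming edit distance (insert, delete, substitute)."""
    prev = list(range(len(t) + 1))  # distances from the empty prefix of s
    for i, cs in enumerate(s, start=1):
        curr = [i]  # distance from s[:i] to the empty prefix of t
        for j, ct in enumerate(t, start=1):
            cost = 0 if cs == ct else 1
            curr.append(min(
                prev[j] + 1,         # delete cs
                curr[j - 1] + 1,     # insert ct
                prev[j - 1] + cost,  # substitute, or keep when equal
            ))
        prev = curr
    return prev[-1]


distance = levenshtein("jospin", "jopsin")
print(distance)
```

Whether a distance of this size passes a given similarity threshold depends on the spellchecker configuration, which the truncated messages do not show.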

Proximity search + Highlighting

2010-09-01 Thread Xavier Schepler
Hi, can the highlighting component highlight terms only if the distance between them matches the query? I use these parameters: hl=on&hl.fl=qFR,iFR,mFR,vlFR&hl.usePhraseHighlighter=false&hl.highlightMultiTerm=true&hl.simple.pre=<b>&hl.simple.post=<%2Fb>&hl.mergeContiguous=false
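
Because the separators and markup in such a parameter list are easy to lose in transit, a client can build the query string programmatically instead of by hand. The following is a minimal Python sketch using only the standard library; the `q` value is a made-up proximity query, and the `<b>` / `</b>` markers are an assumption about what the message originally contained.

```python
from urllib.parse import urlencode

# Highlighting parameters as named in the message above.
params = {
    "q": '"parti socialiste"~5',  # hypothetical proximity query, not from the thread
    "hl": "on",
    "hl.fl": "qFR,iFR,mFR,vlFR",
    "hl.usePhraseHighlighter": "false",
    "hl.highlightMultiTerm": "true",
    "hl.simple.pre": "<b>",    # assumed opening marker
    "hl.simple.post": "</b>",  # assumed closing marker
    "hl.mergeContiguous": "false",
}

# urlencode joins the pairs with "&" and percent-encodes reserved characters.
query_string = urlencode(params)
print(query_string)
```

The resulting string can be appended to the select handler URL of the Solr instance being queried.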

Re: Proximity search + Highlighting

2010-09-01 Thread Xavier Schepler
On 01/09/2010 12:38, Markus Jelsma wrote: I think you need to enable usePhraseHighlighter in order to use the highlightMultiTerm parameter. On Wednesday 01 September 2010 12:12:11 Xavier Schepler wrote: Hi, can the highlighting component highlight terms only if the distance between them

Re: Proximity search + Highlighting

2010-09-01 Thread Xavier Schepler
On 01/09/2010 13:54, Xavier Schepler wrote: On 01/09/2010 12:38, Markus Jelsma wrote: I think you need to enable usePhraseHighlighter in order to use the highlightMultiTerm parameter. On Wednesday 01 September 2010 12:12:11 Xavier Schepler wrote: Hi, can the highlighting component

Re: Highlighting, return the matched terms only

2010-08-31 Thread Xavier Schepler
Chris Hostetter wrote: : how could I have the highlighting component return only the terms that were : matched, without any surrounding text ? I'm not a Highlighter expert, but this is something that certainly *sounds* like it should be easy. I took a shot at it and this is the best I could

Expanded Synonyms + phrase search

2010-08-30 Thread Xavier Schepler
Hi, several documents from my index contain the phrase "PS et". However, PS is expanded to "parti socialiste", and a phrase search for "PS et" fails. A phrase search for "parti socialiste et" succeeds. Can I have both queries working? Here's the field type: <fieldtype name="SyFR"

Highlighting, return the matched terms only

2010-08-03 Thread Xavier Schepler
Hi, how could I have the highlighting component return only the terms that were matched, without any surrounding text ?

Multi word synonyms + highlighting

2010-06-04 Thread Xavier Schepler
Hi, here's a field type using synonyms:
<fieldtype name="SFR" class="solr.TextField">
  <analyzer type="index">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.StandardFilterFactory"/>
    <filter class="solr.SynonymFilterFactory" synonyms="french-synonyms.txt" ignoreCase="true" expand="true"/>

Targeting two fields with the same query or one field gathering contents from both ?

2010-05-17 Thread Xavier Schepler
Hey, let's say I have: - a field named A with specific contents - a field named B with specific contents - a field named C with contents only from A and B, added with copyField. Are these queries equivalent in terms of performance: - A: (the lazy fox) AND B: (the lazy fox) - C: (the

Re: Targeting two fields with the same query or one field gathering contents from both ?

2010-05-17 Thread Xavier Schepler
On 17/05/2010 16:57, Xavier Schepler wrote: Hey, let's say I have: - a field named A with specific contents - a field named B with specific contents - a field named C with contents only from A and B, added with copyField. Are these queries equivalent in terms of performance

Re: Targeting two fields with the same query or one field gathering contents from both ?

2010-05-17 Thread Xavier Schepler
On 17/05/2010 17:49, Marco Martinez wrote: No, the equivalent for this will be: - A: (the lazy fox) *OR* B: (the lazy fox) - C: (the lazy fox). Imagine the situation where B does not contain 'the lazy fox': with the AND you get 0 results, although you have 'the lazy fox' in A and C. Marco

What hardware do I need ?

2010-04-23 Thread Xavier Schepler
Hi, I'm working with Solr 1.4. My schema has about 50 fields. I'm using full text search in short strings (~ 30-100 terms) and faceted search. My index will have 100 000 documents. The number of requests per second will be low. Let's say between 0 and 1000 because of auto-complete. Is a

Re: What hardware do I need ?

2010-04-23 Thread Xavier Schepler
On 23/04/2010 17:08, Otis Gospodnetic wrote: Xavier, 0-1000 QPS is a pretty wide range. Plus, it depends on how good your auto-complete is, which depends on types of queries it issues, among other things. 100K short docs is small, so that will all fit in RAM nicely, assuming those other

More like this - setting a minimum number of terms used to build queries

2010-03-29 Thread Xavier Schepler
Hey, is there a way to make the More Like This feature build its queries from a minimum number of interesting terms? It looks like this component fires queries with only one term in them. I get a lot of results that aren't similar at all to the parsed document fields. My parameters:

Highlighting inside a field with HTML contents

2010-02-22 Thread Xavier Schepler
Hello, this field would not be searched, but it would be used to display results. A query could be: q=table&hl=true&hl.fl=htmlfield&hl.fragsize=0. It would be tokenized with the HTMLStripStandardTokenizerFactory, then analyzed the same way as the searchable fields. Could this result in

Re: Need feedback on solr security

2010-02-17 Thread Xavier Schepler
Vijayant Kumar wrote: Hi Group, I need some feedback on Solr security. To make my Solr admin password protected, I used the Path Based Authentication from http://wiki.apache.org/solr/SolrSecurity. In this way my admin area, search, delete and add to index are protected. But now when I make

Re: Need feedback on solr security

2010-02-17 Thread Xavier Schepler
Vijayant Kumar wrote: Hi Xavier, Thanks for your feedback. The firewall rule for trusted IPs is not feasible for us because the application is open to the public, so we cannot work through IP banning. Vijayant Kumar wrote: Hi Group, I need some feedback on Solr security. To make

Re: Need feedback on solr security

2010-02-17 Thread Xavier Schepler
Xavier Schepler wrote: Vijayant Kumar wrote: Hi Xavier, Thanks for your feedback. The firewall rule for trusted IPs is not feasible for us because the application is open to the public, so we cannot work through IP banning. Vijayant Kumar wrote: Hi Group, I need some feedback on Solr

Re: Dynamic fields with more than 100 fields inside

2010-02-09 Thread Xavier Schepler
Shalin Shekhar Mangar wrote: On Mon, Feb 8, 2010 at 9:47 PM, Xavier Schepler xavier.schep...@sciences-po.fr wrote: Hey, I'm thinking about using dynamic fields. I need one or more user-specific fields in my schema, for example, concept_user_*, and I will have maybe more than 200 users

Re: Dynamic fields with more than 100 fields inside

2010-02-09 Thread Xavier Schepler
Shalin Shekhar Mangar wrote: On Tue, Feb 9, 2010 at 2:43 PM, Xavier Schepler xavier.schep...@sciences-po.fr wrote: Shalin Shekhar Mangar wrote: On Mon, Feb 8, 2010 at 9:47 PM, Xavier Schepler xavier.schep...@sciences-po.fr wrote: Hey, I'm thinking about using

Dynamic fields with more than 100 fields inside

2010-02-08 Thread Xavier Schepler
Hey, I'm thinking about using dynamic fields. I need one or more user-specific fields in my schema, for example, concept_user_*, and I will have maybe more than 200 users using this feature. Each user will send and retrieve values from their own field. It will then be used to filter results. How

Field highlighting

2010-01-07 Thread Xavier Schepler
Hi, I'm trying to highlight short text values. The field they come from has a type shared with other fields. I have highlighting working on other fields but not on this one. Why?

Re: Field highlighting

2010-01-07 Thread Xavier Schepler
AM, Xavier Schepler xavier.schep...@sciences-po.fr wrote: Hi, I'm trying to highlight short text values. The field they come from has a type shared with other fields. I have highlighting working on other fields but not on this one. Why? Thanks for your response. Here are some