Re: Scoring of DisMax in Solr

2011-10-05 Thread Bill Bell
This seems like a bug to me. On 10/4/11 6:52 PM, David Ryan help...@gmail.com wrote: Hi, When I examine the score calculation of DisMax in Solr, it looks to me that DisMax is using tf x idf^2 instead of tf x idf. Does anyone have insight why tf x idf is not used here? Here is the score
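For context on the tf x idf^2 observation: in Lucene's classic TF-IDF similarity, the idf factor enters a term's score twice, once through the query weight and once through the field weight, so their product contains idf squared. The sketch below only illustrates that arithmetic. The function names are made up for this example, and the tf and idf formulas are the classic defaults (square root of term frequency, 1 + ln(numDocs / (docFreq + 1))), which may differ from the Similarity configured in a given install.

```python
import math


def tf(freq: float) -> float:
    """Classic default term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def idf(doc_freq: int, num_docs: int) -> float:
    """Classic default inverse document frequency."""
    return 1.0 + math.log(num_docs / (doc_freq + 1))


def term_score(freq, doc_freq, num_docs, boost=1.0, query_norm=1.0, field_norm=1.0):
    """Score contribution of one term: query weight times field weight."""
    i = idf(doc_freq, num_docs)
    query_weight = i * boost * query_norm      # idf appears here ...
    field_weight = tf(freq) * i * field_norm   # ... and again here
    return query_weight * field_weight         # hence tf * idf^2 overall


# With all norms and boosts at 1.0 the result equals tf * idf^2.
example = term_score(freq=4, doc_freq=9, num_docs=1000)
```

Under these assumptions, the squared idf in an explain output would be a property of the underlying similarity rather than something specific to DisMax.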

Re: A simple query?

2011-10-05 Thread tamanjit.bin...@yahoo.co.in
Hi, Set your default operator to OR, i.e. <solrQueryParser defaultOperator="OR"/> in schema.xml. Also keep your fieldType=text, i.e. <field name="myfield" type="text" indexed="true" stored="true"/>, as you would want whitespace tokenization, and try your query with (), i.e. /select/?q=myfields:(a

is there a way to know which mm value was used?

2011-10-05 Thread elisabeth benoit
Hello, I'd like to be able to know programmatically what value mm was set to for one request (to avoid having to parse the query, identify stopwords, calculate mm based on solrconfig.xml). Is there a way to get the mm value in the Solr response? Thanks, Elisabeth

Re: boosting and relevancy options from solr extensibility points -java-

2011-10-05 Thread pravesh
in a certain time period (say christmas) I will promote a doc in christmas keyword You might check the QueryElevation component in SOLR. or based on users interest I will boost a specific category of products. or (I am not sure how can I do this one) I will boost docs that current user's friends

Re: is there a way to know which mm value was used?

2011-10-05 Thread pravesh
You can explicitly pass /mm/ for every search and get it back in your response; otherwise use /debugQuery=true/, which will give you all implicitly used defaults (but you wouldn't want to use this in production). Thanx Pravesh

Re: is there a way to know which mm value was used?

2011-10-05 Thread Em
Hi, since this isn't logged anywhere, as far as I can tell, there are two ways: either you apply mm within your URL call, so that you get the whole mm param back per request and calculate the applied mm with this information (sounds bad), or you recalculate it within your own custom search

Re: Hierarchical faceting with Date

2011-10-05 Thread pravesh
You could index the date as a text field (or use a new text field to store the date as text) and then try it on this new field. Thanx Pravesh

Re: Indexing PDF

2011-10-05 Thread Héctor Trujillo
It seems unreasonable that if I want to index a local file, I have to reference this local file by a URL. This isn't a strange file; it is a file downloaded from the Lucid web portal called: Starting a Search Application.pdf. This problem may be an encoding problem, or a charset problem. I

Re: Indexing PDF

2011-10-05 Thread Héctor Trujillo
Sorry, you are right: this file was indexed with a .Net web service client that calls a Java application (a web service) that calls Solr using SolrJ. I will try to index this in a different way; maybe this will resolve the problem. Thanks. Best regards. On 5 October 2011 08:42, Héctor

How do i get results for quering with separated words?

2011-10-05 Thread Mike Mander
Hello, I have configured a catchall searchword field. Into this I copy the value of the field name. Name value = Star Wars. Now I try to find this document by the searchword starwars, but it's not found. The same problem occurs vice versa: name value = SuperRTL, searchword is super rtl. Replacing all whitespaces

Re: How do i get results for quering with separated words?

2011-10-05 Thread stockii
Which type in schema.xml do you use? Try out WordDelimiterFilterFactory or some other filters from this site: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimiterFilterFactory - --- System

Re: Indexing PDF

2011-10-05 Thread Paul Libbrecht
Héctor, I meant that you need another way to reference the file *to the mailing list*. Sorry for the confusion. I do not think there's anything special about the set of interfaces you're using if the delivery is the same for the Solr client and the Acrobat plugin. To make sure of it, you could

Re: indexing FTP documet with solrj

2011-10-05 Thread Marc SCHNEIDER
Hello, To crawl the document you can use Apache Tika before sending the content to Solr (via SolrJ). Regards, Marc. On Wed, Oct 5, 2011 at 1:16 AM, Chris Hostetter hossman_luc...@fucit.org wrote: : I want to index some documents with the SolrJ APIs but the URL of these : documents is FTP, :

Re: How do i get results for quering with separated words?

2011-10-05 Thread Mike Mander
Thanks stockii, but WDFF is splitting on Numeric or NameChange only. For Star Wars in the index and starwars in the query this means that both are not equal. Right? Thanks Mike which type in the schema.xml do you use. try out WordDelimiterFilterFactory or some other filters from this site:

Re: Indexing PDF

2011-10-05 Thread Michael McCandless
Can you attach this PDF to an email sent to the list? Or is it too large for that? Or, you can try running Tika directly on the PDF to see if it's able to extract the text. Mike McCandless http://blog.mikemccandless.com 2011/10/5 Héctor Trujillo hecto...@gmail.com: Sorry you have the

Re: How do i get results for quering with separated words?

2011-10-05 Thread stockii
index this field without whitespaces ? XD - --- System One Server, 12 GB RAM, 2 Solr Instances, 8 Cores, 1 Core with 45 Million Documents other Cores 200.000 - Solr1 for Search-Requests - commit every Minute - 5GB Xmx -

Re: A simple query?

2011-10-05 Thread alexw
Thanks, but unfortunately that will not solve the problem, since it will bring back both the first and second doc. Besides, the query terms are: a b y z, not just: a b

Re: Indexing PDF

2011-10-05 Thread Michael McCandless
Hmm, no attachment; maybe it's too large? Can you send it directly to me? Mike McCandless http://blog.mikemccandless.com 2011/10/5 Héctor Trujillo hecto...@gmail.com: This is the file that gives me errors. 2011/10/5 Michael McCandless luc...@mikemccandless.com Can you attach this PDF to an

Re: Indexing PDF

2011-10-05 Thread Héctor Trujillo
I've uploaded the file here: http://www.filesonic.com/file/2342166624/Starting_a_Search_Application.pdf try this, thanks 2011/10/5 Michael McCandless luc...@mikemccandless.com Hmm, no attachment; maybe it's too large? Can you send it directly to me? Mike McCandless

Re: is there a way to know which mm value was used?

2011-10-05 Thread Shawn Heisey
On 10/5/2011 1:01 AM, elisabeth benoit wrote: Hello, I'd like to be able to know programmatically what value mm was set to for one request (to avoid having to parse the query, identify stopwords, calculate mm based on solrconfig.xml). Is there a way to get the mm value in the Solr response? To

Re: How do i get results for quering with separated words?

2011-10-05 Thread Mike Mander
Isn't this more a problem of the query string? Let's assume I have a game name like Nintendo 3DS - 'Star Wars - Clone Wars'. Can I copy that name to a field, cutting the - and ', lowercasing the resulting string and removing the whitespace, so that I have nintendo3dsstarwarsclonewars? Is that
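The normalization described here (drop punctuation, lowercase, remove whitespace) can be sketched outside Solr as follows. This is only an illustration of the string transformation, not a Solr analyzer; the function name `squash` is invented for the example, and the pattern keeps only ASCII letters and digits, which is an assumption that would need widening for accented names.

```python
import re


def squash(text: str) -> str:
    """Lowercase the text and drop everything except ASCII letters and digits."""
    return re.sub(r"[^a-z0-9]", "", text.lower())


# Applying the same function to the indexed value and to the user's query
# makes spaced and unspaced spellings compare equal.
indexed = squash("Star Wars")
queried = squash("starwars")
```

If the same transformation is applied at both index time and query time, Star Wars and starwars (or SuperRTL and super rtl) collapse to one token, at the cost of only matching on the whole squashed name.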

Re: schema changes changes 3.3 to 3.4?

2011-10-05 Thread jo
Okay, I did use the analysis tool and it did make me notice a few things, but more important is what changed: there is no longer a field type named text in the new schema; there is only text_en, which is weird, as the text field is the default when doing a query. Anyway, when I used the analysis tool and

Sorting by article title

2011-10-05 Thread themanwho
Hi all! I have documents, all of which have a title, and I would like to sort by that title. The catch is, I wish to sort ignoring any A or The at the beginning of the title. My first (and only) attempt is by creating a type that looks like: fieldType name=titleSort
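The intended ordering (ignore a leading A or The, then sort) can be sketched as a plain sort key. This is a minimal illustration of the rule only, not the fieldType from the message; the name `title_sort_key` and the sample titles are made up for the example.

```python
import re

# A leading "A" or "The" followed by whitespace, in any letter case.
_LEADING_ARTICLE = re.compile(r"^(?:a|the)\s+", re.IGNORECASE)


def title_sort_key(title: str) -> str:
    """Strip one leading article and lowercase, so sorting ignores it."""
    return _LEADING_ARTICLE.sub("", title.strip()).lower()


titles = ["The Matrix", "A Bug's Life", "Alien", "Brazil"]
sorted_titles = sorted(titles, key=title_sort_key)
```

Because the pattern requires whitespace after the article, a title such as Alien is left untouched. In Solr the equivalent is usually an untokenized sort field whose analyzer lowercases the value and removes the leading article with a pattern replacement.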

Re: is there a way to know which mm value was used?

2011-10-05 Thread elisabeth benoit
Thanks for answering. echoParams just echoes the mm value in solrconfig.xml (in my case mm = 4<-1 6<-2), not the actual value of mm for one particular request. I think it would be very useful to be able to know which mm value was effectively used, in particular for requests with stopwords. It's of course
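Recalculating the effective value on the client side could look roughly like the sketch below. It assumes the mm spec in this thread is 4<-1 6<-2 (conditional syntax) and follows the documented mm rules: a positive integer is an absolute count, a negative integer is subtracted from the clause count, a percentage is taken of the clause count, and `n<value` applies only when there are more than n optional clauses. The function name `effective_mm` is invented here; this is not Solr's own code, and the clause count passed in must be the number of optional clauses left after analysis (for example after stopword removal), which is exactly the part that is hard to know from outside.

```python
def effective_mm(optional_clauses: int, spec: str) -> int:
    """Minimum number of optional clauses that must match for a given mm spec."""
    result = optional_clauses
    spec = spec.strip()
    if "<" in spec:
        # Conditional specs, e.g. "4<-1 6<-2": walk the thresholds in order.
        for part in spec.split():
            upper, value = part.split("<", 1)
            if optional_clauses <= int(upper):
                return result
            result = effective_mm(optional_clauses, value)
        return result
    if spec.endswith("%"):
        # Percentage of the clause count, truncated toward zero.
        calc = int(optional_clauses * int(spec[:-1]) / 100)
    else:
        calc = int(spec)
    # Negative values mean "all but this many".
    result = optional_clauses + calc if calc < 0 else calc
    return max(0, min(optional_clauses, result))


# Effective mm for queries of 1 to 8 optional clauses under "4<-1 6<-2".
table = {n: effective_mm(n, "4<-1 6<-2") for n in range(1, 9)}
```

With this spec, up to four clauses are all required, five or six clauses allow one miss, and seven or more allow two.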

Re: How do i get results for quering with separated words?

2011-10-05 Thread elisabeth benoit
I think you could define star wars and starwars as synonyms in synonyms.txt... maybe not generic enough? 2011/10/5 Mike Mander wicket-m...@gmx.de Isn't this more a problem of the query string? Let's assume i have a game name like Nintendo 3DS - 'Star Wars - Clone Wars'. Can i copy that

How to empty SolR Cache

2011-10-05 Thread David GUYOT
Hello, everybody. Firstly, I must warn you that I'm a probie with mailing lists and a Froggie, so please excuse what may look like obvious errors, in both computing and language. I'm currently trying to benchmark my SolR install with a custom script, but this benchmark must be run with all

Re: is there a way to know which mm value was used?

2011-10-05 Thread Shawn Heisey
On 10/5/2011 9:06 AM, elisabeth benoit wrote: thanks for answering. echoParams just echoes the mm value in solrconfig.xml (in my case mm = 4<-1 6<-2), not the actual value of mm for one particular request. I think it would be very useful to be able to know which mm value was effectively used, in

Re: How to empty SolR Cache

2011-10-05 Thread Shawn Heisey
On 10/5/2011 9:18 AM, David GUYOT wrote: I'm currently trying to benchmark my SolR install with a custom script, but this benchmark must be run with all SolR caches empty; is there a way to erase SolR caches by a command or to restart SolR with an option to avoid cache autowarming? Remove any

more like this

2011-10-05 Thread Fred Zimmerman
Hi, for my application, I would like to be able to create web queries (wget/curl) that get more like this for either a single arbitrarily specified URL or for the first x terms in a search query. I want to return the results to myself as a csv file using wt=csv. How can I accomplish the MLT

Re: Scoring of DisMax in Solr

2011-10-05 Thread David Ryan
Thanks! What's the procedure to report this if it's a bug? EDisMax has similar behavior. On Tue, Oct 4, 2011 at 11:24 PM, Bill Bell billnb...@gmail.com wrote: This seems like a bug to me. On 10/4/11 6:52 PM, David Ryan help...@gmail.com wrote: Hi, When I examine the score calculation

Re: How do i get results for quering with separated words?

2011-10-05 Thread Mikhail Khludnev
Have you tried to correct spaces with a spelling dictionary? If you build your dictionary from non-tokenized terms, you'll have starwars - Star Wars and super rtl-superrtl corrections. WDYT? On Wed, Oct 5, 2011 at 7:13 PM, elisabeth benoit elisaelisael...@gmail.com wrote: I think you could define

Re: CopyField copying to self

2011-10-05 Thread Gora Mohanty
On Thu, Oct 6, 2011 at 1:49 AM, Jamie Johnson jej2...@gmail.com wrote: I have a field named test_txt which I am populating in some cases, and not in others. I also have a copy field directive to copy data from _txt to text_txt. Things seem to work except I believe the field is also copying