RE: Ebay Kleinanzeigen and Auto Suggest

2011-05-03 Thread Charton, Andre
Hi, yes we do. If you use a limit number of categories (like 100) you can use dynamic fields with the termscomponent and by choosing a category specific prefix, like: {schema.xml} ... dynamicField name=*_suggestion type=textAS indexed=true stored=false multiValued=true omitNorms=true/ ...

Re: fq parameter with partial value

2011-05-03 Thread elisabeth benoit
Ok, thanks a lot. After making a few tests, I finally understood what you meant. Best regards, Elisabeth 2011/5/2 Jonathan Rochkind rochk...@jhu.edu So if you have a field that IS tokenized, regardless of what it's called, then when you send My Great Restaurant to it for _indexing_, it gets

Re: Indexing multiple languages

2011-05-03 Thread Stefan Matheis
Peter, is there a specific need to split these entities? why not just fetch both columns in one entity? like this: entity name=artist_category query=select categoryid from artist_categories where objectid=${artist.id} entity name=category query=select title_en, title_nl from

stemming for English

2011-05-03 Thread Dmitry Kan
Dear list, In SOLR schema on the index side we use no stemming to support favor wildcard search. On the query side of the index we use Porter stemming. I have noticed the following issue: the term pretty gets stemmed to pretti and thus not found. What would be the approach to handle such

Re: stemming for English

2011-05-03 Thread lboutros
Hi, I think you have to use stemming on both side (index and query) if you really want to use stemming. Ludovic 2011/5/3 Dmitry Kan [via Lucene] ml-node+2893599-894006307-383...@n3.nabble.com Dear list, In SOLR schema on the index side we use no stemming to support favor wildcard search.

Re: stemming for English

2011-05-03 Thread Dmitry Kan
Hi Ludovic, That's an option we had before we decided to go for a full-blown support of wildcards. Do you know of a way to keep both stemming and consistent wildcard support in the same field?` Dmitry On Tue, May 3, 2011 at 12:56 PM, lboutros boutr...@gmail.com wrote: Hi, I think you have

Re: stemming for English

2011-05-03 Thread lboutros
Dmitry, I don't know any way to keep both stemming and consistent wildcard support in the same field. To me, you have to create 2 different fields. Ludovic. 2011/5/3 Dmitry Kan [via Lucene] ml-node+2893628-993677979-383...@n3.nabble.com Hi Ludovic, That's an option we had before we decided

Re: Nutch Web Interface - not anymore in 1.3

2011-05-03 Thread Gabriele Kahlout
Hello, I'm also in favor of maintaing a web interface that ships with nutch. As has been mentioned it say well be a bridge to Solr. If I find the time to contribute my solution (and make it general enough), I'll happily do it. Earlier I was wondering of actually using the previous nutch web

Re: stemming for English

2011-05-03 Thread Dmitry Kan
Yes, Ludovic. Thus effectively we get index doubled. Given the volume of data we store, we very carefully consider such cases, where the doubling of index is must. Dmitry On Tue, May 3, 2011 at 1:08 PM, lboutros boutr...@gmail.com wrote: Dmitry, I don't know any way to keep both stemming and

Re: Dismax scoring multiple fields TIE

2011-05-03 Thread Erick Erickson
I'm not sure you can. very short fields aren't differentiated on the basis of field length due to rounding errors. Here's a cut-n-paste from Jay Hill: So the values are not pre-set for the lengthNorm, but for some counts the fieldLength value winds up being the

Re: Dismax scoring multiple fields TIE

2011-05-03 Thread roySolr
No, but i think the difference between fieldlength is large and the score is still the same. Same score for this results(q=chelsea): 1. Chelsea is a very very big club in london, england Chelsea Sport 2. Chelsea

facet search and UnInverted multi-valued field?

2011-05-03 Thread Bernd Fehling
Dear list, we use solr 3.1.0. my logs have the following entry: May 3, 2011 2:01:39 PM org.apache.solr.request.UnInvertedField uninvert INFO: UnInverted multi-valued field

Re: Dismax scoring multiple fields TIE

2011-05-03 Thread elisabeth benoit
for category:chelsea, you have a fieldNorm=1.0, so your category field must have a type with omitNorms=true. if you don't have omitNorms=true, then shorter field will score higher. I'm new to Solr, but from what I've experienced, this is the cause. Regards, Elisabeth 2011/5/3 roySolr

full-import called simultaneously for multiple core.

2011-05-03 Thread Kannan
Hi I am running one instance with multiple core. If call full-import URI simultanesously for multiple core few of the fields are not getting indexed.If i do full-import one by one.Its works fine. Thanks in advance -- View this message in context:

Unable to use DataImportHandler

2011-05-03 Thread serenity keningston
Hello Friends, I am new to Solr and experiencing issue while trying to use DataImportHandler. I added the required fields to schema.xml file and here is my data-config.xml file : dataConfig dataSource driver=com.mysql.jdbc.Driver url=jdbc:mysql://localhost:3306/mp3 user=root

Re: Unable to use DataImportHandler

2011-05-03 Thread Stefan Matheis
Serenity, there is no field fileNo in your SELECT-Query? you've defined it in the pk-Attribut of the entity Tag, but it's also required in the Query itself. just to note it: you can skip your field definition, if the query returns the field with exactly the same name as the solr-schema requires

Re: Unable to use DataImportHandler

2011-05-03 Thread serenity keningston
Dear Stefan, Am still getting the following error message even after including the pk-Attribute to the query : WARNING: Error creating document : SolrInputDocument[{lname=lname(1.0)={cindy}, file=file(1.0)={ http://localhost:8084/Access/UploadFiles/laura.mp3}, fname=fname(1.0)={troutman}}]

Re: full-import called simultaneously for multiple core.

2011-05-03 Thread Erick Erickson
Do you log files show any errors? Erick On Tue, May 3, 2011 at 11:06 AM, Kannan ramkannan2...@gmail.com wrote: Hi   I am running one instance with multiple core. If call full-import URI simultanesously for multiple core few of the fields are not getting indexed.If i do full-import one by

Getting field information inside a Tokenizer

2011-05-03 Thread FatMan Corp
Hi, I would like to get another's field information for the same document within a Tekonizer class. How can this be achieved? Thanks

Re: Unable to use DataImportHandler

2011-05-03 Thread Erick Erickson
The next thing to check is if your select statement returns the fileNo for every field. Wait.. You took out the field column=fileNo name=id bit of your entity definition, is that a cut/paste error? You might get some joy from the DIH debug page at: solr homesolr/admin/dataimport.jsp it's not

Re: Unable to use DataImportHandler

2011-05-03 Thread serenity keningston
Dear Erick, I am using Solr 1.4 version. Yes, for each row , we will get one fileNo which is the primary key for the table file. No, I intentionally removed the field column=fileNo name=id from the data-config.xml I tried opening the dataimport.jsp to debug but I don't know, it doesn't show any

getLuceneVersion parsing xml node on every request

2011-05-03 Thread Stephane Bailliez
I' m using Solr 3.1 right now. I was looking at a threadump trying to figure out why queries were not exactly fast and noticed that it keeps parsing xml over and over from the schema to get the lucene version. SolrQueryParser are created for each request and in the constructor there is a call

UIMA analysisEngine path

2011-05-03 Thread Barry Hathaway
I'm new to Solr and trying to get it call a UIMA aggregate analysis engine and not having much luck. The null pointer exception indicates that it can't find the xml file associated with the engine. I have tried a number of combinations of a path in the analysisEngine element, but nothing seems

Re: facet search and UnInverted multi-valued field?

2011-05-03 Thread Jay Hill
UnInvertedField is similar to Lucene's FieldCache, except, while the FieldCache cannot work with multivalued fields, UnInvertedField is designed for that very purpose. So since your f_dcperson field is multivalued, by default you use UnInvertedField. You're not doing anything wrong, that's default

RE: stemming for English

2011-05-03 Thread Robert Petersen
From what I have seen, adding a second field with the same terms as the first does *not* double your index size at all. -Original Message- From: Dmitry Kan [mailto:dmitry@gmail.com] Sent: Tuesday, May 03, 2011 4:06 AM To: solr-user@lucene.apache.org Subject: Re: stemming for English

RE: Getting field information inside a Tokenizer

2011-05-03 Thread Steven A Rowe
Hi FMC, On 5/3/2011 at 12:37 PM, FatMan Corp wrote: Hi, I would like to get another's field information for the same document within a Tekonizer class. How can this be achieved? Use copyFields in your schema http://wiki.apache.org/solr/SchemaXml#Copy_Fields, and associate different analysis

Re: Unable to use DataImportHandler

2011-05-03 Thread Erick Erickson
OK, put it back G According to this page: http://wiki.apache.org/solr/DataImportHandler#Configuration_in_data-config.xml-1 the pk is used for delta imports and has no relation to uniqueKey defined in the schema.xml The error you're getting is because your schema.xml defines the id field

Re: How to debug if termsComponent is used

2011-05-03 Thread cyang2010
I tried it. It just does not work. the debug component only works when query component is there, and it is just showing debugging information for query result, not term match result. -- View this message in context:

Re: How to debug if termsComponent is used

2011-05-03 Thread Erick Erickson
Saying it does not work doesn't give us much to go on. Can you describe what you've tried? *How* it fails? Have you looked in the log for any clues? You might review this page: http://wiki.apache.org/solr/UsingMailingLists Best Erick On Tue, May 3, 2011 at 3:35 PM, cyang2010

How to know which value matched for a multi-value field

2011-05-03 Thread cyang2010
Hi, I have a use case where i need to know for a paritcular multivalue field, which particular value match when a query is run on that field. For example, for a movie document, movie title name is single-value field, movie actors is multi-value field. When user search colin, i know it is colin

Re: How to debug if termsComponent is used

2011-05-03 Thread cyang2010
Sorry i didn't mean to give random reply. It is just today my solr configuration/schema is different now and can't get the error message. Anyway, i rerun the test. Basically by specifying such searchcomponent and requesthandler, you won't get any error. When you query this it is fine without

An error I can't manage to fix: java.lang.NoClassDefFoundError: org/apache/solr/search/QParserPlugin

2011-05-03 Thread Gavin Engel
Hello all, I've been trying to add the Spatial Search Plugin to my Solr 1.4.1 setup, and I get this error: java.lang.NoClassDefFoundError: org/apache/solr/search/QParserPlugin at java.lang.ClassLoader.defineClass1(Native Method) at java.lang.ClassLoader.defineClass(ClassLoader.java:634) at

Re: An error I can't manage to fix: java.lang.NoClassDefFoundError: org/apache/solr/search/QParserPlugin

2011-05-03 Thread Markus Jelsma
Where did you store the jar? Is it in a directory Solr looks for libs? Depending on your distro or set up there can be different places to store the jar. The easiest solution is to put it in a dir where other Solr libs are found or in a dir that you configured in a lib directive in solrconfig.

Re: An error I can't manage to fix: java.lang.NoClassDefFoundError: org/apache/solr/search/QParserPlugin

2011-05-03 Thread Gavin Engel
Oh, I apparently figured out how to get the jar file to load, so problem is solved I suppose. The fix seems very odd to me, but I got it from a comment on the SSP 2 blog page ( http://blog.jteam.nl/2009/08/03/geo-location-search-with-solr-and-lucene/comment-page-1/#comment-4774 ): The solution,

RE: Ebay Kleinanzeigen and Auto Suggest

2011-05-03 Thread Andy
--- On Tue, 5/3/11, Charton, Andre achar...@ebay-kleinanzeigen.de wrote: yes we do. If you use a limit number of categories (like 100) you can use dynamic fields with the termscomponent and by choosing a category specific prefix, like: {schema.xml} ... dynamicField name=*_suggestion

Re: Has NRT been abandoned?

2011-05-03 Thread Nagendra Nagarajayya
Thanks Andy! Solr-RA is the same as Solr, except that the underlying search library is now RankingAlgorithm library instead of Lucene. BoostQParserPlugin works at the Solr level, so this should still work as before. A query of the form q={!boost b=log(x)}abcde comes back with results but am

Re: getLuceneVersion parsing xml node on every request

2011-05-03 Thread Stephane Bailliez
I went ahead and patched locally the SolrQueryParser in current 3_x branch. Doing a quick test, baring any obvious mistake due to sleep deprivation I get close to a 10X performance boost from 200qps to 2000qps. I opened https://issues.apache.org/jira/browse/SOLR-2493 cheers, -- stephane On

RE: How to take differential backup of Solr Index

2011-05-03 Thread Gaurav Shingala
how we can configure query server in solr using replication feature? Thanks, Gaurav Date: Mon, 2 May 2011 22:05:33 -0700 Subject: Re: How to take differential backup of Solr Index From: goks...@gmail.com To: solr-user@lucene.apache.org The Replication feature does this. If you configure

Re: Replicaiton Fails with Unreachable error when master host is responding.

2011-05-03 Thread Jed Glazner
So it turns out that it's the host names. According the DNS RFC underscores are not valid in host names. Most DNS servers now support them, but it's not in the rfc strictly speaking. So there must be something in the underlying java classes that bork when using underscores in host names,

Using lowercase as field type

2011-05-03 Thread Isan Fulia
Hi , My schema consists of a field of type lowercase(for applying the lowercase filter factory) and is the unique key . But its no longer behaving as unique key. Multiple documents with same value for the unique key are getting indexed. Does anyone know why this is happening or is it that the

How to preserve filelist / commit-points after master restart

2011-05-03 Thread Maduranga Kannangara
Hi All, We use Solr 1.4.1. Single core setup with a repeater (for QA) and a few slaves (for Production). Master will index many sources and make data ready. Once all data is ready-for-production, optimization will take place. On master replicateAfter is set to optimize. (Subsequently on repeater