Re: Getting a wierd Class Not Found Exception: SolrParams

2011-07-25 Thread Sowmya V.B.
Hi Eric Yes, it was a classpath issue. Sowmya. On Mon, Jul 25, 2011 at 4:01 PM, Erick Erickson erickerick...@gmail.com wrote: Well, MultiMapSolrParams is a subclass of SolrParams, so you actually do use it in your code G But this looks like a classpath problem. You say your code

Re: Wiki Error JSON syntax

2011-07-25 Thread Gabriel Farrell
On Mon, Jul 25, 2011 at 12:24 PM, Stefan Matheis matheis.ste...@googlemail.com wrote: Hi Remy, so you may open an Issue for this on the github Project? i mean .. just creating another client, because i have one problem, does not sound like a good plan? Agreed, and thanks for calling my

Re: Wiki Error JSON syntax

2011-07-25 Thread Remy Loubradou
Hi, 2011/7/25 Gabriel Farrell gsf...@gmail.com On Mon, Jul 25, 2011 at 12:24 PM, Stefan Matheis matheis.ste...@googlemail.com wrote: Hi Remy, so you may open an Issue for this on the github Project? i mean .. just creating another client, because i have one problem, does not sound like

CoreAdminHandler: can I specify custom properties when creating cores?

2011-07-25 Thread Yury Kats
When creating cores through solr.xml, I am able to specify custom properties, to be referenced in solrconfig.xml. For example: <cores adminPath="/admin/cores" defaultCoreName="master"> <core name="master" instanceDir="core1" shard="shard1" collection="myconf"> <property name="enable.master" value="true" />

Re: Ignore records that are missing a value in a field

2011-07-25 Thread Erick Erickson
Hmmm, I think that should be fq=field:[* TO *] since the problem is to include docs with anything in that field. You could also index a special EMPTY_FIELD string and do something like fq=-field:[EMPTY_FIELD].. Best Erick On Mon, Jul 25, 2011 at 12:10 PM, Stefan Matheis
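
The two filter forms discussed in this thread (include only documents that have a value, or exclude them) can be sketched as a small query-string builder. This is a minimal illustration, not part of the original message: the helper name `has_value_filter` is hypothetical, the field name `common_names` is taken from the original question, and `q=*:*` is assumed as the main query.

```python
from urllib.parse import urlencode


def has_value_filter(field, exclude=False):
    """Build an fq clause on "field has any value".

    exclude=False -> field:[* TO *]   (keep docs that have a value)
    exclude=True  -> -field:[* TO *]  (keep docs that have no value)
    """
    clause = f"{field}:[* TO *]"
    return f"-{clause}" if exclude else clause


# Assumed main query (*:*) plus the filter from the reply above.
params = [("q", "*:*"), ("fq", has_value_filter("common_names"))]
query_string = urlencode(params)  # URL-encoded, ready to append to /select?
print(query_string)
```

Passing `exclude=True` gives the negated form that appears elsewhere in the thread, which keeps only documents where the field is empty.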

Re: multivalue or denormalise

2011-07-25 Thread Erick Erickson
I'm a little confused. Are you searching against these different titles or is the search something else and you're really only interested in displaying different titles for documents returned for the query? If it's just a display issue, you can use multivalued fields, the order in which you put

Re: Updating fields in an existing document

2011-07-25 Thread Chris Hostetter
: A followup. The wiki has a whole discussion of the 'update' XML : message. But solrj has nothing like it. Does that really exist? Is : there a reason to use it? If I just 'add' the document a second time, : it will replace? You should only see update in Solr docs used in the context of

please help explaining debug output

2011-07-25 Thread Robert Petersen
I have three documents with the following product titles in a text field called moreWords with analysis stack matching the solr example text field definition. 1. HP LaserJet P1102W Monochrome Laser Printer http://www.buy.com/prod/hp-laserjet-p1102w-monochrome-laser-printer/q/l

RE: please help explaining debug output

2011-07-25 Thread Robert Petersen
Sorry, to clarify a search for P1102W matches all three docs but a search for p1102w LaserJet only matches the second two. Someone asked me a question while I was typing and I got distracted, apologies for any confusion. -Original Message- From: Robert Petersen [mailto:rober...@buy.com]

Re: dih fetching but not adding records to index

2011-07-25 Thread abhayd
thanks!! it worked. I was just wondering if xpath can be used to use process default xml format for solr index doc -- View this message in context: http://lucene.472066.n3.nabble.com/dih-fetching-but-not-adding-records-to-index-tp3189438p3198705.html Sent from the Solr - User mailing list

Re: please help explaining debug output

2011-07-25 Thread Erick Erickson
Hmmm, I'm assuming that moreWords is your default text field, yes? But it works for me (tm), using 1.4.1. What version of Solr are you on? Also, take a glance at the admin/analysis page, that might help... Gotta run Erick On Mon, Jul 25, 2011 at 4:52 PM, Robert Petersen rober...@buy.com

Re: multivalue or denormalise

2011-07-25 Thread abhayd
hi erick, I will be searching only on search_term. I did exactly as u said in application layer, I was not sure how multi-valued fields works in co-relation -- View this message in context: http://lucene.472066.n3.nabble.com/multivalue-or-denormalise-tp3197942p3198710.html Sent from the

RE: please help explaining debug output

2011-07-25 Thread Robert Petersen
I'm still on solr 1.4.0 and the analysis page looks like they should match, and other products with the same content do in fact match. I'm reindexing the non-matching ones to rule that out. -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Monday, July 25,

Re: Updating fields in an existing document

2011-07-25 Thread Benson Margulies
As in http://wiki.apache.org/solr/UpdateXmlMessages? On Mon, Jul 25, 2011 at 4:10 PM, Chris Hostetter hossman_luc...@fucit.org wrote: : A followup. The wiki has a whole discussion of the 'update' XML : message. But solrj has nothing like it. Does that really exist? Is : there a reason to use

Re: please help explaining debug output

2011-07-25 Thread Erick Erickson
Hmmm, I can't find a convenient 1.4.0 to download, but re-indexing is a good idea since this seems like it *should* work. Erick On Mon, Jul 25, 2011 at 5:32 PM, Robert Petersen rober...@buy.com wrote: I'm still on solr 1.4.0 and the analysis page looks like they should match, and other

Re: Updating fields in an existing document

2011-07-25 Thread Grant Ingersoll
This is a pretty low level issue with inverted indexes (i.e. the underlying data structure used) and not so much the architecture. It is possible, I suppose, to solve it at the architectural level, but in many cases this causes performance problems that are not usually acceptable. On Jul 20,

Re: commit time and lock

2011-07-25 Thread Jonathan Rochkind
Thanks, this is helpful. I do indeed periodically update or delete just about every doc in the index, so it makes sense that optimization might be necessary even in post 1.4, but I'm still on 1.4 -- add this to another thing to look into rather than assume after I upgrade. Indeed I was

Re: Updating fields in an existing document

2011-07-25 Thread Chris Hostetter
: As in http://wiki.apache.org/solr/UpdateXmlMessages? Exactly ... the title is XML Messages for Updating a Solr Index But i do see some confusing usages of add/update in the context of documents that definitely don't belong there -- so i've changed them to add/replace. Thanks for bringing

How to query solr status

2011-07-25 Thread ZiLi
Does anybody know how to query a Solr server for whether it is optimized or not? Since replication can configure a slave to pull the index after an optimize, I think there must be some way to query that, but I didn't find any documentation on it. Does anyone know? Thanks so much O(n_n)O

Re: Logically equivalent queries but vastly different no of results?

2011-07-25 Thread cnyee
Yes - I am using edismax but the reason is not obvious to me can you give me a pointer? Thanks Yee -- View this message in context: http://lucene.472066.n3.nabble.com/Logically-equivalent-queries-but-vastly-different-no-of-results-tp3190278p3199362.html Sent from the Solr - User mailing

Re: Document IDs instead of count for facets?

2011-07-25 Thread Jeff Schmidt
Hi Yonik: On Jul 17, 2011, at 9:30 AM, Yonik Seeley wrote: On Sun, Jul 17, 2011 at 10:38 AM, Jeff Schmidt j...@535consulting.com wrote: I don't want to query for a particular facet value, but rather have Solr do a grouping of facet values. I'm not sure about the appropriate nomenclature

How to make a valid date facet query?

2011-07-25 Thread Floyd Wu
Hi all, I need to make a date-faceted query, and I tried to use facet.range but can't get the result I need. I want to make 4 facets like the following: 1 month, 3 months, 6 months, more than 1 year. The onlinedate field in schema.xml looks like this: <field name="onlinedate" type="tdate" indexed="true" stored="true"/> I

Re: dih fetching but not adding records to index

2011-07-25 Thread Gora Mohanty
On Fri, Jul 22, 2011 at 12:42 AM, abhayd ajdabhol...@hotmail.com wrote: hi I m trying to load data into solr index from a xml file using dih my promotions.xml file -- <add> <doc> <field name="id">3</field> </doc> <doc> <field

Re: Is anobdy using lotsofcores feature in production?

2011-07-25 Thread Shalin Shekhar Mangar
LotsOfCores does not exist as a feature. It is just a write-up, some jira issues and a couple of patches. Did I miss something? On Sun, Jul 24, 2011 at 8:26 PM, Markus Jelsma markus.jel...@openindex.io wrote: It works fine but you would keep an eye on additional overhead, cores `stealing` too

Re: Is anobdy using lotsofcores feature in production?

2011-07-25 Thread Markus Jelsma
No i missed something and interpreted the question as using a lot of cores. LotsOfCores does not exist as a feature. It is just a write-up, some jira issues and a couple of patches. Did I miss something? On Sun, Jul 24, 2011 at 8:26 PM, Markus Jelsma markus.jel...@openindex.io wrote: It

Re: Wiki Error JSON syntax

2011-07-25 Thread Remy Loubradou
Hey Stephan, Thanks, but I already used this solr client and I got an error when I add too many documents: FATAL ERROR: JS Allocation failed - process out of memory. I didn't find the source of the problem in the solr client. So I decided to write my own without this error hopefully and also I'm

SolrJ and class versions

2011-07-25 Thread Tarjei Huse
Hi, I recently went through a little hell when I upgraded my Solr servers to 3.2.0. What I didn't anticipate was that my Java SolrJ clients depend on the server version. I would like to add a note about this in the SolrJ docs: http://wiki.apache.org/solr/Solrj#Streaming_documents_for_an_update

in fragsize whats the pre hit number and post hit number

2011-07-25 Thread jame vaalet
hi, when searching for the word SOLR, highlighting in Solr can be manipulated with frag-size=10. How is the fragment decided? How many characters are taken before the word SOLR and after the word SOLR? jame

Re: strip html from data

2011-07-25 Thread Markus Jelsma
You've three analyzer elements, i wonder what that would do. You need to add the char filter to the index-time analyzer. On Monday 25 July 2011 13:09:14 Merlin Morgenstern wrote: Hi there, I am trying to strip html tags from the data before adding the documents to the index. To do that I

Re: strip html from data

2011-07-25 Thread Merlin Morgenstern
sounds logical. I just changed it to the following, restarted and reindexed with commit: <fieldType name="text" class="solr.TextField" positionIncrementGap="100" autoGeneratePhraseQueries="true"> <analyzer type="index"> <tokenizer

Re: strip html from data

2011-07-25 Thread Markus Jelsma
Are you looking at the returned result set or what you've actually indexed? Analyzers are not run on the stored data, only on indexed data. On Monday 25 July 2011 15:03:18 Merlin Morgenstern wrote: sounds logical. I just changed it to the following, restarted and reindexed with commit:

Re: how to get solr core information using solrj

2011-07-25 Thread Erick Erickson
http://lucene.apache.org/solr/api/org/apache/solr/client/solrj/request/CoreAdminRequest.html That should get you started. Best Erick On Tue, Jul 19, 2011 at 11:40 PM, Jiang mingyuan mailtojiangmingy...@gmail.com wrote: hi all, Our solr server contains two cores:core0,core1,and they both

Re: - character in search query

2011-07-25 Thread Erick Erickson
dismax is a fairly narrow-use parser. By that I mean it was created to solve a specific issue. It has some pronounced warts as you've discovered. edismax is the preferred parser if you have access to it. I'd just ignore dismax if you have access to edismax. There's been some talk of deprecating

Re: Schema Design/Data Import

2011-07-25 Thread Erick Erickson
I'd seriously consider going with SolrJ as your indexing strategy, it allows you to do anything you need to do in Java code. You can call the Tika library yourself on the files pointed to by your rows as you see fit, indexing them as you choose, perhaps one Solr doc per attachment, perhaps one per

highlighting fragsize

2011-07-25 Thread jame vaalet
hi, when you highlight and get back snippet fragments, can you overwrite the default hl.regex.pattern through the URL? Can someone quote an example URL of that sort? What if I pass hl.slop=0, will this stop considering the regex pattern at all? -- -JAME

Re: Getting a wierd Class Not Found Exception: SolrParams

2011-07-25 Thread Erick Erickson
Well, MultiMapSolrParams is a subclass of SolrParams, so you actually do use it in your code G But this looks like a classpath problem. You say your code compiles, but do you make all the jars you path to during compilation available to your servlet? And/or do you have any old jar files in

Re: filter query parameter not working as expected

2011-07-25 Thread Erick Erickson
Well, WAY_ANALYZED:de l hotel de ville parses as WAY_ANALYZED:de default:l default:hotel default:de default:ville You probably want something like WAY_ANALYZED:(de l hotel de ville), perhaps with AND between them. Try adding debugQuery=on to your queries and you can sometimes see this kind of
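
The parsing behaviour described above (only the first term binds to the named field, the rest fall through to the default field) suggests building the clause programmatically. The sketch below shows the two workarounds mentioned in this thread: grouping with parentheses, and fully qualifying each term. The helper names `group_terms` and `qualify_terms` are hypothetical and not part of Solr or SolrJ; the code only assembles query strings.

```python
def group_terms(field, text, operator=""):
    """Scope every whitespace-separated term to `field` with parentheses.

    operator may be "" (use the parser's default operator), "AND" or "OR".
    """
    joiner = f" {operator} " if operator else " "
    return f"{field}:({joiner.join(text.split())})"


def qualify_terms(field, text):
    """Alternative: prefix each term with the field name explicitly."""
    return " ".join(f"{field}:{term}" for term in text.split())


print(group_terms("WAY_ANALYZED", "de l hotel de ville"))
print(group_terms("WAY_ANALYZED", "de l hotel de ville", operator="AND"))
print(qualify_terms("WAY_ANALYZED", "de l hotel de ville"))
```

Neither helper escapes special query characters, so this is only suitable for plain word input like the example in the thread.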

Re: Frange Function Query

2011-07-25 Thread Erick Erickson
I'm no expert on frange, but fq clauses are intersections. So if your two frange queries have no terms in common, you won't get anything. You can think of it as an implied AND between all the fq clauses you specify... Best Erick On Thu, Jul 21, 2011 at 5:29 AM, Rohit Gupta ro...@in-rev.com
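
The "implied AND" between fq clauses can be pictured with plain sets of document ids. This is a toy model under stated assumptions, not how Solr is implemented internally: the function `apply_filters` and the document ids are made up purely for illustration.

```python
from functools import reduce


def apply_filters(query_hits, filter_hits):
    """Intersect the main query's hits with the hits of every fq clause."""
    return reduce(set.intersection, filter_hits, set(query_hits))


# Hypothetical document ids matched by the main query and by each fq clause.
main_query = {1, 2, 3, 4, 5}
fq_a = {1, 2, 3}
fq_b = {3, 4}
fq_disjoint = {9}

overlapping = apply_filters(main_query, [fq_a, fq_b])
nothing_left = apply_filters(main_query, [fq_a, fq_disjoint])
print(overlapping, nothing_left)
```

When two filters share no documents, as with `fq_a` and `fq_disjoint`, the intersection is empty, which matches the "you won't get anything" case described in the reply.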

Re: Rounding errors in solr

2011-07-25 Thread Brian Lamb
Yes and that's causing some problems in my application. Is there a way to truncate the 7th decimal place in regards to sorting by the score? On Fri, Jul 22, 2011 at 4:27 PM, Yonik Seeley yo...@lucidimagination.com wrote: On Fri, Jul 22, 2011 at 4:11 PM, Brian Lamb brian.l...@journalexperts.com

Re: Schema Design/Data Import

2011-07-25 Thread Travis Low
Thanks so much Erick (and Stefan). Yes, I did some reading on SolrJ and Tika and you are spot-on. We will write our own importer using SolrJ and then we can grab the DB records and parse any attachments along the way. Now it comes down to a schema design question. The issue I'm struggling with

using distributed search with the suggest component

2011-07-25 Thread Tobias Rübner
Hi, I try to use the suggest component (solr 3.3) with multiple cores. I added a search component and a request handler as described in the docs ( http://wiki.apache.org/solr/Suggester) to my solrconfig. That works fine for 1 core but querying my solr instance with the shards parameter does not

Re: filter query parameter not working as expected

2011-07-25 Thread elisabeth benoit
thanks, using parentheses: select?q=VINCI Park&fq=WAY_ANALYZED:(rue de l hotel de ville) AND (TOWN_ANALYZED:paris OR DEPARTEMENT_ANALYZED:paris)&rows=200&fl=NAME,TOWN,WAY,score&debugQuery=on works, but I would rather not use parentheses or AND between those words. this brings another question:

RE: Spellcheck compounded words

2011-07-25 Thread Dyer, James
I'm afraid there currently isn't much support for correcting misplaced whitespace. Solr is going to look at each word individually and won't even try to combine adjacent words (or split a word into 2 or more). So there is no good way to get these kinds of suggestions. One thing that might

Re: commit time and lock

2011-07-25 Thread Erick Erickson
Yeah, the 1.4 code base is older. That is, optimization will have more effect on that vintage code than on 3.x and trunk code. I should have been a bit more explicit in that other thread. In the case where you add a bunch of documents, optimization doesn't buy you all that much currently. If you

Re: problem with ? wild card searches in solr

2011-07-25 Thread Tomás Fernández Löbbe
Are you using stemming on that field? Sometimes stemming and wildcards don't get along very well. If you are, take a look at how the terms that should match ban?le are analyzed on the Analysis section of the admin. On Sat, Jul 23, 2011 at 6:33 AM, Romi romijain3...@gmail.com wrote: I am using

Re: Schema Design/Data Import

2011-07-25 Thread Stefan Matheis
Travis, that sounds like a perfect usecase for dynamic fields .. attachment_* and there you go. works for no attachment, as well as one, three or 50. for the user interface, you could iterate over them and show them as list - or something else that would fit your need. also, maybe, you

Re: Schema Design/Data Import

2011-07-25 Thread Erick Erickson
Well, the attachment_1, attachment_2 idea would be awkward to form queries (i.e. there would be 100 clauses if there were 100 docs?) Dynamic fields have this same problem. You could certainly index them all into a big field, just make it multivalued and do a SolrDocument.add(bigtextfield,

Re: filter query parameter not working as expected

2011-07-25 Thread Erick Erickson
Not that I know of, although it does give you the parsed fq results, which you could then use as query parameters (i.e. the 'q' parameter) for debugging... You have to use parens or fully qualify each term (e.g. WAY_ANALYZED:rue WAY_ANALYZED:de), that's just how the query parsing works...

Re: Schema Design/Data Import

2011-07-25 Thread Stefan Matheis
On 25.07.2011 16:58, Erick Erickson wrote: Well, the attachment_1, attachment_2 idea would be awkward to form queries (i.e. there would be 100 clauses if there were 100 docs?) Dynamic fields have this same problem. Oh, yes .. correct .. overlooked that part :/ sorry.

Re: Spellcheck compounded words

2011-07-25 Thread Markus Jelsma
This will work for misspelled compounds indeed but not when the compound word is actually queried as two separate correctly spelled words. Most likely both sail and boat exist in the index as single token. There is a work around but that's limited to a scenario where users never use more than 1

RE: Spellcheck compounded words

2011-07-25 Thread Dyer, James
Related to this is this jira issue: https://issues.apache.org/jira/browse/SOLR-2585 . With this patch, Solr will consider alternatives in cases where a word is misspelled in its context, but nevertheless exists in the index and/or dictionary. This is a work-in-progress and is for trunk only,

Strange suggestions with spell checker

2011-07-25 Thread Jens Hoffrichter
Hello all, I'm getting a strange suggestion for a purposely mistyped word in Solr 1.4.1 I search for the term snia, and I would expect the term sina to be suggested, as this is a fairly common word in quite a bit of the indexed documents. Instead, I'm getting india as a suggestion, which is

Ignore records that are missing a value in a field

2011-07-25 Thread Brian Lamb
Hi all, I have an optional field called common_names. I would like to keep this field optional but at the same time, occasionally do a search where I do not include results where there is no value set for this field. Is this possible to do within solr? In other words, I would like to do a search

Re: strip html from data

2011-07-25 Thread Mike Sokolov
I think you need to list the charfilter earlier in the analysis chain; before the tokenizer. Probably Solr should tell you this... -Mike On 07/25/2011 09:03 AM, Merlin Morgenstern wrote: sounds logical. I just changed it to the following, restarted and reindexed with commit:

Re: strip html from data

2011-07-25 Thread Markus Jelsma
charFilters are executed first regardless of their position in the analyzer. On Monday 25 July 2011 17:53:59 Mike Sokolov wrote: I think you need to list the charfilter earlier in the analysis chain; before the tokenizer. Probably Solr should tell you this... -Mike On 07/25/2011 09:03

Re: dih fetching but not adding records to index

2011-07-25 Thread abhayd
hi thanks for the response I am aware of post.sh but i wanted to make use of dih and scheduling. We can not use cron due to some other issues. So was thinking of using scheduling Data import -- View this message in context:

Re: strip html from data

2011-07-25 Thread Mike Sokolov
Hmm - I'm not sure about that; see https://issues.apache.org/jira/browse/SOLR-2119 On 07/25/2011 12:01 PM, Markus Jelsma wrote: charFilters are executed first regardless of their position in the analyzer. On Monday 25 July 2011 17:53:59 Mike Sokolov wrote: I think you need to list the

Re: Ignore records that are missing a value in a field

2011-07-25 Thread Stefan Matheis
So, in other words you want to exclude those records? :) FilterQuery with a Range-Query could help: fq=-field:[* TO *] Regards Stefan On 25.07.2011 17:53, Brian Lamb wrote: Hi all, I have an optional field called common_names. I would like to keep this field optional but at the same,

Re: dih fetching but not adding records to index

2011-07-25 Thread Gora Mohanty
On Mon, Jul 25, 2011 at 9:36 PM, abhayd ajdabhol...@hotmail.com wrote: hi thanks for the response I am aware of post.sh but i wanted to make use of dih and scheduling. We can not use cron due to some other issues. So was thinking of using scheduling Data import [...] OK, though in that

Re: strip html from data

2011-07-25 Thread Markus Jelsma
I've seen that issue too and read comments on the list yet i've never had trouble with the order, don't know what's going on. Check this analyzer, i've moved the charFilter to the bottom: <analyzer type="index"> <tokenizer class="solr.WhitespaceTokenizerFactory"/> <filter

Re: Wiki Error JSON syntax

2011-07-25 Thread Stefan Matheis
Hi Remy, so you may open an Issue for this on the github Project? i mean .. just creating another client, because i have one problem, does not sound like a good plan? Regards Stefan On 25.07.2011 10:56, Remy Loubradou wrote: Hey Stephan, Thanks, but I already used this solr client and I

multivalue or denormalise

2011-07-25 Thread abhayd
hi, what I want to do is get title_1 if NOW is between st_date_1 and end_date_1, and at the same time get title_2 if NOW is between st_date_2 and end_date_2, and so on. At present I have a schema like this 'denorm'. I can't figure out a single Solr query to do this. field name=id

Re: strip html from data

2011-07-25 Thread Mike Sokolov
Hmm that looks like it's working fine. I stand corrected. On 07/25/2011 12:24 PM, Markus Jelsma wrote: I've seen that issue too and read comments on the list yet i've never had trouble with the order, don't know what's going on. Check this analyzer, i've moved the charFilter to the bottom: