Re: Sorting in different languages

2008-05-29 Thread Uwe Klosa
On Wed, May 28, 2008 at 11:41 PM, Alexander Ramos Jardim [EMAIL PROTECTED] wrote: Well, One solution that I can see for this problem is having different indexes for each language. In which way would that solve the sorting problem?

Re: SolrTrunk start error

2008-05-29 Thread Eason . Lee
Thanks for your answer I nearly update my Solr everyday It was always OK But when I updated Solr 2 days ago , the errors came out btw, I have set the heap size to 1G,but the problem remains 在08-5-28,Shalin Shekhar Mangar [EMAIL PROTECTED] 写道: Ok, I see you're getting a OutOfMemoryError.

Filter query and facet counts

2008-05-29 Thread Umar Shah
Hi, I have a problem wherein I have field 'type' which can have value A, B C, I want to return facet count for each type but need only show one type of result ( say with max count) so if i have following counts type:A = 300 type:B = 400 type:D = 100 I should only show type:B results

Re: new user: some questions about parameters and query syntax

2008-05-29 Thread Bram de Jong
Hello Eric (and others) snip repeating the parameter: sort=field1,field2 desc,field3 but facet.field=field1facet.field=field2 This is pretty confusing to first-hand users! :-) Yeah, it is confusing. But we have to be careful with order. I don't believe you can rely on the order of same

Re: Filter query and facet counts

2008-05-29 Thread Shalin Shekhar Mangar
No, I don't think it is possible to do that with one query. You'll need to make two calls to Solr: 1. Without fq=type:B -- just to get all type facets 2. With fq to get the results. On Thu, May 29, 2008 at 2:12 PM, Umar Shah [EMAIL PROTECTED] wrote: Hi, I have a problem wherein I have field

schema for literal string, case insensitive

2008-05-29 Thread Bram de Jong
hello all, a while ago I was looking into making the schema for my (rather rich) data set, and I wanted to have a field for username. I would need a case insensitive string for that, but literal (no tokenizing, ...). how can I do this within the Solr schema definition? - bram --

Re: schema for literal string, case insensitive

2008-05-29 Thread Rob Casson
bram, you'll want to look at the KeywordTokenizerFactory (which doesn't actually tokenize), and then use the LowerCaseFilterFactory. the schema in the example has a fieldType called 'alphaOnlySort' that should get you started. cheers, rob On Thu, May 29, 2008 at 6:21 AM, Bram de Jong [EMAIL

Re: Announcement of Solr Javascript Client

2008-05-29 Thread Nik Krimm
Hi Matthias: Glad to hear of your efforts. A couple of initial comments... I'm cautious about your decision to build on top of jQuery. My understanding is that you're planning to build a set of client-side widgets that would be easily embeddable in an existing web-site. Building on a

Re: Announcement of Solr Javascript Client

2008-05-29 Thread Matthias Epheser
Original-Nachricht Datum: Thu, 29 May 2008 09:36:37 -0500 Von: Nik Krimm [EMAIL PROTECTED] An: solr-user@lucene.apache.org solr-user@lucene.apache.org Betreff: Re: Announcement of Solr Javascript Client Hi Matthias: Glad to hear of your efforts. A couple of initial

Want to drill down facet search result

2008-05-29 Thread Rusli Ruslakall
Hi, I index something like this: doc field name=nameCompany A/field field name=cat123/field field name=cat456/field field name=cat789/field /doc doc field name=nameCompany B/field field name=cat129/field field name=cat123/field

Re: Want to drill down facet search result

2008-05-29 Thread Rusli Ruslakall
Hi again, searched forever before posting and of course I found it shortly after :) Can use facet.prefix, beautiful! On Thu, May 29, 2008 at 3:43 PM, Rusli Ruslakall [EMAIL PROTECTED] wrote: Hi, I index something like this: doc field name=nameCompany A/field field

Re: Announcement of Solr Javascript Client

2008-05-29 Thread Greg Ludington
Building on a library like jQuery (which is a great lib) opens the door to some hairy namespacing conflicts with existing libraries (prototype and moo, for instance), or handcoded javascript that may exist on the current site. This is actually one of the areas where jQuery offers

Re: Sorting in different languages

2008-05-29 Thread Otis Gospodnetic
Hi, I don't have a very concrete suggestion for this, but maybe this will lead you in the right direction: http://java.sun.com/javase/6/docs/api/java/text/Collator.html http://java.sun.com/javase/6/docs/api/java/text/spi/CollatorProvider.html You may also wish to bring this up on the Lucene

Re: Announcement of Solr Javascript Client

2008-05-29 Thread Matthew Runo
Wow. This is really pretty cool. You're much further along than I thought you were! I'd love to see this in as an 'official' Solr client. Thanks! Matthew Runo Software Developer Zappos.com 702.943.7833 On May 29, 2008, at 8:15 AM, Matthias Epheser wrote: The server was rebooted yesterday

Re: Announcement of Solr Javascript Client

2008-05-29 Thread Otis Gospodnetic
I just had a look at the demo and reeeally like it! I didn't pay enough attention to this thread, though. Is the main concern that by having a Solr search webapp that is really all in UI and uses your JS library, the backend Solr server is directly exposed and thus somebody could peek in the

Re: Announcement of Solr Javascript Client

2008-05-29 Thread Cam Bazz
I have done something similar and I am using a search servlet that will forward the request to solr tru commons htclient. Maybe it could be a solution to DoS, although it is still possible. Best. -Cam Bazz On Thu, May 29, 2008 at 8:04 PM, Otis Gospodnetic [EMAIL PROTECTED] wrote: I just had

Re: Sorting in different languages

2008-05-29 Thread Uwe Klosa
Thanks for the hints. I have been aware of the Collator. Actually a colleague of mine has written a Collator based sorting Class for lucene. See: https://issues.apache.org/jira/browse/LUCENE-943. This was almost 2 years ago and I only wanted to know if there is already a solution in Solr 1.3 or

Re: Want to drill down facet search result

2008-05-29 Thread Yonik Seeley
On Thu, May 29, 2008 at 12:22 PM, Rusli Ruslakall [EMAIL PROTECTED] wrote: searched forever before posting and of course I found it shortly after :) Can use facet.prefix, beautiful! You can also constrain both results and facets to any arbitrary query via fq=myquery -Yonik On Thu, May 29,

How to describe 2 entities in dataConfig for the DataImporter?

2008-05-29 Thread Julio Castillo
I have 2 dB tables unrelated to each other that I want to index. I have tried 2 approaches for specifying them in my data-config.xml file. None of them seem to work (it seems I can only get data for the first one listed). CASE 1) dataConfig document name=test entity name=owners pk=id

Re: new user: some questions about parameters and query syntax

2008-05-29 Thread Bram de Jong
On Thu, May 29, 2008 at 6:40 PM, Otis Gospodnetic [EMAIL PROTECTED] wrote: I haven't been paying close attention to the uniformity of URL parameters, but if there is room for making them more uniform (e.g. always use singular, always use comma as a delimiting character, etc.) without hurting

Re: ClassCastException trying to use distributed search

2008-05-29 Thread Grégoire Neuville
Hi all, I must now apologize ; the fault was entirely mine : I was shaping the Solr interrogation URL the wrong way (the first parameter of my request wasn't 'shards', and this produced the bug). All is working fine now. Thanks for your quick answers, Grégoire. 2008/5/29 Noble Paul നോബിള്‍

Re: Announcement of Solr Javascript Client

2008-05-29 Thread Matthias Epheser
Greg Ludington schrieb: Building on a library like jQuery (which is a great lib) opens the door to some hairy namespacing conflicts with existing libraries (prototype and moo, for instance), or handcoded javascript that may exist on the current site. This is actually one of the areas where

Re: Announcement of Solr Javascript Client

2008-05-29 Thread Matthias Epheser
Otis Gospodnetic schrieb: I just had a look at the demo and reeeally like it! I didn't pay enough attention to this thread, though. Is the main concern that by having a Solr search webapp that is really all in UI and uses your JS library, the backend Solr server is directly exposed and thus

Re: How to describe 2 entities in dataConfig for the DataImporter?

2008-05-29 Thread Shalin Shekhar Mangar
Hi Julio, The first data-config is correct. You're running DataImportHandler in debug mode which creates only the first 10 documents by default. You can also add count=N to index only the first N documents. But this is intended only for debugging purposes. If you want to do a full-import just

Re: ClassCastException trying to use distributed search

2008-05-29 Thread Grégoire Neuville
(the first parameter of my request wasn't 'shards', and this produced the bug) Wrong. The problem was that I was pointing, in the 'shards' parameter, to a Solr 1.2 installation (which is furthermore sharing a single index with the new Solr 1.3) 2008/5/29 Grégoire Neuville [EMAIL PROTECTED]: Hi

Search query optimization

2008-05-29 Thread Yongjun Rong
Hi, I have a question about how the lucene query parser. For example, I have query A AND B AND C. Will lucene extract all documents satisfy condition A in memory and then filter it with condition B and C? or only the documents satisfying A AND B AND C will be put into memory? Is there any

Re: Search query optimization

2008-05-29 Thread Yonik Seeley
On Thu, May 29, 2008 at 4:05 PM, Yongjun Rong [EMAIL PROTECTED] wrote: I have a question about how the lucene query parser. For example, I have query A AND B AND C. Will lucene extract all documents satisfy condition A in memory and then filter it with condition B and C? No, Lucene will try

RE: Search query optimization

2008-05-29 Thread Yongjun Rong
Hi Yonik, Thanks for your quick reply. I'm very new to the lucene source code. Can you give me a little more detail explaination about this. Do you think it will save some memory if docnum = find_match(A) docnum = find_match(B) and put B in the front of the AND query like B AND A AND C? How

Re: Search query optimization

2008-05-29 Thread Walter Underwood
The people working on Lucene are pretty smart, and this sort of query optimization is a well-known trick, so I would not worry about it. A dozen years ago at Infoseek, we checked the count of matches for each term in an AND, and evaluated the smallest one first. If any of them had zero matches,

Re: Want to drill down facet search result

2008-05-29 Thread Jón Helgi Jónsson
Thanks for that, I looked into fq and it will definatly help when I drill into zip codes. However I'm still having some issues, facet.prefix only got me so far because sometimes the facet is the second word in the field. Also I have another question with this example: doc field

RE: How to describe 2 entities in dataConfig for the DataImporter?

2008-05-29 Thread Julio Castillo
Thanks Shalin, I tried putting everything under the same document (two different unrelated entities), and got a bit further. My problem now appears to be both of them stepping on each other due to id conflicts. Currently my id is defined in my schema as field name=id type=long indexed=true

Relevancy Issue - How do I make it work?

2008-05-29 Thread Tim Christensen
Hi, This is my first post. I have been working with Lucene for about 4 weeks and Solr for just about 10 days. We are going to convert our site search over to Solr as soon as we figure out some of the nuances. As I was testing out the synonyms features to decide how we could best use it,

Re: Relevancy Issue - How do I make it work?

2008-05-29 Thread Yonik Seeley
field norms of un-boosted fields are normally less than 1 (it's a factor that weights larger fields less). The index-time boost is also multiplied into this factor though. Given that your first doc had a huge norm, it looks like the document or field was boosted at index time? -Yonik On Thu, May

Re: Relevancy Issue - How do I make it work?

2008-05-29 Thread Tim Christensen
Yonik, Thank you for the response. You are correct, regular (non-accessory) products are boosted '2.0' at index time. However both items the non ipod item and the ipod would have received the initial boost on the same fields since they are both non-accessory items. Is your comment still

Re: Solr indexing configuration help

2008-05-29 Thread Gaku Mak
Hi Yonik and others, I'm getting this java error after switching to JVM 1.6.0_3. This error occurs after the stress test has been going for a while and failed at 12K docs level and at 18K again. Am I doing something wrong? Please help! Thanks! # # An unexpected error has been detected by

Re: Solr indexing configuration help

2008-05-29 Thread Yonik Seeley
It's most likely a 1) hardware issue: bad memory OR 2) incompatible libraries (most likely libc version for the JVM). If you have another box around, try that. -Yonik On Thu, May 29, 2008 at 9:51 PM, Gaku Mak [EMAIL PROTECTED] wrote: Hi Yonik and others, I'm getting this java error after

Re: Solr indexing configuration help

2008-05-29 Thread Gaku Mak
Looking further at the java error, those crashes are mostly related to GC. VM_Operation (0x41b429e0): parallel gc failed allocation, mode: safepoint, requested by thread 0x2aab1988c400 I'm following the http://java.sun.com/javase/6/webnotes/trouble/TSG-VM/html/gbyzo.html and see if

Re: ClassCastException trying to use distributed search

2008-05-29 Thread Noble Paul നോബിള്‍ नोब्ळ्
This comment for the benefit of who is using distributed search: The protocol of communication has been xml for distributed search. For a good part of 1.3. It is now changed to a custom binary format (SOLR-486 ). So each shard participating in a distributed search must be using the same protocol.

Re: How to describe 2 entities in dataConfig for the DataImporter?

2008-05-29 Thread Shalin Shekhar Mangar
That will happen only if id is the uniqueKey in Solr and the id coming from both your tables have same values. In that case, they will overwrite each other. You will need a separate uniqueKey (on other than id field). On Fri, May 30, 2008 at 6:34 AM, Julio Castillo [EMAIL PROTECTED] wrote:

Re: How to describe 2 entities in dataConfig for the DataImporter?

2008-05-29 Thread Noble Paul നോബിള്‍ नोब्ळ्
Consider constructing the id concatenating an extra string for each document . You can construct that field using the TeplateTransformer. in the entity owners keep the id as field column=id name=id template=owners-${owners.id}/ and in vets field column=id name=id template=vets-${vets.id}/ or