Re: full text search in multiple fields

2010-11-13 Thread PeterKerk
All helpful responses, so thank you for that. I checked the url: http://wiki.apache.org/solr/DisMaxQParserPlugin When I execute this url on my local machine: http://localhost:8983/solr/select/?q=videoqt=defType=dismaxqf=features^20.0+text^0.3 I get the error: unknown handler: defType=dismax

Re: Searching problem

2010-11-13 Thread Govind Kanshi
You must spend time on - http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters On Sat, Nov 13, 2010 at 10:42 AM, M.Rizwan griz...@gmail.com wrote: Hi All, Do you have any idea that why solr search for panasonic* ( without quotes ) does not match panasonic ? If we search panasonic

Re: A Newbie Question

2010-11-13 Thread Govind Kanshi
Another pov you might want to think about - what kind of search you want. Just plain - full text search or there is something more to those text files. Are they grouped in folders? Do the folders imply certain kind of grouping/hierarchy/tagging? I recently was trying to help somebody who had

Re: How to Facet on a price range

2010-11-13 Thread Govind Kanshi
Kudos to Jan's pre-compute option and gwk's range facet answer. On Wed, Nov 10, 2010 at 2:52 PM, Geert-Jan Brits gbr...@gmail.com wrote: Ah I see: like you said it's part of the facet range implementation. Frontend is already working, just need the 'update-on-slide' behavior. Thanks

Using jetty's GzipFilter in the example solr.war

2010-11-13 Thread Jay Luker
Hi, I thought I'd try turning on gzip compression but I can't seem to get jetty's GzipFilter to actually compress my responses. I unpacked the example solr.war and tried adding variations of the following to the web.xml (and then rejar-ed), but as far as I can tell, jetty isn't actually

Re: filtering or getting accurate crawling results

2010-11-13 Thread Erick Erickson
I'm pretty sparse on my Nutch knowledge, you'd probably get more knowledgable answers on the Nutch mailing list. Best Erick On Fri, Nov 12, 2010 at 11:52 PM, Dennis Gearon gear...@sbcglobal.netwrote: Actually, can Nutch be used for SCRAPING, not crawling? I don't just want the url, I want

RE: Searching problem

2010-11-13 Thread Steven A Rowe
Hi Riz, You likely have some form of stemming in your indexing analysis chain - this may cause panasonic, e.g., to be indexed as panason. (The remainder of this email assumes that this is true.) When you search for panasonic, presumably with the same stemming filter in your query analysis

RE: Looking for help with Solr implementation

2010-11-13 Thread Thumuluri, Sai
Please refrain using this mailing group for soliciting and take it offline -Original Message- From: AC [mailto:acanuc...@yahoo.com] Sent: Sat 11/13/2010 1:12 AM To: solr-user@lucene.apache.org Subject: Re: Looking for help with Solr implementation Hey Jean-Sebastien, Thanks for the

Re: Looking for help with Solr implementation

2010-11-13 Thread Jean-Sebastien Vachon
Yes we did. Sorry for this. We both made the same error replying to the mailing list. - Original Message - From: Thumuluri, Sai sai.thumul...@verizonwireless.com To: solr-user@lucene.apache.org Sent: Saturday, November 13, 2010 8:41 AM Subject: RE: Looking for help with Solr

Re: full text search in multiple fields

2010-11-13 Thread Ahmet Arslan
I checked the url: http://wiki.apache.org/solr/DisMaxQParserPlugin When I execute this url on my local machine: http://localhost:8983/solr/select/?q=videoqt=defType=dismaxqf=features^20.0+text^0.3 I get the error: unknown handler: defType=dismax So where can I download that handler and

Re: facetting when using field collapsing

2010-11-13 Thread Yonik Seeley
On Wed, Nov 10, 2010 at 9:12 AM, Lukas Kahwe Smith m...@pooteeweet.org wrote: The above wiki page seems to be out of date. Reading the comments in https://issues.apache.org/jira/browse/SOLR-236 it seems like group should be replaced with collapse. The Wiki page is not expansive, but I've

Re: facetting when using field collapsing

2010-11-13 Thread Lukas Kahwe Smith
On 13.11.2010, at 10:30, Yonik Seeley wrote: On Wed, Nov 10, 2010 at 9:12 AM, Lukas Kahwe Smith m...@pooteeweet.org wrote: The above wiki page seems to be out of date. Reading the comments in https://issues.apache.org/jira/browse/SOLR-236 it seems like group should be replaced with

Re: facetting when using field collapsing

2010-11-13 Thread Yonik Seeley
On Sat, Nov 13, 2010 at 10:46 AM, Lukas Kahwe Smith m...@pooteeweet.org wrote: On 13.11.2010, at 10:30, Yonik Seeley wrote: On Wed, Nov 10, 2010 at 9:12 AM, Lukas Kahwe Smith m...@pooteeweet.org wrote: The above wiki page seems to be out of date. Reading the comments in

RE: IndexableBinaryStringTools (was FieldCache)

2010-11-13 Thread Steven A Rowe
Hi Mathias, I assume that the char[] returned form IndexableBinaryStringTools.encode is encoded in UTF-8 again and then stored. At some point the information is lost and cannot be recovered. Can you give an example? This should not happen. My character array returned by

Re: IndexableBinaryStringTools (was FieldCache)

2010-11-13 Thread Yonik Seeley
On Sat, Nov 13, 2010 at 1:50 PM, Steven A Rowe sar...@syr.edu wrote: Looks to me like the returned value is in a Solr-internal form of XML character escaping: \u is represented as #0; and \u0008 is represented as #8;.  (The escaping code is in

RE: IndexableBinaryStringTools (was FieldCache)

2010-11-13 Thread Steven A Rowe
On 11/13/2010 at 2:04 PM, Yonik Seeley wrote: n Sat, Nov 13, 2010 at 1:50 PM, Steven A Rowe sar...@syr.edu wrote: Looks to me like the returned value is in a Solr-internal form of XML character escaping: \u is represented as #0; and \u0008 is represented as #8;.  (The escaping code is in

DIH for multilingual index multiValued field?

2010-11-13 Thread Andy
I have a MySQL table: CREATE TABLE documents ( id INT NOT NULL AUTO_INCREMENT, language_code CHAR(2), tags CHAR(30), text TEXT, PRIMARY KEY (id) ); I have 2 questions about Solr DIH: 1) The langauge_code field indicates what language the text

Re: DIH for multilingual index multiValued field?

2010-11-13 Thread Imran
I think a custom transformer would be of help in these scenarios http://wiki.apache.org/solr/DIHCustomTransformer http://wiki.apache.org/solr/DIHCustomTransformerCheers -- Imran On Sat, Nov 13, 2010 at 8:55 PM, Andy angelf...@yahoo.com wrote: I have a MySQL table: CREATE TABLE documents (

Re: DIH for multilingual index multiValued field?

2010-11-13 Thread Ahmet Arslan
For (1) you probably need to write a custom transformer. Something like: public Object transformRow(MapString, Object row) { String language_code = row.get(language_code); String text = row.get(text); if(en.equals(language_code)) row.put(text_en, text); else if

Re: DIH for multilingual index multiValued field?

2010-11-13 Thread Ken Stanley
On Sat, Nov 13, 2010 at 4:56 PM, Ahmet Arslan iori...@yahoo.com wrote: For (1) you probably need to write a custom transformer. Something like: public Object transformRow(MapString, Object row)     { String language_code = row.get(language_code); String text = row.get(text);

Re: DIH for multilingual index multiValued field?

2010-11-13 Thread Ken Stanley
On Sat, Nov 13, 2010 at 5:59 PM, Ken Stanley doh...@gmail.com wrote:   CREATE TABLE documents (       id INT NOT NULL AUTO_INCREMENT,       language_code CHAR(2),       tags CHAR(30),       text TEXT,       PRIMARY KEY (id)   ); I apologize, but I couldn't leave the typo in my last post

Re: Using jetty's GzipFilter in the example solr.war

2010-11-13 Thread Kiwi de coder
try to put u filter on top of web.xml (instead of middle or bottom), i try this few day and it just only a simple solution (not sure is a spec to put on top or is a bug) hope this help. kiwi On Sat, Nov 13, 2010 at 8:11 PM, Jay Luker lb...@reallywow.com wrote: Hi, I thought I'd try turning