Re: Need tokenization that finds part of stringvalue

2012-03-06 Thread PeterKerk
@iorixxx: Sorry it took so long; I had some difficulties upgrading to 3.5.0. It still doesn't work. Here's what I have now: I copied text_general_rev from http://svn.apache.org/repos/asf/lucene/dev/trunk/solr/example/solr/conf/schema.xml to my schema.xml: fieldType name=text_general_rev

Re: Need tokenization that finds part of stringvalue

2012-03-06 Thread PeterKerk
edismax did the trick! Thanks!

Re: Need tokenization that finds part of stringvalue

2012-03-04 Thread PeterKerk
@iorixxx I tried making my title_search field of type text_rev and tried adding the ReversedWildcardFilterFactory to my existing text type, but in both cases no luck. @Erick Erickson One frequent method of doing leading and trailing wildcards is to use ngrams (as distinct from edgengrams). That in

Re: Need tokenization that finds part of stringvalue

2012-03-04 Thread Ahmet Arslan
@iorixxx I tried making my title_search of type text_rev and tried adding the ReversedWildcardFilterFactory to my existing text type, but in both cases no luck. I was able to perform *query* types of searches with the Solr 3.5 distro. Here is what I did: Download apache-solr-3.5.0. Edit

Re: Need tokenization that finds part of stringvalue

2012-03-01 Thread Erick Erickson
Right, there's nothing in Solr that I know of that'll help here. How would a tokenizer understand that smartphone should be smart phone? There's no general solution for this issue. You can do domain-specific solutions with synonyms for instance, or some other word list that contains terms you're

Re: Need tokenization that finds part of stringvalue

2012-03-01 Thread PeterKerk
I think I didn't explain myself clearly: I need to be able to find substrings. So it's not that I'd expect Solr to find synonyms, but rather to match when a piece of text contains the searched text. For example: if title holds smartphone, I want it to be found when someone types martph or smar or smart. I

Re: Need tokenization that finds part of stringvalue

2012-03-01 Thread Walter Underwood
I once used a spell checker to break up compound words. It was slow, but worked pretty well. wunder On Mar 1, 2012, at 5:53 AM, Erick Erickson wrote: Right, there's nothing in Solr that I know of that'll help here. How would a tokenizer understand that smartphone should be smart phone?

RE: Need tokenization that finds part of stringvalue

2012-03-01 Thread Dyer, James
: Thursday, March 01, 2012 9:59 AM To: solr-user@lucene.apache.org Subject: Re: Need tokenization that finds part of stringvalue I once used a spell checker to break up compound words. It was slow, but worked pretty well. wunder On Mar 1, 2012, at 5:53 AM, Erick Erickson wrote: Right, there's

Re: Need tokenization that finds part of stringvalue

2012-03-01 Thread Ahmet Arslan
if title holds smartphone I want it to be found when someone types martph or smar or smart. Peter, so you want a beginsWith/startsWith type of search? You can use wildcard search (with the star operator) for this, e.g. q=smar*. Alternatively, if your index size is not huge, you can use

Re: Need tokenization that finds part of stringvalue

2012-03-01 Thread PeterKerk
@iorixxx: yes, that is what I need. But also when it's IN the text, not necessarily at the beginning. So using the * character like q=smart* the product is found, but when I do q=*mart* it isn't. Why is that?

Re: Need tokenization that finds part of stringvalue

2012-03-01 Thread Ahmet Arslan
--- On Thu, 3/1/12, PeterKerk vettepa...@hotmail.com wrote: @iorixxx: yes, that is what I need. But also when it's

Re: Need tokenization that finds part of stringvalue

2012-03-01 Thread PeterKerk
@iorixxx: Where can I find that example schema.xml? I downloaded the latest version here: ftp://apache.mirror.easycolocate.nl//lucene/solr/3.5.0 and checked \example\example-DIH\solr\db\conf\schema.xml, but no text_rev type is defined in there. And when I find it, can I just make the title field

Re: Need tokenization that finds part of stringvalue

2012-03-01 Thread Erick Erickson
One frequent method of doing leading and trailing wildcards is to use ngrams (as distinct from edgengrams). That in combination with phrase queries might work well in this case. You also might be surprised at how little space bigrams take; give it a test and see G.. Best Erick On Thu, Mar 1,
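
To make the ngram suggestion above concrete, here is a minimal sketch in plain Python of how character n-grams can answer substring queries such as martph. It is a toy inverted index, not Solr's analysis chain: the function names are hypothetical, and the final substring check merely stands in for what a phrase query over gram positions would do.

```python
def char_ngrams(token, n):
    """Return the set of all contiguous substrings of length n in token."""
    return {token[i:i + n] for i in range(len(token) - n + 1)}


def ngram_index(titles, n):
    """Toy inverted index: map each n-gram to the titles that contain it."""
    index = {}
    for title in titles:
        for gram in char_ngrams(title.lower(), n):
            index.setdefault(gram, set()).add(title)
    return index


def substring_search(index, query, n):
    """Return titles that contain query as a substring.

    Sharing every n-gram with the query is only a necessary condition,
    so candidates are verified with a plain substring test (standing in
    for a positional/phrase check).
    """
    query = query.lower()
    grams = char_ngrams(query, n)
    if not grams:
        return set()  # a query shorter than n has no grams to look up
    candidates = set.intersection(*(index.get(g, set()) for g in grams))
    return {t for t in candidates if query in t.lower()}


if __name__ == "__main__":
    idx = ngram_index(["smartphone", "telephone", "smartwatch"], 2)
    print(substring_search(idx, "martph", 2))
```

With bigrams (n = 2), a query must be at least two characters long; shorter queries return nothing in this sketch.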

Re: Need tokenization that finds part of stringvalue

2012-03-01 Thread Ahmet Arslan
@iorixxx: Where can I find that example schema.xml? Please find text_general_rev at http://svn.apache.org/repos/asf/lucene/dev/trunk/solr/example/solr/conf/schema.xml And when I find it, can I just make the title field which currently is of text type then of text_rev type? Yes, also you
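
As background for the text_rev / ReversedWildcardFilterFactory discussion above, the following Python sketch illustrates the general idea behind reversed-token indexing: a leading-wildcard pattern like *phone becomes a cheap prefix scan over reversed terms. This is an illustration under assumptions, not Solr's implementation; the marker character and all function names are chosen for this sketch only.

```python
import bisect

# Hypothetical marker that keeps reversed terms apart from normal terms.
REVERSED_MARK = "\u0001"


def build_terms(tokens):
    """Return a sorted term list holding each token plus its marked reversal."""
    terms = set()
    for tok in tokens:
        terms.add(tok)
        terms.add(REVERSED_MARK + tok[::-1])
    return sorted(terms)


def prefix_scan(terms, prefix):
    """Return all terms starting with prefix, using binary search to seek."""
    start = bisect.bisect_left(terms, prefix)
    hits = []
    for term in terms[start:]:
        if not term.startswith(prefix):
            break
        hits.append(term)
    return hits


def leading_wildcard(terms, pattern):
    """Answer a '*suffix' pattern via a prefix scan over the reversed terms."""
    if not pattern.startswith("*") or "*" in pattern[1:]:
        raise ValueError("this sketch only handles a single leading '*'")
    reversed_prefix = REVERSED_MARK + pattern[1:][::-1]
    hits = prefix_scan(terms, reversed_prefix)
    # Strip the marker and un-reverse to recover the original tokens.
    return sorted(h[len(REVERSED_MARK):][::-1] for h in hits)


if __name__ == "__main__":
    terms = build_terms(["smartphone", "telephone", "smart"])
    print(leading_wildcard(terms, "*phone"))
```

The trade-off is a larger term dictionary (each token is stored twice) in exchange for avoiding a full scan of all terms on leading-wildcard queries.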