Re: Comma delemitered words shawn in terms like one word.

abhay kumar Fri, 27 May 2011 06:25:30 -0700

Thanks I was looking exactly for this.
I needed to spli tokens based on comma.


On Fri, Jun 18, 2010 at 10:12 PM, Joe Calderon <calderon....@gmail.com>wrote:

> set generateWordParts=1 on wordDelimiter or use
> PatternTokenizerFactory to split on commas
>
>
> http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.PatternTokenizerFactory
>
>
> you can use the analysis page to see what your filter chains are going
> to do before you index
>
> /admin/analysis.jsp
>
> On Fri, Jun 18, 2010 at 6:41 AM, Vitaliy Avdeev <vavd...@sistyma.net>
> wrote:
> > Hello.
> > In indexing text I have such string John,Mark,Sam. Then I looks at it in
> > TermVectorComponent it looks like this johnmarksam.
> >
> > I am using this type for storing data
> >
> >    <fieldType name="textTight2" class="solr.TextField"
> > positionIncrementGap="100" >
> >      <analyzer>
> >    <tokenizer class="solr.HTMLStripWhitespaceTokenizerFactory"/>
> >        <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"
> > ignoreCase="true" expand="false"/>
> >        <filter class="solr.StopFilterFactory" ignoreCase="true"
> > words="stopwords.txt"/>
> >        <filter class="solr.WordDelimiterFilterFactory"
> > generateWordParts="0" generateNumberParts="0" catenateWords="1"
> > catenateNumbers="1" catenateAll="0"/>
> >        <filter class="solr.LowerCaseFilterFactory"/>
> >        <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
> >      </analyzer>
> >    </fieldType>
> >
> > What filter I need to use to get John Mark Sam as different words?
> >
>



-- 
Thanks and Regards
Abhay Kumar Singh

Re: Comma delemitered words shawn in terms like one word.

Reply via email to