Re: Looking for a MappingCharFilter that accepts regular expressions

Weiwei Wang Sun, 13 Dec 2009 19:15:01 -0800

I need the source file not the patch file, where can i download it?

On Mon, Dec 14, 2009 at 1:15 AM, Koji Sekiguchi <[email protected]> wrote:


> Koji Sekiguchi wrote:
>
>> Paul Taylor wrote:
>>
>>> I want my search to treat 'No. 1' and 'No.1' the same, because in our
>>> context its one token I want 'No. 1' to become 'No.1',  I need to do this
>>> before tokenizing because the tokenizer would split one value into two terms
>>> and one into just one term. I already use a NormalizeMapFilter to map &' to
>>> 'and' but I think it only takes literal text and I need to
>>>
>>> 1. be case insensitive (but lowercasefilter is only applied after
>>> tokenizing)
>>>
>>> 2. cope with all numbers e.g no. 109
>>>
>>> So I was going to subclass BaseCharFilter and do my matches with a
>>> regular expression like ([Nn]+[Oo]+\\.) ([0-9]+) but I'm struggling to
>>> understand the offset methods you have to do once you get a match. Has
>>> anyone already got a regular expression Charfilter OR am I approaching this
>>> all wrong
>>>
>>> thanks Paul
>>>
>>>
>>>
>>> ---------------------------------------------------------------------
>>> To unsubscribe, e-mail: [email protected]
>>> For additional commands, e-mail: [email protected]
>>>
>>>
>>>  Hi Paul,
>>
>> I've written a patch for this kind of purpose. See:
>>
>> https://issues.apache.org/jira/browse/SOLR-1653
>>
>> Koji
>>
>>  Oops. I thought this is solr-user list, but it was java-user. :-D
>
>
> Koji
>
> --
> http://www.rondhuit.com/en/
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [email protected]
> For additional commands, e-mail: [email protected]
>
>


-- 
Weiwei Wang
Alex Wang
王巍巍
Room 403, Mengmin Wei Building
Computer Science Department
Gulou Campus of Nanjing University
Nanjing, P.R.China, 210093

Homepage: http://cs.nju.edu.cn/rl/weiweiwang

Re: Looking for a MappingCharFilter that accepts regular expressions

Reply via email to