hi :)

I'm having an interesting problem with my data. in general, I want the results of the WordDelimiterFilter for better matching, but there are times when it's just too aggressive. for example

  boys2men => boys 2 men (good)
  p!nk     => pnk (maybe)
  !!!      => (nothing - bad)

there's a special place for bands who name themselves just punctuation marks :)

anyway, one way around this is synonyms. but if I do that then I need to run the synonym filter multiple times. the first might expand

  !!!  => chk chk chk
  p!nk => pink

while the next would need to run after the WordDelimiterFilter for

  boys 2 men => boyz II men

I'd really like to avoid multiple passes (and multiple synonym files) if at all possible, but that's the solution I'm faced with currently...

unless an 'expand' option were added to the WordDelimiterFilter, in which case I'd have

  p!nk => p!nk pnk

after it runs, so I could just apply the synonyms once. or maybe there's another solution I'm missing.

would it be difficult (or desirable) to add an expand option?

--Geoff

Reply via email to