hi :)
I'm having an interesting problem with my data. in general, I want the
results of the WordDelimiterFilter for better matching, but there are
times when it's just too aggressive. for example:
boys2men => boys 2 men (good)
p!nk => pnk (maybe)
!!! => (nothing - bad)
there's a special place for bands who name themselves just punctuation
marks :)
anyway, one way around this is synonyms. but if I do that then I need
to run the synonym filter twice. the first pass, before the
WordDelimiterFilter, might expand
!!! => chk chk chk
p!nk => pink
while the next would need to run after the WordDelimiterFilter for
boys 2 men => boyz II men
I'd really like to avoid multiple passes (and multiple synonym files) if
at all possible, but that's the solution I'm faced with currently...
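
to make the two-pass setup concrete, here's a rough Python sketch of what I mean. none of this is Lucene/Solr code: toy_word_delimiter, apply_synonyms and two_pass are made-up names, and the toy delimiter only mimics the three examples above (drop punctuation, split where letters meet digits), not the real filter's rules.

```python
import re

def toy_word_delimiter(token):
    """Toy stand-in (assumption): drop punctuation, then split letters from digits."""
    stripped = re.sub(r"[^A-Za-z0-9]", "", token)
    return re.findall(r"[A-Za-z]+|[0-9]+", stripped)

def apply_synonyms(tokens, synonyms):
    """Replace the longest matching run of tokens with its synonym."""
    out = []
    i = 0
    while i < len(tokens):
        # Try the longest candidate phrase starting at i first.
        for j in range(len(tokens), i, -1):
            key = " ".join(tokens[i:j])
            if key in synonyms:
                out.extend(synonyms[key].split())
                i = j
                break
        else:
            # No phrase starting at i matched; keep the token unchanged.
            out.append(tokens[i])
            i += 1
    return out

# First synonym file: must see the raw tokens, punctuation included.
PRE_SYNONYMS = {"!!!": "chk chk chk", "p!nk": "pink"}
# Second synonym file: must see the tokens after delimiting.
POST_SYNONYMS = {"boys 2 men": "boyz II men"}

def two_pass(text):
    tokens = apply_synonyms(text.split(), PRE_SYNONYMS)
    tokens = [part for tok in tokens for part in toy_word_delimiter(tok)]
    return apply_synonyms(tokens, POST_SYNONYMS)

print(two_pass("!!!"))       # ['chk', 'chk', 'chk']
print(two_pass("p!nk"))      # ['pink']
print(two_pass("boys2men"))  # ['boyz', 'II', 'men']
```

the point is that neither synonym map works in the other slot: the first needs the punctuation still there, the second needs the split tokens.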
unless an 'expand' option were added to the WordDelimiterFilter that
keeps the original token alongside the delimited output, in which case
I'd have
p!nk => p!nk pnk
after it runs, so I could just apply the synonyms once. or maybe
there's another solution I'm missing.
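
here's roughly the behaviour I have in mind for 'expand', again as a Python sketch rather than anything from the real filter. toy_word_delimiter_expand and its expand flag are hypothetical names, the splitting rule is a simplification that only matches my examples, and it ignores token positions entirely.

```python
import re

def toy_word_delimiter_expand(token, expand=False):
    """Toy stand-in (assumption): drop punctuation, split letters from digits.

    With expand=True the original token is kept in front of the delimited
    parts whenever delimiting changed anything.
    """
    stripped = re.sub(r"[^A-Za-z0-9]", "", token)
    parts = re.findall(r"[A-Za-z]+|[0-9]+", stripped)
    if expand and parts != [token]:
        return [token] + parts
    return parts

print(toy_word_delimiter_expand("p!nk", expand=True))      # ['p!nk', 'pnk']
print(toy_word_delimiter_expand("!!!", expand=True))       # ['!!!']
print(toy_word_delimiter_expand("boys2men", expand=True))  # ['boys2men', 'boys', '2', 'men']
print(toy_word_delimiter_expand("!!!"))                    # []
```

with the original token still in the stream, a single synonym pass afterwards could match both 'p!nk' and 'boys 2 men' from one file.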
would it be difficult (or desirable) to add an expand option?
--Geoff