Multi-word synonyms are tricky.

You probably want to make sure this synonym is only expanded at index time, and not at search time. See some background in the SynonymFilterFactory section of http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters

I think the synonym approach is a fine way to search for greek letters by name; it's possible some of the new Unicode stuff in Solr 3.1 might expand greek letters too, but I think actually probably not (because you don't neccesarily want that in the general case), I think synonyms is probably your best bet. (Same for things like expanding the musical sharp or flat glyph to "sharp" or "flat", which I've considered).

On 5/27/2011 7:43 AM, Thomas Dowling wrote:
Greetings--

I'm trying to flesh out my synonyms.txt file for a couple of Solr indexes,
and I stumbled across something weird.  I added these lines to synonyms.txt:

co2, carbon dioxide
ch4, methane


The second line worked as expected: I restarted Solr, reindexed, and could
search ch4 and methane as synonyms of each other.

The first line did something weird.  Before the change, I can get results
for both CO2 and for "CARBON DIOXIDE" (just different results).  After the
change, searching CO2 got zero results, as did "CARBON DIOXIDE".  So at
least they're acting like synonyms, right?  But why in the world do they
both stop finding hits?

Pre-change:
   CO2                  225 hits
   "CARBON DIOXIDE"   130 hits
   CARBON DIOXIDE       1030 hits

Post-change:
   CO2                  0 hits
   "CARBON DIOXIDE"   0 hits
   CARBON DIOXIDE       1030 hits


Also, if I want to be able to search for Greek letters by name (alpha,
beta, etc.), is there a better way than to use synonmyms.txt?

   Δ,δ,delta


TIA

Reply via email to