Re: Correcting text at index time

2015-06-30 Thread hossmaa
Hi all Thanks for the replies. So there's no getting away from doing it on my own then... @Jack: I need to replace a whole list of shortened words... It would make a crazy regex (which I incidentally wouldn't even know how to formulate). Cheers A. -- View this message in context:

RE: Correcting text at index time

2015-06-29 Thread hossmaa
Hi Markus Thanks for the reply. I'm already using the Synonyms filter and it is working fine (i.e., when I search for customer, it also returns documents containing cst.). What the synonyms filter does not do is to actually replace the word cst. with customer in the document. Just to be clearer:

Correcting text at index time

2015-06-29 Thread hossmaa
Hi everyone I'm wondering if it's possible in Solr to correct text at indexing time, based on a synonyms-like list. This would be great for expanding undesirable abbreviations (for example, cst. instead of customer). I've been searching the Solr docs and the web quite thoroughly I believe, but

Re: solr uima and opennlp

2015-05-28 Thread hossmaa
Hi Tommaso Thanks for the quick reply! I have another question about using the Dictionary Annotator, but I guess it's better to post it separately. Cheers Andreea -- View this message in context: http://lucene.472066.n3.nabble.com/solr-uima-and-opennlp-tp4206873p4208348.html Sent from the

solr and uima dictionary annotator

2015-05-28 Thread hossmaa
Hi everyone I am using the UIMA DictionaryAnnotator to tag Solr documents. It seems to be working (I do get tags), but I get some strange behavior: 1. I am using the White Space Tokenizer both for the indexed text and for creating the dictionary. Most entries in my dictionary consist of multiple

solr uima and opennlp

2015-05-21 Thread hossmaa
Hi everyone I'm trying to plug in a new UIMA annotator into solr. What is necessary for this? Is is enough to build a Jar similarly to the ones from the uima-addons package? More specifically, are the uima-addona Jars identical to the ones found in solr's contrib folder? Thanks! Andreea --