RE: Indexing nouns only - UIMA vs. OpenNLP

2013-02-01 Thread Kai Gülzau
Hi Lance, About removing non-nouns: the OpenNLP patch includes two simple TokenFilters for manipulating terms with payloads. The FilterPayloadFilter lets you keep or remove terms with given payloads. yes, I used this already in the schema.xml filter class=solr.FilterPayloadsFilterFactory

Indexing nouns only - UIMA vs. OpenNLP

2013-01-31 Thread Kai Gülzau
Hi, I am stuck trying to index only the nouns of german and english texts. (very similar to http://wiki.apache.org/solr/OpenNLP#Full_Example) First try was to use UIMA with the HMMTagger: processor class=org.apache.solr.uima.processor.UIMAUpdateRequestProcessorFactory lst name=uimaConfig

RE: Indexing nouns only - UIMA vs. OpenNLP

2013-01-31 Thread Kai Gülzau
To: solr-user@lucene.apache.org Subject: Indexing nouns only - UIMA vs. OpenNLP Hi, I am stuck trying to index only the nouns of german and english texts. (very similar to http://wiki.apache.org/solr/OpenNLP#Full_Example) First try was to use UIMA with the HMMTagger: processor class

Re: Indexing nouns only - UIMA vs. OpenNLP

2013-01-31 Thread Lance Norskog
(some OOM here while testing with 1GB via Analyzer Admin GUI)? Regards, Kai Gülzau -Original Message- From: Kai Gülzau [mailto:kguel...@novomind.com] Sent: Thursday, January 31, 2013 2:19 PM To: solr-user@lucene.apache.org Subject: Indexing nouns only - UIMA vs. OpenNLP Hi, I am