I was wondering whether there is a way to use the Lucene API directly to extract terms from a given string. I have a text document for which I need a term frequency vector (after stemming, stopword removal, and synonym checks). The result should be the terms and their frequencies.
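To make the requirement above concrete, here is a minimal plain-Java sketch of the kind of output I have in mind. It does not use Lucene at all: the class name `TermFrequencySketch`, the tiny stopword list, and the suffix-stripping `stem` method are placeholders made up for illustration (a real stemmer such as Porter's would behave differently), and synonym handling is left out entirely.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

public class TermFrequencySketch {

    // Placeholder stopword list (an assumption, for illustration only).
    private static final Set<String> STOPWORDS = new HashSet<>(
            Arrays.asList("a", "an", "and", "the", "of", "to", "is", "are", "in"));

    // Deliberately naive stand-in for a real stemmer: strips the first
    // matching suffix, as long as at least three characters remain.
    public static String stem(String token) {
        for (String suffix : new String[] {"ing", "ed", "es", "s"}) {
            if (token.endsWith(suffix) && token.length() > suffix.length() + 2) {
                return token.substring(0, token.length() - suffix.length());
            }
        }
        return token;
    }

    // Lowercase, split on non-alphanumeric characters, drop stopwords,
    // stem each remaining token, and count occurrences per stemmed term.
    public static Map<String, Integer> termFrequencies(String text) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (String raw : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (raw.isEmpty() || STOPWORDS.contains(raw)) {
                continue;
            }
            counts.merge(stem(raw), 1, Integer::sum);
        }
        return counts;
    }

    public static void main(String[] args) {
        // Prints {index=2, document=2}
        System.out.println(
                termFrequencies("Indexing the indexed documents and the document"));
    }
}
```

What I am hoping for is the same terms-plus-counts map, but with the tokenizing, stopword removal, stemming, and synonym steps done by a proper analysis library rather than by hand-rolled code like this.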
Is it possible to get this using any Lucene API? (As far as I can see, Lucene itself also has to stem, remove stopwords, handle synonyms, etc. before indexing.) Or is there any Java project that would help me with this?