How to export the list of terms indexed in Solr?

2015-04-29 Thread brent3600
We are indexing collections of documents (files) with SOLR, and would like the following capability: Export or pull from SOLR the list of terms that have been indexed for a document or set of documents, along with the term frequency count. 1. Does SOLR already provide an API or method to

Re: How to export the list of terms indexed in Solr?

2015-04-29 Thread Koji Sekiguchi
Hi brent3600, You can use NLP4L for this purpose. NLP4L is good at counting the number of words not only in whole index but also in a set of documents. There is a tutorial for this function. Count the number of words http://nlp4l.github.io/tutorial_ja.html#useNLP Sorry but the tutorial is