We are indexing collections of documents (files) with SOLR, and would like
the following capability:
Export or pull from SOLR the list of terms that have been indexed for a
document or set of documents, along with the term frequency count.
1. Does SOLR already provide an API or method to
Hi brent3600,
You can use NLP4L for this purpose. NLP4L is good at counting the number of
words
not only in whole index but also in a set of documents. There is a tutorial
for this function.
Count the number of words
http://nlp4l.github.io/tutorial_ja.html#useNLP
Sorry but the tutorial is