Hi I am currently using function queries to obtain these two statistics, as I didn't see a better or more explicit API and the Terms component only returns docFreq, but not totalTermFreq.
The way I use the API is submit requests as follows: curl " http://localhost:8983/solr/mycollection/select?q=*:*&rows=1&fl=ttf(text,'t1'),docfreq(text,'t1 ')" Today I noticed that it sometimes returns 0 for these stats for existing terms. After debugging and going through the code, I noticed that it performs analysis on the value that's given. So if I provide an already stemmed value, it analyzes the value further and in some cases it results in a non-existing term (and in other cases I get stats for a term I didn't ask for). I want to get the stats of the indexed version of the terms, and that's why I send the already stemmed one. In my case I tried to get the stats for the term 'disguis' which is the stem of 'disguise' and 'disguised', however it further analyzed the value to 'disgui' (per the analysis chain) and that term does not exist in the index. So first question is -- is this the right API to retrieve such statistics? I didn't find another one, but could be I missed it. If it is, why does it analyze the value? I tried to wrap the value with single and double quotes, but of course that does not affect the analysis ... is analysis an intended behavior or a bug? Shai
