Just a little follow-up - I did a little more testing, and the query
takes 20 seconds no matter what - If there's one document in the results
set, or if I do a query that returns all 130000 documents.
It seems something isn't right... it looks like solr is doing faceted
search on the whole index no matter what's the result set when doing
facets on a string field. I must be doing something wrong?
Michael Imbeault
CHUL Research Center (CHUQ)
2705 boul. Laurier
Ste-Foy, QC, Canada, G1V 4G2
Tel: (418) 654-2705, Fax: (418) 654-2212
Michael Imbeault wrote:
Been playing around with the news 'facets search' and it works very
well, but it's really slow for some particular applications. I've been
trying to use it to display the most frequent authors of articles;
this is from a huge (15 millions articles) database and names of
authors are rare and heterogeneous. On a query that takes (without
facets) 0.1 seconds, it jumps to ~20 seconds with just 1% of the
documents indexed (I've been getting java.lang.OutOfMemoryError with
the full index). ~40 seconds for a faceted search on 2 (string)
fields. Range queries on a slong field is more acceptable (even with a
dozen of them, query time is still in the subsecond range).
I'm I trying to do something which isn't what faceted search was made
for? It would be understandable, after all, I guess the facets engine
has to check very doc in the index and sort... which shouldn't yield
good performance no matter what, sadly.
Is there any other way I could achieve what I'm trying to do? Just a
list of the most frequent (top 5) authors present in the results of a
query.
Thanks,