After doing some more testing, I've seen the performance decrease yet again.
It happens after solr has been run for about 1/2 hour. I left my test
running over the weekend and saw the CPU usage go down to a reasonable level
at the end of the weekend. It is the same problem where the CPU has maximum
usage. I attached a profiler to the solr instance and found that 99% of the
CPU time is spent in the doFilter method of the SolrDispatchFilter class.

Does anyone know why all of the CPU would be hogged on this particular
method?

I'm requesting by relevance without sorting. I'm requesting 500 results per
query. There are no repititions in the query set.

As for the fields. I'm using String and SortableInt fields. There are 3
string fields and 3 Sortable Int fields in my schema. One of the String
Fields is multivalued. The fields are quite small. Since its 18 GB for a 100
million document index.

Thanks,
Brad

2009/11/6 ba ba <solrm...@gmail.com>

> After looking at the question about the sorting. It seems that the schema
> was using the SortableIntField class. When I did not return these fields in
> the queries, I got reasonable CPU usage. If I search only on one of these
> SortableIntFields, I get the bad query performance. I think the problem is
> the schema is using a Sortable field when I don't need a sortable field.
>
> Thanks for the help.
>
> -Brad
>
> 2009/11/5 Otis Gospodnetic <otis_gospodne...@yahoo.com>
>
> You may also want to share some sample queries, your fields definitions,
>> and tell us how long a core remains 100% utilized.
>>
>>  Otis
>> --
>> Sematext is hiring -- http://sematext.com/about/jobs.html?mls
>> Lucene, Solr, Nutch, Katta, Hadoop, HBase, UIMA, NLP, NER, IR
>>
>>
>>
>> ----- Original Message ----
>> > From: ba ba <solrm...@gmail.com>
>> > To: solr-user@lucene.apache.org
>> > Sent: Thu, November 5, 2009 9:20:13 PM
>> > Subject: CPU Max Utilization
>> >
>> > Greetings,
>> >
>> > I'm running a solr instance with 100 million documents in it. The index
>> is
>> > 18 GB.
>> >
>> > The strange behavior I'm seeing is CPU utilization gets maxed out. I'm
>> > running on an 8 core machine with 32 GB or ram. Every concurrent query I
>> run
>> > on it uses up one of the cores. So, if I am running 1 concurrent query
>> I'm
>> > using up the cpu of one of the cores. If I have 8 concurrent queries I'm
>> > using up all of the cores.
>> >
>> > Is this normal to have such a high CPU utilization. If not, what am I
>> doing
>> > wrong here. The only thing I have modified is the schema.xml file to
>> > correspond to the documents I want to store. Everything else is just
>> using
>> > the default values for all the config files.
>> >
>> > Thanks.
>>
>>
>

Reply via email to