Kip,

> What can I do to optimize this query?

For more efficient alternatives see "Within-group aggregates" at http://www.artfulsoftware.com/queries.php.
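
One thing to watch in the original: it relies on GROUP BY returning the first row of the ordered subquery for each msgid, and MySQL does not guarantee which row supplies the non-aggregated columns. As a rough sketch of the usual within-group pattern (not tested against your real table), you can find MAX(id) per msgid in a derived table and join back to pick up nec. This assumes id is unique; the index name idx_fake_msgid_id, the aliases latest and max_id, and the column alias cnt are my own placeholders.

```sql
-- Assumption: id is unique (e.g. the primary key) in table fake.
-- Hypothetical index name; a composite index on (msgid, id) may let
-- MySQL resolve MAX(id) per msgid from the index alone.
CREATE INDEX idx_fake_msgid_id ON fake (msgid, id);

-- Latest row per msgid, then count those rows by nec.
SELECT f.nec, COUNT(*) AS cnt
FROM fake AS f
JOIN (
    SELECT msgid, MAX(id) AS max_id   -- highest id for each msgid
    FROM fake
    GROUP BY msgid
) AS latest
  ON  f.msgid = latest.msgid
  AND f.id    = latest.max_id
GROUP BY f.nec;
```

On your sample data this should give the same counts as your nested version (300: 4, 305: 2). Run EXPLAIN on the real table to see whether the index is actually used for the inner GROUP BY.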

PB

-----

Kip Turk wrote:
I'm having problems optimizing a series of subselects. I have the following sample table:

mysql> select * from fake order by msgid, id desc;
+----+-------+-----+---------------------+
| id | msgid | nec | dt                  |
+----+-------+-----+---------------------+
| 10 |     1 | 300 | 2008-06-25 09:18:05 |
|  9 |     1 | 301 | 2008-06-25 09:18:02 |
|  6 |     1 | 305 | 2008-06-25 09:15:40 |
|  5 |     1 | 301 | 2008-06-25 09:15:32 |
|  2 |     1 | 301 | 2008-06-25 09:15:10 |
|  1 |     1 | 300 | 2008-06-25 09:15:04 |
| 11 |     2 | 300 | 2008-06-25 09:18:13 |
|  8 |     2 | 305 | 2008-06-25 09:17:49 |
|  4 |     2 | 305 | 2008-06-25 09:15:19 |
|  3 |     2 | 301 | 2008-06-25 09:15:14 |
|  7 |     3 | 305 | 2008-06-25 09:17:44 |
| 12 |     4 | 300 | 2008-06-25 09:23:22 |
| 14 |     5 | 305 | 2008-06-25 09:23:39 |
| 13 |     5 | 301 | 2008-06-25 09:23:33 |
| 15 |     6 | 300 | 2008-06-25 09:23:45 |
+----+-------+-----+---------------------+

I'm trying to grab and count the nec for the highest id entry for each distinct msgid. To get the correct entries, I can use:

mysql> select * from (select * from fake order by id desc) as fake1 group by msgid;
+----+-------+-----+---------------------+
| id | msgid | nec | dt                  |
+----+-------+-----+---------------------+
| 10 |     1 | 300 | 2008-06-25 09:18:05 |
| 11 |     2 | 300 | 2008-06-25 09:18:13 |
|  7 |     3 | 305 | 2008-06-25 09:17:44 |
| 12 |     4 | 300 | 2008-06-25 09:23:22 |
| 14 |     5 | 305 | 2008-06-25 09:23:39 |
| 15 |     6 | 300 | 2008-06-25 09:23:45 |
+----+-------+-----+---------------------+

And to get the counts, I can use:
mysql> select nec, count(nec) as count from (select * from (select * from fake order by id desc) as fake1 group by msgid) as fake2 group by nec;
+-----+-------+
| nec | count |
+-----+-------+
| 300 |     4 |
| 305 |     2 |
+-----+-------+

So on my tiny test table, the logic is valid and gets the results I want. However, on my actual table with several million rows, the nested selects make this a pretty ugly option (to the point that even EXPLAIN took a few minutes to respond). What can I do to optimize this query?

Thanks,
Kip Turk
