Kip,
> What can I do to optimize this query?
For more efficient alternatives see "Within-group aggregates" at
http://www.artfulsoftware.com/queries.php.
PB
-----
Kip Turk wrote:
I'm having problems optimizing a series of subselects. I have the
following sample table:
mysql> select * from fake order by msgid, id desc;
+----+-------+-----+---------------------+
| id | msgid | nec | dt |
+----+-------+-----+---------------------+
| 10 | 1 | 300 | 2008-06-25 09:18:05 |
| 9 | 1 | 301 | 2008-06-25 09:18:02 |
| 6 | 1 | 305 | 2008-06-25 09:15:40 |
| 5 | 1 | 301 | 2008-06-25 09:15:32 |
| 2 | 1 | 301 | 2008-06-25 09:15:10 |
| 1 | 1 | 300 | 2008-06-25 09:15:04 |
| 11 | 2 | 300 | 2008-06-25 09:18:13 |
| 8 | 2 | 305 | 2008-06-25 09:17:49 |
| 4 | 2 | 305 | 2008-06-25 09:15:19 |
| 3 | 2 | 301 | 2008-06-25 09:15:14 |
| 7 | 3 | 305 | 2008-06-25 09:17:44 |
| 12 | 4 | 300 | 2008-06-25 09:23:22 |
| 14 | 5 | 305 | 2008-06-25 09:23:39 |
| 13 | 5 | 301 | 2008-06-25 09:23:33 |
| 15 | 6 | 300 | 2008-06-25 09:23:45 |
+----+-------+-----+---------------------+
I'm trying to grab and count the nec for the highest id entry for each
distinct msgid. To get the correct entries, I can use:
mysql> select * from (select * from fake order by id desc) as fake1
group by msgid;
+----+-------+-----+---------------------+
| id | msgid | nec | dt |
+----+-------+-----+---------------------+
| 10 | 1 | 300 | 2008-06-25 09:18:05 |
| 11 | 2 | 300 | 2008-06-25 09:18:13 |
| 7 | 3 | 305 | 2008-06-25 09:17:44 |
| 12 | 4 | 300 | 2008-06-25 09:23:22 |
| 14 | 5 | 305 | 2008-06-25 09:23:39 |
| 15 | 6 | 300 | 2008-06-25 09:23:45 |
+----+-------+-----+---------------------+
And to get the counts, I can use:
mysql> select nec, count(nec) as count from (select * from (select *
from fake order by id desc) as fake1 group by msgid) as fake2 group by
nec;
+-----+-------+
| nec | count |
+-----+-------+
| 300 | 4 |
| 305 | 2 |
+-----+-------+
So on my tiny test table, the logic is valid to get the results I
want. However, on my actual table with several million lines, the
nested selects makes this a pretty ugly option (to the point even
explain took a few minutes to respond). What can I do to optimize
this query?
Thanks,
Kip Turk
------------------------------------------------------------------------
No virus found in this incoming message.
Checked by AVG.
Version: 8.0.101 / Virus Database: 270.4.1/1518 - Release Date: 6/25/2008 9:46 AM