PFC wrote:
Even though this query isn't that optimized, it's still only 16 milliseconds.
    Why does it take this long for PHP to get the results?

Can you try pg_query'ing this exact same query, FROM PHP, and timing it with getmicrotime() ?

That query took about 27 msec in actual PHP execution time. It turns out the real culprit is the following query, which interestingly enough retrieves zero rows in the case of the Economists page that I've been using for testing, yet it uses up about 1370 msec in actual runtime:

SELECT topic_id1, topic_id2, topic_name, categ_id, list_name, t.title, url, page_type, rel_type, inverse_id, r.description AS rel_descrip, r.created, r.updated
FROM relationship r, topic t, entry_type e
WHERE ((topic_id1 = topic_id AND topic_id2 = 1252) OR (topic_id2 = topic_id and topic_id1 = 1252)) AND rel_type = type_id AND e.class_id = 2
ORDER BY rel_type, list_name;
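For reference, here is roughly how I'm timing the queries from PHP (a sketch, not the exact harness -- microtime(true) stands in for the getmicrotime() helper mentioned above, and the connection string is a placeholder):

```php
<?php
// Placeholder connection string -- substitute real credentials.
$db = pg_pconnect('host=localhost dbname=test user=joe');

// $sql would hold the relationship query shown above.
$sql = 'SELECT 1';

$start = microtime(true);              // float seconds since the epoch
$result = pg_query($db, $sql);
$elapsed = (microtime(true) - $start) * 1000.0;

printf("%d rows in %.3f msec\n", pg_num_rows($result), $elapsed);
?>
```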

The EXPLAIN ANALYZE output, after I ran VACUUM ANALYZE on the three tables, is:

Sort  (cost=4035.55..4035.56 rows=1 width=131) (actual time=2110.000..2110.000 rows=0 loops=1)
   Sort Key: r.rel_type, t.list_name
   ->  Nested Loop  (cost=36.06..4035.54 rows=1 width=131) (actual time=2110.000..2110.000 rows=0 loops=1)
         Join Filter: ((("inner".topic_id1 = "outer".topic_id) AND ("inner".topic_id2 = 1252)) OR (("inner".topic_id2 = "outer".topic_id) AND ("inner".topic_id1 = 1252)))
         ->  Seq Scan on topic t  (cost=0.00..38.34 rows=1234 width=90) (actual time=0.000..15.000 rows=1234 loops=1)
         ->  Materialize  (cost=36.06..37.13 rows=107 width=45) (actual time=0.000..0.509 rows=466 loops=1234)
               ->  Merge Join  (cost=30.31..35.96 rows=107 width=45) (actual time=0.000..0.000 rows=466 loops=1)
                     Merge Cond: ("outer".type_id = "inner".rel_type)
                     ->  Index Scan using entry_type_pkey on entry_type e  (cost=0.00..3.94 rows=16 width=4) (actual time=0.000..0.000 rows=15 loops=1)
                           Filter: (class_id = 2)
                     ->  Sort  (cost=30.31..31.48 rows=466 width=43) (actual time=0.000..0.000 rows=466 loops=1)
                           Sort Key: r.rel_type
                           ->  Seq Scan on relationship r  (cost=0.00..9.66 rows=466 width=43) (actual time=0.000..0.000 rows=466 loops=1)
 Total runtime: 2110.000 ms
(14 rows)

The tables are as follows:

CREATE TABLE entry_type (
  type_id SMALLINT NOT NULL PRIMARY KEY,
  title VARCHAR(32) NOT NULL,
  rel_title VARCHAR(32),
  class_id SMALLINT NOT NULL DEFAULT 1,
  inverse_id SMALLINT,
  updated TIMESTAMP WITH TIME ZONE NOT NULL DEFAULT CURRENT_TIMESTAMP)
WITHOUT OIDS;

CREATE TABLE topic (
  topic_id serial PRIMARY KEY,
  topic_name VARCHAR(48) NOT NULL UNIQUE,
  categ_id SMALLINT NOT NULL,
  parent_entity INTEGER,
  parent_concept INTEGER,
  crossref_id INTEGER,
  list_name VARCHAR(80) NOT NULL,
  title VARCHAR(80),
  description VARCHAR(255),
  url VARCHAR(64),
  page_type SMALLINT NOT NULL,
  dark_ind BOOLEAN NOT NULL DEFAULT FALSE,
  ad_code INTEGER,
  created DATE NOT NULL DEFAULT CURRENT_DATE,
  updated TIMESTAMP WITH TIME ZONE NOT NULL DEFAULT CURRENT_TIMESTAMP)
WITHOUT OIDS;

CREATE TABLE relationship (
  topic_id1 INTEGER NOT NULL REFERENCES topic,
  topic_id2 INTEGER NOT NULL REFERENCES topic,
  rel_type INTEGER NOT NULL,
  description VARCHAR(255),
  created DATE NOT NULL DEFAULT CURRENT_DATE,
  updated TIMESTAMP WITH TIME ZONE NOT NULL DEFAULT CURRENT_TIMESTAMP,
  PRIMARY KEY (topic_id1, topic_id2, rel_type))
WITHOUT OIDS;

I'm thinking that perhaps I need to set up another index with topic_id2 first and topic_id1 second. In addition, an index on entry_type.class_id may improve things. Another possibility would be to rewrite the query as a UNION.
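Concretely, the indexes and the UNION rewrite I have in mind would look something like this (index names are placeholders of my choosing; untested):

```sql
-- Composite index matching the (topic_id2, topic_id1) lookup order;
-- the primary key already covers lookups that lead with topic_id1.
CREATE INDEX relationship_id2_id1 ON relationship (topic_id2, topic_id1);

-- Index to avoid scanning entry_type for class_id = 2.
CREATE INDEX entry_type_class_id ON entry_type (class_id);

-- UNION rewrite: each branch has simple equality conditions instead of
-- the OR, so the planner can use an index on each side.
SELECT topic_id1, topic_id2, topic_name, categ_id, list_name, t.title,
       url, page_type, rel_type, inverse_id, r.description AS rel_descrip,
       r.created, r.updated
FROM relationship r, topic t, entry_type e
WHERE topic_id1 = topic_id AND topic_id2 = 1252
  AND rel_type = type_id AND e.class_id = 2
UNION
SELECT topic_id1, topic_id2, topic_name, categ_id, list_name, t.title,
       url, page_type, rel_type, inverse_id, r.description AS rel_descrip,
       r.created, r.updated
FROM relationship r, topic t, entry_type e
WHERE topic_id2 = topic_id AND topic_id1 = 1252
  AND rel_type = type_id AND e.class_id = 2
ORDER BY rel_type, list_name;
```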

Of course, this doesn't explain how MySQL manages to execute the query in about 9 msec. The only minor differences in the schema are: entry_type.title and rel_title are char(32) in MySQL, entry_type.class_id is a tinyint, and topic.categ_id, page_type and dark_ind are also tinyints. MySQL also doesn't have the REFERENCES constraints.

A couple of interesting side notes from my testing. First, pg_connect() took about 39 msec while mysql_connect() took only 4 msec; however, pg_pconnect() took 0.14 msec versus 0.99 msec for mysql_pconnect() (all tests were repeated five times and the quoted results are averages). Second, PostgreSQL's performance appears to be much more consistent on certain queries. For example, the query that retrieves the list of subtopics (the names and descriptions of economists) took 17 msec on average in PG, with a low of 15 (three times) and a high of 21, whereas MySQL averaged 60 msec with a low of 22 and a high of 102 msec.

Joe

