Performance regression with 10.2.1.0, oddly throughput drops with fix for derby-1543

Sunitha Kambhampati Tue, 29 Aug 2006 08:36:49 -0700

I ran some simple performance tests with the 10.2.1.0 beta jars, and for one of the tests (let's call it ScanCoveredIdxInt) the throughput is about *56%* of the 10.1.3 release.


BRIEF DESCRIPTION OF TEST:

The test has a setup phase, a runtest phase, and a cleanup phase. The timed portion of the test is the runtest phase.
What the test does:

-- setup: drop table, create table, insert 20000 rows (each row is approximately 100 bytes), then create a unique composite index.

-- runtest, the timed part of the test, is multiple iterations of a select that should involve only an index scan of the data; 10000 rows qualify.

-- cleanup - drops the table.

-- The runtest part is iterated 400 times, and only the runtest part is timed (scroll to the end of this mail for code snippets from the test).

Each testrun consists of the setup phase, then the runtest phase, then cleanup. The select query is iterated 400 times. The testrun is repeated four times, and the average throughput is calculated from the last three testruns; the first testrun result is ignored.
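The averaging scheme described above can be sketched as follows. This is only an illustration, not the actual test harness: the class and method names are hypothetical, and it assumes throughput is computed as select iterations per second (each iteration returns 10000 rows, which would match a unit of 10000 rows/sec).

```java
// Hypothetical helper, not part of the real test framework.
class ThroughputHarness {

    /** Iterations per second for one timed runtest phase (assumed definition). */
    static double throughput(int iterations, long elapsedNanos) {
        return iterations / (elapsedNanos / 1e9);
    }

    /** Mean of all testruns except the first, which is treated as warm-up. */
    static double averageIgnoringFirst(double[] throughputs) {
        if (throughputs.length < 2) {
            throw new IllegalArgumentException("need at least two testruns");
        }
        double sum = 0;
        for (int i = 1; i < throughputs.length; i++) {
            sum += throughputs[i];
        }
        return sum / (throughputs.length - 1);
    }
}
```

With four testruns, averageIgnoringFirst returns the mean of testruns two through four, so any one-time cost paid during the first testrun does not enter the reported number.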

ibm142/linux (2.8 Intel Xeon CPU, hyperthreading enabled, 4 GB RAM, SCSI disks). Throughput unit is 10000 rows/sec.

Throughput with 10.1.3  - 24
Throughput with 10.2.1.0 - 13
(The measurement is throughput, so a higher number is better.)
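As a quick check of the ratio, here is a small sketch; the class and method names are made up for illustration. With the rounded figures quoted above, 13 against 24 works out to roughly 54%, so the 56% mentioned earlier presumably comes from the unrounded measurements.

```java
// Hypothetical helper for comparing two throughput figures.
class RelativeThroughput {

    /** Returns candidate as a percentage of baseline. */
    static double percentOf(double candidate, double baseline) {
        return 100.0 * candidate / baseline;
    }
}
```

For example, RelativeThroughput.percentOf(13, 24) evaluates 100 * 13 / 24.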

Tried different things but the short story is:

1) The test does a create database and then drop table; create table; load data into table. If the 'create database' step is removed from the testrun, the throughput of 10.1.3 and 10.2.1.0 are in the same range.

2) If a checkpoint is added after the load phase of the test, then the throughput of the select query for both 10.1.3 and 10.2.1.0 is in a similar range.
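For reference, one way to force a checkpoint after the load is Derby's SYSCS_UTIL.SYSCS_CHECKPOINT_DATABASE system procedure. The sketch below assumes that procedure is what "adding a checkpoint" means here (the mail does not say how it was issued); the class name is hypothetical, and conn is assumed to be an open connection to the test database.

```java
import java.sql.CallableStatement;
import java.sql.Connection;
import java.sql.SQLException;

// Hypothetical helper; assumes the checkpoint is issued through the
// SYSCS_UTIL.SYSCS_CHECKPOINT_DATABASE system procedure.
class CheckpointAfterLoad {

    static final String CHECKPOINT_CALL =
        "CALL SYSCS_UTIL.SYSCS_CHECKPOINT_DATABASE()";

    /** Forces a checkpoint on the database behind the given connection. */
    static void checkpoint(Connection conn) throws SQLException {
        try (CallableStatement cs = conn.prepareCall(CHECKPOINT_CALL)) {
            cs.execute();
        }
    }
}
```

In the test, such a call would go at the end of setup, after loadData() and the index creation have been committed, so that the timed select starts with the load's dirty pages already written out.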

This (#1) doesn't make much sense to me. I would have thought it was some weird i/o sync happening.

Testing with different jars, it turns out that the throughput drops at revision *#426847*. This behavior is consistent across several runs. I tested with the previous revision (426825) and the throughput is good.

This revision #426847 fixes DERBY-1543: http://svn.apache.org/viewvc?rev=426847&view=rev

"1) Now Derby raises an SQLWarning when SQL authorization is ON without authentication at connect time. This is done by checking if AuthenticationService being used is an instance of NoneAuthenticationServiceImpl. Since this is the default authentication service with Derby, it should always be present.

2) Added code to drop permission descriptors from SYSTABLEPERMS, SYSCOLPERMS and SYSROUTINEPERMS when the object they provide permission for is dropped. This includes tables, views and routines and these descriptors needs to be removed from permissionCache as well."

Does this make sense? Or am I just seeing some other i/o side effect in my test? I'd appreciate your input. Thanks.


Just noting some observations of the other things that I had already tried:
-------------------------------------------------------------------

1) Changing the test to time only the select, without the load being part of the same jvm run, the throughput for 10.2.1.0 is closer to 10.1.3 (~94%). (Details: do the load separately, remove the drop table in cleanup, and thus time only the select part.)

Throughput with 10.1.3 - 59
Throughput with 10.2.1.0 - 55

I guess that, since the main purpose of the test is to time the select, the test should have had only the select as part of the jvm run.

2) I logged query plans using derby.language.logQueryPlan and checked derby.log for both the 10.1.3 run and the 10.2.1.0 run. The throughput at this time was similar. Note that logQueryPlan prints plans and is pretty verbose, so it slows down execution. In any case, FWIW, the query plans looked the same for both 10.1.3 and 10.2.1.0: an index scan is used, and the estimates were also the same.
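A minimal sketch of turning on plan logging from Java, assuming the property is supplied as a JVM system property before the Derby engine boots (it could equally be placed in derby.properties); the class name is hypothetical.

```java
// Hypothetical helper; must run before the embedded Derby engine is booted.
class QueryPlanLogging {

    static final String PROP = "derby.language.logQueryPlan";

    /** Asks Derby to write the plan of each executed query to derby.log. */
    static void enable() {
        System.setProperty(PROP, "true");
    }
}
```

Because every executed statement then writes its plan to derby.log, throughput numbers taken with this setting on are not comparable to the normal runs.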

From #1, it seems that the work related to the load is somehow being delayed and thus affecting the select part of the test.

Some theories :

a) Maybe it was the changes that went in as part of derby-888, which improves performance of page allocation, that could have affected it. This change writes pages to the OS at allocation time but doesn't sync them. Before, the writes would have all been synced, so there was no OS work to do after the load.

b) A checkpoint is getting triggered, and that could be affecting the select portion of the run.

c) Something is wrong with the optimizer. But observation #2 rules this out.

Further things tried:

To prove that it may be a result of (a), I built jars from just before the derby-888 changes and ran the tests.

* Modified test, pre and post 888: no throughput change seen. This is ok. Of interest is the throughput value (130000 rows/sec).

* Original test case with load+select, post 888: the inserts are done much faster than pre-888. This is expected and ok.


Checkpoint case:
With load + explicit checkpoint, and then time the test.

* post 888 shows a little improvement over pre 888 (pre is 12, post is 14.6).

* 10.1.3 and 10.2.1.0 throughput are in the same range (10.1.3 - 14.3, 10.2 - 12.6).

Modified test (#1) with the checkpoint interval set to 100mb, to avoid a checkpoint during the testrun: 10.2 is about 90% of 10.1.3.
Original test with the checkpoint interval set to 100mb, to avoid any checkpoints from happening: 10.2 and 10.1 are in a similar range.
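A sketch of how the 100mb checkpoint interval could be set. It assumes the knob in question is the derby.storage.checkpointInterval property, that its value is given in bytes, and that it is supplied as a system property before the engine boots; the mail does not state how the interval was actually configured, and the class name is hypothetical.

```java
// Hypothetical helper; assumes derby.storage.checkpointInterval takes bytes.
class CheckpointInterval {

    static final String PROP = "derby.storage.checkpointInterval";

    // 100 MB expressed in bytes: 100 * 1024 * 1024 = 104857600.
    static final long HUNDRED_MB = 100L * 1024 * 1024;

    /** Sets the interval as a system property; call before Derby boots. */
    static void apply() {
        System.setProperty(PROP, Long.toString(HUNDRED_MB));
    }
}
```

The idea is that loading 20000 rows of about 100 bytes each produces far less than 100 MB of log, so no checkpoint should be triggered by log volume during the testrun.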

-------------------------------------------------------------

Pseudo code:

setup:

....
       try
       {
           execSQL("drop table " + getTableName());
       }
       catch (SQLException se)
       {
           out.println("Caught expected SQL exception " + se.toString());
       }

       execSQL("create table " + getTableName() + " ("
           + "i1 int, i2 int, i3 int, i4 int, i5 int, "
           + "c6 char(20), c7 char(20), c8 char(20), c9 char(20))");

       conn.setAutoCommit(false);
       loadData();   //loads 20000 rows.

       execSQL(
           "create unique index " +
           getTableName() + "x on " + getTableName() + "(i1, i3)");
       selectStatement =
           conn.prepareStatement(
               "select i1 from " + getTableName() +
               " where i1 > ? and i1 <= ?");

       conn.commit();

runpart:
   public void run() throws Exception
   {
       // set begin scan to start 1/4 into the data set.
       selectStatement.setInt(1, ((getRowCount() * 2) / 4));

       // set end scan to end 3/4 into the data set.
       selectStatement.setInt(2, (((getRowCount() * 2) / 4) * 3));

       ResultSet rs = selectStatement.executeQuery();
       while (rs.next())
       {
           int i = rs.getInt(1);
       }
       rs.close();
       conn.commit();
   }
