Re: [sqlite] Count(1)

Noah Hart Fri, 04 Apr 2008 09:47:32 -0700

Questions to the SQLite maintainers...

The docs tell us that ...
    ** The page headers looks like this:
    **
    **   OFFSET   SIZE     DESCRIPTION
    **      0       1      Flags. 1: intkey, 2: zerodata, 4: leafdata, 8: leaf
    **      1       2      byte offset to the first freeblock
    **      3       2      number of cells on this page

Since the count of cells in use is stored in the header of each btree page,
wouldn't it be pretty easy to optimize count(*) along these lines?

<PSEUDO CODE>
count = 0
Btree_MOVE_TO_FIRST_ENTRY
while not Btree_END_OF_TREE
 count += NUMBER_OF_ENTRIES_ON_THIS_CHILD_PAGE
 Btree_MOVE_TO_NEXT_CHILD_PAGE
return count;
</PSEUDO CODE>

With large row content lengths, the savings would be minimal.
However, even with row content lengths around 100, the savings would be
10x.
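
To make the idea concrete, here is a rough Python sketch that reads a
database file directly and sums the "number of cells" field (offset 3 of
the page header) over every leaf page of a table btree. It is only an
illustration under stated assumptions, not how SQLite implements
count(*): the function names are made up for this example, the file is
assumed to be fully written to disk (no WAL), and the page type bytes
0x0D (table leaf) and 0x05 (table interior) are the flag combinations
8+4+1 and 4+1 from the header comment above. Note that it still has to
touch every leaf page; what it skips is decoding the individual cells.

```python
import os
import sqlite3
import struct
import tempfile

LEAF_TABLE = 0x0D      # flags 8 (leaf) + 4 (leafdata) + 1 (intkey)
INTERIOR_TABLE = 0x05  # flags 4 (leafdata) + 1 (intkey)


def count_rows_from_page_headers(path, root_page):
    """Sum the per-page cell counts of every leaf in a table btree."""
    with open(path, "rb") as f:
        data = f.read()
    page_size = struct.unpack(">H", data[16:18])[0]
    if page_size == 1:  # the file header stores 65536 as 1
        page_size = 65536

    total = 0
    stack = [root_page]
    while stack:
        pgno = stack.pop()
        start = (pgno - 1) * page_size
        page = data[start:start + page_size]
        hdr = 100 if pgno == 1 else 0  # page 1 begins with the 100-byte file header
        flags = page[hdr]
        ncell = struct.unpack(">H", page[hdr + 3:hdr + 5])[0]
        if flags == LEAF_TABLE:
            total += ncell  # use the header count; no cell is decoded
        elif flags == INTERIOR_TABLE:
            # The right-most child pointer sits at offset 8 of the header.
            stack.append(struct.unpack(">I", page[hdr + 8:hdr + 12])[0])
            # Each interior cell starts with a 4-byte left child page number.
            for i in range(ncell):
                ptr = hdr + 12 + 2 * i
                cell = struct.unpack(">H", page[ptr:ptr + 2])[0]
                stack.append(struct.unpack(">I", page[cell:cell + 4])[0])
        else:
            raise ValueError("page %d is not a table btree page" % pgno)
    return total


def demo(n_rows=5000):
    """Build a throwaway table and return (header-based count, COUNT(*))."""
    fd, path = tempfile.mkstemp(suffix=".db")
    os.close(fd)
    try:
        con = sqlite3.connect(path)
        con.execute("CREATE TABLE t (id INTEGER PRIMARY KEY, payload TEXT)")
        con.executemany("INSERT INTO t (payload) VALUES (?)",
                        (("x" * 100,) for _ in range(n_rows)))
        con.commit()
        root = con.execute(
            "SELECT rootpage FROM sqlite_master WHERE name = 't'").fetchone()[0]
        sql_count = con.execute("SELECT COUNT(*) FROM t").fetchone()[0]
        con.close()
        return count_rows_from_page_headers(path, root), sql_count
    finally:
        os.remove(path)


if __name__ == "__main__":
    print(demo())
```

Calling demo() returns the count obtained from the page headers next to
the result of SELECT COUNT(*), so the two can be compared directly.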

Regards -- Noah

-----Original Message-----
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Scott Hess
Sent: Friday, April 04, 2008 9:15 AM
To: General Discussion of SQLite Database
Subject: Re: [sqlite] Count(1)

What I meant when I said "full table scan" is that it has to read at
least something for every single row in the table.  So the following
are going to be the same:

  SELECT COUNT(*) FROM t;
  SELECT COUNT(rowid) FROM t;

It won't have to scan any overflow pages, but it will have to hit all
the leaf nodes.

You could certainly do a full scan on an index other than the rowid.
It might involve much less reading if the indexed items are small
relative to the overall row.  Not sure if SQLite does this
optimization for you or not (I don't think it much matters - it's
still going to be O(N), just with a lower constant).
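
As a rough illustration of the "much less reading" point, the Python
sketch below builds a table with wide rows plus an index on a small
column, then counts how many btree pages hang off each root by walking
the file directly. The table, index, and function names are invented for
this example, overflow pages are not counted, and the file is assumed to
be fully written to disk (no WAL). It does not show whether SQLite picks
the index on its own for COUNT(*).

```python
import os
import sqlite3
import struct
import tempfile

INTERIOR_PAGE_TYPES = {0x02, 0x05}  # interior index, interior table
LEAF_PAGE_TYPES = {0x0A, 0x0D}      # leaf index, leaf table


def btree_page_count(data, page_size, root_page):
    """Count btree pages reachable from root_page (overflow pages excluded)."""
    pages = 0
    stack = [root_page]
    while stack:
        pgno = stack.pop()
        base = (pgno - 1) * page_size
        hdr = base + (100 if pgno == 1 else 0)  # page 1 has the file header first
        page_type = data[hdr]
        ncell = struct.unpack_from(">H", data, hdr + 3)[0]
        pages += 1
        if page_type in INTERIOR_PAGE_TYPES:
            # Right-most child pointer, then one left child pointer per cell.
            stack.append(struct.unpack_from(">I", data, hdr + 8)[0])
            for i in range(ncell):
                cell = struct.unpack_from(">H", data, hdr + 12 + 2 * i)[0]
                stack.append(struct.unpack_from(">I", data, base + cell)[0])
        elif page_type not in LEAF_PAGE_TYPES:
            raise ValueError("page %d is not a btree page" % pgno)
    return pages


def demo_index_vs_table(n_rows=2000, payload_len=1000):
    """Return (table btree pages, index btree pages) for a throwaway database."""
    fd, path = tempfile.mkstemp(suffix=".db")
    os.close(fd)
    try:
        con = sqlite3.connect(path)
        con.execute("CREATE TABLE t (id INTEGER PRIMARY KEY, k INTEGER, payload TEXT)")
        con.execute("CREATE INDEX t_k ON t (k)")
        con.executemany("INSERT INTO t (k, payload) VALUES (?, ?)",
                        ((i, "x" * payload_len) for i in range(n_rows)))
        con.commit()
        roots = dict(con.execute(
            "SELECT name, rootpage FROM sqlite_master WHERE name IN ('t', 't_k')"))
        con.close()
        with open(path, "rb") as f:
            data = f.read()
        page_size = struct.unpack_from(">H", data, 16)[0]
        if page_size == 1:  # the file header stores 65536 as 1
            page_size = 65536
        return (btree_page_count(data, page_size, roots["t"]),
                btree_page_count(data, page_size, roots["t_k"]))
    finally:
        os.remove(path)


if __name__ == "__main__":
    print(demo_index_vs_table())
```

With rows this wide the index btree should come out far smaller than
the table btree, which is the lower constant on the same O(N) scan.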

-scott

On Fri, Apr 4, 2008 at 8:19 AM, Samuel Neff <[EMAIL PROTECTED]> wrote:
> Scott,
>
>  Is it really a full table scan or just an index scan (at least in the
>  case where no data is needed from the table as in the original sample
>  that had no join or where clause)?
>
>  Thanks,
>
>  Sam
>
>
>
>  On Thu, Apr 3, 2008 at 4:12 PM, Scott Hess <[EMAIL PROTECTED]> wrote:
>
>  > A little bit more info:  SELECT COUNT(*) is implemented as a full
>  > table scan, so SQLite is visiting every row in the table, which will
>  > get slower and slower as the table gets bigger and the database
>  > fragments.  This differs from many database engines (which implement
>  > an optimization for this).  Doing the trigger thing means that it only
>  > visits the specific row that contains the count.
>  >
>  > -scott
>  >
>  >
>
> _______________________________________________
>  sqlite-users mailing list
>  [email protected]
>  http://sqlite.org:8080/cgi-bin/mailman/listinfo/sqlite-users
>