Re: [sqlite] Packing integer primary key with field bits

R Smith Thu, 10 Aug 2017 08:24:28 -0700


On 2017/08/10 4:51 PM, x wrote:

Valid point on the intended range Gunter. I don’t know enough about sqlite to fully 
understand your index cell paragraph. I thought the way sqlite worked was e.g. to 
get the value of the 3rd column it had to read the lengths of col1 & col2 so it 
knows where col 3 value starts (I’ve seen a comment that the most retrieved cols 
should be at the start of the record)

This is true, but the cost of "reading" the bytes to get to the valuesare no worse than what your cost would be for reading the Int64 andunpacking the bytes from there... SQLite internally does the same thingonly without using an Int64, it just uses an array of bytes, but gleansthe values equally efficiently. If it was better to use Int64 values instead of a byte-array, SQLite would already be using it. By the way,Int64 is just another way of saying "list of 8 bytes", which is justanother way of saying "list of 64 bits", there is nothing magically moreefficient about 64bit INT than a list of 8 bytes. Memory is memory.Your idea might offer savings in that you predetermine the length ofstored values and ensure they fit into that list of 8 bytes, whereasSQLite stores also a Length, but has the vastly superior property ofbeing able to store any size of any type of variable. Again, the speedgain is dubious for the price it comes at.

To make matters worse, the real bottleneck is usually the storage layer,and to read even 1 byte from a row, or Index, usually requires readingan entire File-System Page (Typically ~4 Thousand bytes) but somesuccessive reads are usually gotten from the same page to make mattersmore efficient. The point being, the reading of that File system page isseveral magnitudes more time-intensive than unpacking the bytes. Youwill not save anything much speed-wise, and very little space-wise, andthen only for smaller row-lengths.

By the way, the same goes for reading your own array of bytes from thestorage medium - the hope is of course that some good caching at boththe storage layer and your application layer would smooth out read/writewaits.

  and that indexes are stored the same way.

This is not quite true. The Index is (usually) a B-Tree structure withconsistent size value indices. An Index is built for speed, not storageefficiency. Adding an Index effectively copies the entire column(s) theIndex refers to into slightly less efficient (but more accessible)storage - It's much worse space-wise than simply using separate fieldsand no Index.[Note: I am not 100% sure about the BTree storage format in SQLite'scase, I refer here to other DB Engines, but I would be surprised if anSQLite BTree Index is stored exactly the same as the table Row format.]



_______________________________________________
sqlite-users mailing list
[email protected]
http://mailinglists.sqlite.org/cgi-bin/mailman/listinfo/sqlite-users

Re: [sqlite] Packing integer primary key with field bits

Reply via email to