Re: [HACKERS] Refactoring the Type System

Darren Duncan Sat, 13 Nov 2010 19:54:53 -0800

David Fetter wrote:

For the past couple of years, I've been hearing from the PostGIS
people among others that our type system just isn't flexible enough
for their needs.  It's really starting to show its age, or possibly
design compromises that seemed reasonable a decade or more ago, but
are less so now.


To that end, I've put up a page on the wiki that includes a list of
issues to be addressed.  It's intended to be changed, possibly
completely.

http://wiki.postgresql.org/wiki/Refactor_Type_System

What might the next version of the type system look like?

Are you talking about changes to the type system as users see it or just changes to how the existing behavior is implemented internally? If you're talking about, as users see it, which the other replies to this thread seem to be saying, though not necessarily the URL you pointed to which looks more internals ...


As a statement which may surprise no one who's heard me talk about it before ...

I've mostly completed a type system specification that would be useable by Postgres, as the most fundamental part of my Muldis D language.

The type system is arguably the most central piece of any DBMS, around which everything else is defined and built.


You have data, which is structured in some way, and has operators for it.

If you look at a DBMS from the perspective of being a programming language implementation, you find that a database is just a variable that holds a value of a structured type. In the case of a relational database, said database is a tuple whose attribute values are relations; or in the case of namespaces/schemas, the database tuple has tuple attributes having relation attributes.
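A minimal sketch of that view, written in Python rather than Muldis D. The names Supplier, Part, and Database are hypothetical and only illustrate the shape: a relation is modeled as a frozenset of tuples, and the database is one tuple value whose attributes are relations.

```python
from typing import FrozenSet, NamedTuple


class Supplier(NamedTuple):
    sno: int
    name: str


class Part(NamedTuple):
    pno: int
    color: str


class Database(NamedTuple):
    # Each attribute of the database tuple is a relation (a set of tuples).
    suppliers: FrozenSet[Supplier]
    parts: FrozenSet[Part]


# The "database variable" holds a single structured value.
db = Database(
    suppliers=frozenset({Supplier(1, "Smith"), Supplier(2, "Jones")}),
    parts=frozenset({Part(10, "red")}),
)

# An update is an assignment of a new value to that variable.
db = db._replace(parts=db.parts | {Part(11, "blue")})
```

Under this reading, an INSERT is shorthand for assigning the variable a new value that is the old one plus some tuples.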

If a database is a variable, then all database constraints are type constraints on the declared type of that variable, and you can make said constraints arbitrarily complicated.
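As a rough illustration, again in Python with hypothetical names, a key, a foreign key, and a check constraint can all be folded into one predicate over the whole database value. This is a sketch of the idea only, not how Muldis D or Postgres expresses constraints.

```python
from typing import FrozenSet, NamedTuple


class Supplier(NamedTuple):
    sno: int
    name: str


class Shipment(NamedTuple):
    sno: int
    qty: int


class Database(NamedTuple):
    suppliers: FrozenSet[Supplier]
    shipments: FrozenSet[Shipment]


def is_valid_database(db: Database) -> bool:
    """Type constraint on the database value as a whole."""
    snos = [s.sno for s in db.suppliers]
    known = set(snos)
    key_ok = len(snos) == len(known)                      # sno is a key
    fk_ok = all(sh.sno in known for sh in db.shipments)   # foreign key
    check_ok = all(sh.qty > 0 for sh in db.shipments)     # check constraint
    return key_ok and fk_ok and check_ok
```

A value belongs to the declared type of the database variable only if the predicate holds, so rejecting an invalid update is the same thing as rejecting a value that is not of the type.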

From basic structures like nestable tuples and relations, plus a complement of basic types like numbers and strings, and arbitrary constraints, you can define data types of any shape or form.

A key component of a good type system is that users can define data types, and moreover where possible, system-defined types are defined in the same ways as users define types. For example, stuff like temporal types or geospatial types are prime candidates for being defined like user-defined types.

If you define all structures using tuples and relations, you can easily flatten this out on the implementation end and basically do everything as associated flat relation variables as you do now.

So what I propose is both very flexible and easy to implement, scale, and optimize, relatively speaking.

You don't have to kludge things by implementing arrays as blobs for example; you can implement them as relations instead. Geospatial types can just be tuples. Arrays of structured types can just be relations with an attribute per type attribute. Arrays of simple types can just be unary relations.
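A small Python sketch of an array of a structured type stored as a relation with one attribute per type attribute. All names are hypothetical. It also assumes an extra index attribute, which is not spelled out above, because a set-based relation would otherwise lose element order and duplicates.

```python
from typing import FrozenSet, List, NamedTuple


class Point(NamedTuple):
    # A simple geospatial value is just a tuple.
    x: float
    y: float


class PointElem(NamedTuple):
    # One relation attribute per Point attribute, plus an assumed
    # index attribute that records the array position.
    index: int
    x: float
    y: float


def array_to_relation(points: List[Point]) -> FrozenSet[PointElem]:
    """Store an array of points as a flat relation."""
    return frozenset(PointElem(i, p.x, p.y) for i, p in enumerate(points))


def relation_to_array(rel: FrozenSet[PointElem]) -> List[Point]:
    """Rebuild the array by ordering the tuples on the index attribute."""
    return [Point(e.x, e.y) for e in sorted(rel, key=lambda e: e.index)]
```

The round trip preserves both order and repeated elements, and the stored form is an ordinary flat relation that existing relational machinery could index and query.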

You can also emulate all of the existing Pg features and syntax that you have now over the type system I've defined, maintaining compatibility too.

I also want to emphasize that, while I drew inspiration from many sources when defining Muldis D, and there was/is a lot I still didn't/don't know about Postgres, as I use and learn Postgres I frequently find that how Postgres does things is similar to and compatible with how I independently came up with Muldis D's design; I'm finding more similarities all the time.


-- Darren Duncan

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers
