[Dbix-class] RFC - some DBMS features you use

Darren Duncan Thu, 04 Feb 2010 14:19:12 -0800

Hello,

In order to help me with some prioritization on my Muldis database projects (which would typically be used as a DBMS wrapper above the DBI level but below the level of other non-trivial DBMS wrappers like yours; it has its own query language intended to be what I think SQL should have been in the first place), and decide what features to try and design into the first version versus putting off for later, I wanted to quickly survey what SQL DBMS features you are using or care about now. I will just ask about some specific features that I haven't formally specced yet.

I may also be asking about what you know regarding the current feature set of existing SQL DBMSs, such as what they do or support already if one wanted to use them.

If I don't mention something, it could either be because I already specced it (probably 80-90% of everything you care about by now) or it isn't yet on my radar or I don't consider it important enough for now.

Some of these questions overlap, so I recommend reading all of them before answering any of them; you may be able to consolidate responses.

1. Locking. Do you explicitly place locks (including with "select ... for update" or otherwise) on either the whole or parts of the database to ensure no one else updates them for the duration of the locks? What DBMSs do or don't support doing this?


1a.  At what levels of granularity do you care about being supported?  Examples:
- whole database at once
- whole database relvars/tables at once
- just specific tuples/rows with specific primary key or other key values (eg, where person id is 3 or 7)
- generic predicate locks, covering any tuples/rows that are or might be visible to a particular query (eg, where name starts with 'f' and date_a is before date_b and the person is a member of clubs started before this date)
- other kinds?

1b. Do you want locks to be freed only explicitly or automatically at the end of a transaction commit or rollback? What do current DBMSs do and what do you want to be able to do? (Personally, I would think being able to maintain locks independently of transactions is useful, especially of inner transactions or savepoints.)

2. Read consistency. After you start a transaction, are you seeing a consistent database state besides any changes you make yourself, or if you repeated the same query without making changes, would you get the same results both times? That is, are committed changes by other database users visible to you such that they affect the results? Do you want to be able to choose a desired behavior and have it enforced, such as whether you want to see other committed changes immediately or not see them at all during your transaction, the latter especially if it is a read-only transaction? For picking either behavior, how do you currently tell the DBMS to do this? And do you do it natively with SQL or does your Perl wrapper handle it for you and if so then how?

2a. Is there anything you do differently if you are using cursors versus if you are not?

2b. Is your DBMS multi-versioned, so that someone can see a consistent database state for an extended period without blocking others that want to update it? Do you desire such functionality or do you prefer that updates of a common-interest section will block other users? What do current DBMSs do in this regard or which let you choose?
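To make the multi-versioned behavior asked about in 2 and 2b concrete, here is a minimal sketch in Python, assuming SQLite in write-ahead-log mode as the DBMS (my choice for illustration, not something the questions presume). The function name `snapshot_demo` and the table `t` are hypothetical. A reader inside an explicit transaction keeps seeing its original snapshot while a second connection commits a change without being blocked.

```python
import os
import sqlite3
import tempfile


def snapshot_demo():
    """Return the row counts a reader sees before, during and after
    another connection commits an insert."""
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "demo.db")

        # isolation_level=None: the module issues no implicit BEGIN/COMMIT,
        # so every transaction boundary below is explicit.
        writer = sqlite3.connect(path, isolation_level=None)
        writer.execute("PRAGMA journal_mode=WAL")
        writer.execute("CREATE TABLE t (n INTEGER)")
        writer.execute("INSERT INTO t VALUES (1)")

        reader = sqlite3.connect(path, isolation_level=None)
        reader.execute("BEGIN")
        # The first read fixes the snapshot for this transaction.
        before = reader.execute("SELECT count(*) FROM t").fetchall()[0][0]

        # Committed immediately by the other connection; not blocked.
        writer.execute("INSERT INTO t VALUES (2)")

        # Same query, same transaction: the committed insert is not visible.
        during = reader.execute("SELECT count(*) FROM t").fetchall()[0][0]
        reader.execute("COMMIT")

        # A new transaction sees the other user's committed change.
        after = reader.execute("SELECT count(*) FROM t").fetchall()[0][0]

        reader.close()
        writer.close()
        return before, during, after
```

Other DBMSs expose the same choice differently, typically through a per-transaction isolation level, so this shows only one point in the design space.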

2c. Do you explicitly concern yourself with the above at all, or do you just do your read and update queries without worrying about whether things might have changed in the middle, and just let the DBMS sort it out implicitly, and you let things fall as they may? Note that conceivably this is the scenario easiest to support as I just don't do anything special, sometimes. Do any Perl DBMS wrappers care about this?

3. Transition constraints. Can you currently, or do you want to be able to, declare transition constraints that the DBMS enforces? In contrast to state constraints, which look at the current state of a whole database and see if it is "consistent" with respect to itself, a transition constraint compares the state of the database before the change you made with the state afterwards and says whether the database may directly transition from the first to the second. Any SQL TRIGGER which looks at both "OLD" and "NEW" together for the purpose of veto power on a database change or transaction would qualify.

3a. Do you expect transition constraints to evaluate per statement or inner transaction / savepoint or only at the end of a main transaction, comparing the state before the transaction to after it?
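As an illustration of the kind of trigger described in question 3, here is a minimal sketch in Python against SQLite (both my choice for the example). The table `employee`, the trigger `salary_never_decreases` and the function `try_salary_changes` are hypothetical names. The trigger compares OLD and NEW and vetoes any update that would lower a salary; note that in this form it is evaluated per row and per statement, not at the end of the transaction.

```python
import sqlite3


def try_salary_changes():
    """Apply one allowed and one forbidden update; return whether the
    second was vetoed and the salary that remains stored."""
    conn = sqlite3.connect(":memory:", isolation_level=None)
    conn.execute(
        "CREATE TABLE employee (id INTEGER PRIMARY KEY, salary INTEGER NOT NULL)"
    )
    # Transition constraint: compares the state before (OLD) with the
    # state after (NEW) and aborts the statement if salary would drop.
    conn.execute(
        """
        CREATE TRIGGER salary_never_decreases
        BEFORE UPDATE OF salary ON employee
        FOR EACH ROW WHEN NEW.salary < OLD.salary
        BEGIN
            SELECT RAISE(ABORT, 'salary may not decrease');
        END
        """
    )
    conn.execute("INSERT INTO employee VALUES (1, 100)")
    conn.execute("UPDATE employee SET salary = 120 WHERE id = 1")  # allowed

    try:
        conn.execute("UPDATE employee SET salary = 90 WHERE id = 1")
        vetoed = False
    except sqlite3.DatabaseError:
        vetoed = True  # the trigger rejected the 120 -> 90 transition

    salary = conn.execute("SELECT salary FROM employee WHERE id = 1").fetchone()[0]
    conn.close()
    return vetoed, salary
```

A state constraint such as CHECK (salary >= 0) could not express this rule, since both 120 and 90 are valid states on their own.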

4. Do you use triggers to optionally cause a database update that isn't conceptually part of the triggering action, such as write to a log table, or do you only use them to define constraints that, say, can't be expressed with a CHECK clause or unique/foreign/etc key declaration? Note that if the trigger is simply implementing an updateable view, where all of the values being written are from the user in the same statement/statement group, that doesn't count, as I would consider that lot simultaneous, the cause of itself.

5. Users. Do you use multiple DBMS-recognized users in your database or just one? I refer to what you give to DBI's connect() as its user/pass. Do all users of your applications share the same user/pass or does each human user of your application have their own user/pass that is used there? If you only use one, do you wish you could have multiple, or does your Perl layer emulate multiple users without you actually using multiple at the DBMS level?

5a. Do you generate temporary database users at runtime, ostensibly for security purposes? For example, the user/pass a user of your app logs in with isn't an actual DBMS user, but when they want to login, a DBMS user whose only privilege is to invoke a specific stored procedure is used to validate the user/pass that the user gave, and then if accepted a temporary user/pass is generated by the stored procedure where those credentials are returned by the procedure, and then the application logs into the DBMS with those to do all the other/normal work of the user for that session. (Something similar to this has actually been done in a major government database application.)


5b.  Do you define your users using plain SQL or can your Perl layer do it?

6. User privileges. Do you have differing ranges of privileges in your databases for what database users can see or change or invoke? Is this enforced by the DBMS itself or just by your application, or is it not enforced at all but is just a "supposed to"? Do you want this definable and enforceable at a lower level such that if say your higher level code is overly broad and makes a mistake, it won't end up showing or changing something the user shouldn't be allowed to?

6a. What DBMSs currently support roles or per-user privileges other than simply whether they may login or not? And what sorts of granularity do they have?

6b. How important is it to you to be able to have per-user privileges in your databases? Or how important is it to be able to define these without using SQL?

7. Spatial types and operators. Who uses spatial or GIS or such types or operators? How valuable is it to be able to do so without writing SQL?

8. Special values for numerics. Who uses those IEEE float special values such as NaN or infinities or under/overflows or distinct +/- zero or whatever in their databases, or do you only care about storing or working with normal numbers? If you do use any special values, then which ones and why? Do you currently distinguish between multiple kinds of NaN (eg, 0/0 versus square root of -1 versus whatever), or is a NaN just a NaN? Note that "not using" would mean that an operation that would otherwise produce one instead returns an error / throws an exception.
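For readers unsure which behaviors are meant, the following Python sketch shows the two alternatives contrasted in question 8. The function names `describe_specials` and `checked` are hypothetical, and `checked` is only one possible way to model the "not using" option, where a special value is turned into an error.

```python
import math


def describe_specials():
    """Collect a few observable properties of IEEE special values."""
    nan = float("nan")
    inf = float("inf")
    neg_zero = -0.0
    return {
        # A NaN never compares equal to anything, including itself.
        "nan_equals_itself": nan == nan,
        # Infinity minus infinity has no defined value and yields NaN.
        "inf_minus_inf_is_nan": math.isnan(inf - inf),
        # The two zeros compare equal ...
        "neg_zero_equals_zero": neg_zero == 0.0,
        # ... but the sign is still stored and can be recovered.
        "neg_zero_sign": math.copysign(1.0, neg_zero),
    }


def checked(value: float) -> float:
    """Model the 'not using' option: reject NaN and infinities
    instead of letting them flow into stored data."""
    if math.isnan(value) or math.isinf(value):
        raise ValueError("special float values are not allowed here")
    return value
```

Python itself already takes the "throw an exception" route for some operations: `0.0 / 0.0` raises `ZeroDivisionError` and `math.sqrt(-1)` raises `ValueError` rather than returning a NaN.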


9.  Full text search.  Who uses this versus a simple substring match?

10. Who uses regular expressions of any kind in the database? Either Perl-compatible or otherwise. Basically any kind of pattern matching besides the trivial kind that LIKE supports (literals plus the equivalents of . and .*).
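As one example of what "regular expressions in the database" can look like, here is a minimal sketch in Python using SQLite, which parses the REGEXP operator but leaves its implementation to a user-supplied function. The helper names `regexp` and `names_matching` and the `person` table are hypothetical, and Python's `re` syntax stands in for whatever regex dialect a given DBMS offers.

```python
import re
import sqlite3


def regexp(pattern, value):
    """Backing function for SQLite's REGEXP operator.
    SQLite evaluates `X REGEXP Y` as regexp(Y, X), so the pattern comes first."""
    if value is None:
        return 0
    return 1 if re.search(pattern, value) else 0


def names_matching(pattern, names):
    """Load names into a throwaway table and filter them inside the DBMS."""
    conn = sqlite3.connect(":memory:")
    conn.create_function("REGEXP", 2, regexp)
    conn.execute("CREATE TABLE person (name TEXT)")
    conn.executemany("INSERT INTO person VALUES (?)", [(n,) for n in names])
    rows = conn.execute(
        "SELECT name FROM person WHERE name REGEXP ? ORDER BY name", (pattern,)
    ).fetchall()
    conn.close()
    return [row[0] for row in rows]
```

A pattern such as `^f\w+d$` (starts with f, ends with d, word characters between) cannot be written with LIKE alone, since LIKE has no way to restrict which characters its wildcards match.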

11. Out of band sequence generators. If you read from / increment a sequence generator from within a transaction, do you want a rollback of that transaction to also rollback the sequence generator or not? Do you want both possibilities to be supported? What do current DBMSs support?
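The two behaviors in question 11 can be contrasted with a small Python sketch. It is only a model: SQLite has no sequence objects, so the transactional variant is emulated with a one-row table `seq`, and the out-of-band variant with an in-process counter. All names here are hypothetical.

```python
import itertools
import sqlite3


def sequence_after_rollback():
    """Advance both generators inside a transaction, roll it back, and
    return the next value each one would hand out afterwards."""
    conn = sqlite3.connect(":memory:", isolation_level=None)

    # Transactional generator: its state lives in the database,
    # so it is subject to rollback like any other data.
    conn.execute("CREATE TABLE seq (next_val INTEGER NOT NULL)")
    conn.execute("INSERT INTO seq VALUES (1)")

    # Out-of-band generator: its state lives outside any transaction.
    out_of_band = itertools.count(1)

    conn.execute("BEGIN")
    conn.execute("UPDATE seq SET next_val = next_val + 1")  # consume value 1
    next(out_of_band)                                        # consume value 1
    conn.execute("ROLLBACK")

    transactional_next = conn.execute("SELECT next_val FROM seq").fetchone()[0]
    out_of_band_next = next(out_of_band)
    conn.close()
    return transactional_next, out_of_band_next
```

After the rollback the transactional generator offers 1 again, while the out-of-band one moves on to 2 and leaves a gap. The gap is the usual price for not making concurrent transactions wait on each other for the next value.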

12. Sequence generators that produce values other than integers? Who uses this or what DBMSs support them? And what would be the semantics for non-integers?

13. Transactions and data definition. If you start an explicit transaction, and then do a data definition operation, such as changing the type of a table column, do you want the latter to be subject to the transaction such that you can roll it back? (I believe SQLite subjugates everything to transactions.)
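The behavior asked about in question 13 can be checked directly for SQLite with a short Python sketch. The function name `ddl_survives_rollback` and the table `person` are hypothetical. The connection is opened with `isolation_level=None` so that Python's sqlite3 module adds no implicit transaction handling of its own and only the explicit BEGIN and ROLLBACK apply.

```python
import sqlite3


def ddl_survives_rollback() -> bool:
    """Create a table inside an explicit transaction, roll back,
    and report whether the table still exists."""
    conn = sqlite3.connect(":memory:", isolation_level=None)
    conn.execute("BEGIN")
    conn.execute("CREATE TABLE person (id INTEGER PRIMARY KEY, name TEXT)")
    conn.execute("ROLLBACK")

    # sqlite_master is SQLite's catalog of schema objects.
    count = conn.execute(
        "SELECT count(*) FROM sqlite_master WHERE type = 'table' AND name = 'person'"
    ).fetchone()[0]
    conn.close()
    return count == 1
```

With SQLite the function returns False: the CREATE TABLE is undone together with the rest of the transaction. This covers only one kind of data definition on one DBMS.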

14. Implicit commit of explicit transaction. If you have an explicit transaction, should any operation you do in it other than an explicit 'commit' cause it to commit, or should only 'commit' do that? If someone attempts data definition within a transaction and the DBMS doesn't support subjugating those to transactions, then should the data definition attempt fail or should it cause an automatic commit? (I think the subjugate-or-fail should be the case, and the implicit commit is always bad.) What DBMSs currently support each behavior? (AFAIK, SQLite does not automatically commit, but MySQL does. Know otherwise?)


I may have more questions, but that's for now.

Thank you in advance for any feedback.

-- Darren Duncan


_______________________________________________
List: http://lists.scsys.co.uk/cgi-bin/mailman/listinfo/dbix-class
IRC: irc.perl.org#dbix-class
SVN: http://dev.catalyst.perl.org/repos/bast/DBIx-Class/
Searchable Archive: http://www.grokbase.com/group/[email protected]
