Re: Collation implementation WAS Re: Should COLLATION attribute related code go in BasicDatabase?

Daniel John Debrunner Thu, 15 Mar 2007 13:40:13 -0800

Mike Matrigali wrote:

Rick Hillegas wrote:
Thanks, Mike. This overhead seems pretty small to me. It's hard forme to predict whether this is useful generality or over-design.
In the SQL standard, collations can be declared per column. Thataffects index descriptors. In addition, via CASTs, collations can bedeclared per sortable expression in an ORDER BY clause. That affectsthe sorter. I'm not the person scratching this initial itch. I justwant to register my instinct to design-in the generality up front. Ithink this has two advantages:
1) It will remove an upgrade issue later on when someone wants toimplement more of the SQL collation support.
2) It generally lowers the barrier to implementing more of the standard.

Regards,
-Rick
I am just not sure how comfortable I feel forcing an upgrade issue on a
developer for a particular feature that is not their itch. Mamta istrying to solve single collation database problem, not full SQLcollation support.

There's a number of factors that come in, one is the long termmaintainability of the code. I think that trumps any single developer'sitch. The developer can work with the community in coming up with asolution that keeps a good balance between what the community see asmaintainability and scratching their itch.

I'm actually trying to save the contributor (Mamta) work here, I thinkchanging all the locations that generate characters to have the correct"new-character-type" is a huge amount of work and subject to errors(just from the amount of changes and interesting situations). E.g. insome situations a literal will be a CHAR (sorting by ucs_basic) andothers a CHAR (sorting by locale). That decision may not be able to bemade until very late in the bind time, and may not possibly even mattereven thought code would have to pick one. Only caring about this whencollation is involved may make it easier.

Your suggestion may get us more there, not arguing that.  But a solution
shorter along an agreed upon direction seems fine to me, and I would not
hold up a developer contribution that did that. If the community feelsthat
4 new classes is ok, but 4 new types is not the right direction then
it is reasonable to work with the community to get the direction right.
I am waiting on Dan's reply as I think there are SYSTABLES and/orSYSCOLUMNS metadata changes necessary that haven't been discussed.

No changes to SYSTABLES or SYSCOLUMNS are needed for what Mamta isproposing. Support per-schema collation would probably need some change,though strangely enough per-column would probably not.

Possibly changing the way TypeDescriptorImpl writes itself out to diskwould be needed, but there's enough room in the current format to storethe collation information by overloading the on-disk space occupied byscale, since scale is always zero for a character type.


Dan.

Re: Collation implementation WAS Re: Should COLLATION attribute related code go in BasicDatabase?

Reply via email to