Re: [sqlite] BETWEEN and explicit collation assignment

Igor Tandetnik Thu, 22 Aug 2013 10:17:55 -0700

On 8/22/2013 11:49 AM, Simon Slavin wrote:


On 22 Aug 2013, at 2:36pm, Igor Tandetnik <[email protected]> wrote:

On 8/22/2013 8:52 AM, Simon Slavin wrote:

Nevertheless do you understand the point I'm trying to make -- that collations 
are a modifier for comparisons not individual values ?


I do understand your point. I still don't understand how it's supposed to apply 
in practice to a situation like

create table t(x text collate nocase);
select * from t where x = 'a';

Here, at the time I specify "collate nocase", there is no comparison it could 
apply to. So what is it a property of, in your thinking?


COLLATE is a property of that column

So sometimes it's a property of a column, and other times it's aproperty of a comparison operator? I predict you are going to have ahard time describing this notion in a formal spec.

Your second line doesn't specify a COLLATE operator in its expression, so I 
have no problem with it.

But again, by what formal mechanism does a property of the column affectthe behavior of the operator? And if you are OK with it doing that, thenwhat's wrong with the existing model, which is based on this veryapproach (collation is a property of an expression; behavior of anoperator depends on collations associated with its operands)?

Let's put it this way: why should there be a fundamental differencebetween an expression (x) where x is a column declared with COLLATENOCASE clause, and an expression ('a' COLLATE NOCASE)? What purposewould such a distinction serve? Why exactly are you OK with the former,but not the latter?

By what mechanism does it end up applying to x='a' comparison (I assume to do 
want the statement to return rows both with 'a' and 'A' in column x)? How would 
you modify the formal spec at http://sqlite.org/datatype3.html to lead to your 
desired outcome? Precise wording matters.


I don't understand why binary comparison operators are on that page at all.  
They aren't used directly as column definitions, only as parts of expressions, 
and expressions are defined on another page.  If you remove mention of 
comparison operators from that page, the rest of that page is fine.

Precise wording as you requested ?  Remove all of section 6.1 apart from the 
last paragraph.

But again, I assume you do want the expression (x='a') to sometimesevaluate to true when column x contains the value 'A', and other timesevaluate to false. How would this happen? It doesn't matter which pagedescribes the behavior: if it's nothttp://www.sqlite.org/datatype3.html, then I imagine it would besomewhere on http://www.sqlite.org/lang_expr.html. What should it say?

As I said, I could relate to your point of view better if you just did this: 
inhttp://sqlite.org/datatype3.html section 6.1, replaced two occurrences of "with precedence 
to the left operand" with "It's an error if two operands have different collations". 
This keeps the existing, well defined mechanisms intact, while neatly excluding the case you seem 
to find most objectionable.


As I wrote originally, my problem is not with the use of COLLATE in column 
definitions, it's with its use in expressions.  So my problem in documentation 
of SQLite doesn't come in datatype3, it comes in

<http://www.sqlite.org/lang_expr.html>

Step 1 would be remove all ability to specify collation applying to a single 
value.

But it already applies, implicitly, to a single value that happens to bea column name. It seems you want to preserve that, right?

 This is just two short paragraphs on that page (search for the word 'collate').

One of those paragraphs says: "See the detailed discussion on collatingsequences in the Datatype In SQLite3 document for additionalinformation." You claim that you want to excise that very discussionfrom "datatypes" article - doesn't that mean that you would have to moveit here, rather than incorporating it by reference? The behavior must bedescribed *somewhere*.

This would remove all trace of the use of COLLATE I have a problem with, i.e. 
where it can be used in such a way as to be applied to a single value and not a 
comparison.

However, a collation would still apply to a single value that happens tobe a column name, and comparison operators where such a value is anoperand would have to take that into account somehow, wouldn't they? Youare not saying that (x='a') should always use BINARY collationregardless of how column x was declared, are you? And if you define somemechanism by which the comparison takes the collation of x into account,then I don't understand why it's fundamentally awful and wrong to extendthat same mechanism to the expression (('A' collate nocase) = 'a').

You instead suggest the invention of a separate mechanism whereby thecollation is assigned directly to the operator rather than inferred fromits operands; but you also need to keep the original mechanism around.So now you have to specify two mechanisms, plus the interaction betweenthem. See http://en.wikipedia.org/wiki/Occam's_razor : "entities mustnot be multiplied beyond necessity."

--
Igor Tandetnik

_______________________________________________
sqlite-users mailing list
[email protected]
http://sqlite.org:8080/cgi-bin/mailman/listinfo/sqlite-users

Re: [sqlite] BETWEEN and explicit collation assignment

Reply via email to