Re: between and among: worth Phobosization?

Chris Cain Tue, 17 Dec 2013 17:31:21 -0800

On Tuesday, 17 December 2013 at 22:14:41 UTC, H. S. Teoh wrote:

Ah, I see what you're getting at now. I think this idea has amerit onits own; I'm just not sure if it is useful as an actualintermediate
data *type*.


The use over a function would be

1. Contain all of the complexity that working with intervals on(in this case) integers. It's been shown enough times that thestraight-forward way of dealing with this is error-prone.2. Maintain performance characteristics as much as possible.Without an object, a function doing this sort of thing would haveto revalidate the bounds each time or, worse, NOT validate thebounds at all (with in contracts, we can validate each timebecause release code will take the contracts out, but it's stillpotentially an issue). With an object we can cache any type ofvalidations and/or assertions needed and, potentially, improveperformance in some cases.3. Allow for existing functions to specialize when an interval isgiven, when appropriate.

But, putting that aside, I think the concept does serve itspurpose.It's a pity that the word 'range' already has an assignedmeaning in D,because otherwise that would be the best name in this case(i.e., rangein the mathematical sense of being a contiguous subset of, say,thenumber line). So, for the lack of a better name, let'stentatively callit "Bounds" (as in, the set of elements bounded by upper andlower
bounds), which may be defined, at least conceptually, as:

Just to step up your idea to something a bit closer to complete(still have not thrown it into a compiler or anything yet):

http://www.dpaste.dzfl.pl/19c80ff9

(And I really like the idea of a CtInterval, but haven't doneanything with it so I've excluded it in the above paste)

It'd also be needed for it to have a simple way to get thesmallestacceptable type for the range of values the "between" objectcouldrepresent. So a for a Between!(uint, int) that would be auint, and aBetween!(int, uint) that would be a long, and so on. Obviouslysomethings _don't_ have acceptable types, such as a Between!(long,ulong)(no integral type currently can actually hold all of thosevalues).
There's nothing wrong with Bounds!(long,ulong); it just won'thave anopApply method, that's all. :) It can be convenientlystatic-if'd out inthat case. It can still represent number ranges beyond thecurrent rangeof built-in types, like [long.min, ulong.max], and you can testformembership with various types. This allows you to testvariables ofdifferent types, like ints and uints, so the ability torepresent such a
range is still useful.

Well, I'm not suggesting that the interval not be allowed... butfor things that use that interval, they may produce some sort ofoutput. If they're using the interval to output, then they'llneed to know what data type the output needs to be. It'd beconvenient if some standard function existed to accomplish thattask in a standard way.

The example I'm using for this is if std.random.uniform took inan interval, what would its output be? Obviously, it couldn'toutput something in [long.min, ulong.max], but it's possible itcould spit out an answer in [byte.min, ubyte.max] since a shortcould contain all of those values.

Something like this, like I showed, could be used to pass tootherfunctions like std.random.uniform which request a range togenerate.Or you should be able to pass it to something likestd.algorithm.find,
std.algorithm.count, etc (predicates that take one parameter).
While you *could* implement the input range API for the Boundsstructfor this purpose, it's probably better to define specialoverloads forfind and count, since you really don't want to waste timeiterating overelements instead of just directly computing the narrowed Boundsstruct
or subtracting the lower bound from the upper, respectively. For
example:

Sorry, confusion based on using the word "range" again. When Isaid range, I meant bounds/interval in this case. Functions thatrequest some sort of interval or bounds should use intervalinstead of trying to do anything on its own (since the "do yourown thing" is increasingly being found to be errorprone).


So, something like this should work:

    unittest
    {
        import std.algorithm;
        assert(
            find!"a in b"([5, 6, 2, 9], interval(1, 4))
                == [2, 9]);
        // uses std.algorithm.find

        assert(
            count!"a in b"([5, 6, 1, 3, 9, 7, 2], interval(1,3))
                == 3);
        // uses std.algorithm.count

        import std.random;
        foreach(_; 0..10000)
            assert(uniform(interval(1,5)) in interval(1,5));
        // Nice assertion, right?
    }

It might also be useful in some circumstances to be able to knowhow many values are in the interval (sort of like a "length" or"size") but if you have an interval of [long.min, ulong.max] ...well, you know the problem.

Considering what Andrei said, we might could expand this conceptto support the interval arithmetic. We'd also need to be able tosupport intervals like (-oo, oo), (-oo, x], [x, oo) ... where themembership test returns true, <=x, and >=x respectively (whiletaking care of the issues that exist with signed/unsignedcomparisons, obviously). That said, not all functions will wantto handle those types of intervals (std.random.uniform, forinstance).

Re: between and among: worth Phobosization?

Reply via email to