On Friday, 11 October 2013 at 17:49:32 UTC, H. S. Teoh wrote:
On Fri, Oct 11, 2013 at 06:10:19PM +0200, FreeSlave wrote:
There is "Matrices and linear algebra" module in wish list.
Let's
discuss its design. D is complicated language so it's
difficult to
choose the right way here. We need to find compromise between
efficiency and convenient interface. I'm going to make some
suggestions how this module should look like.
I think we need to differentiate between multidimensional arrays (as a data storage type) and linear algebra (operations performed on 2D arrays). These two intersect, but they also have areas that are not compatible with each other (e.g. matrix product vs. element-by-element product). Ideally, we should support both in a clean way.

As far as the former is concerned, Denis has implemented a multidimensional array library, and I've independently done the same, with a slightly different interface. I think one or two others have implemented similar libraries as well. It would be good if we standardized the API so that our code can become interoperable.
Can you please give links to both libraries?
As far as the latter is concerned, I've been meaning to implement a double-description convex hull algorithm, but have been too busy to actually work on it. This particular algorithm is interesting because it (1) stress-tests the performance of D algorithms, and (2) challenges the design of matrix APIs: while the input vertices (resp. hyperplanes) can be interpreted as a matrix, the algorithm itself also needs to permute rows, which means it is most efficient when given an array-of-pointers representation, contrary to the usual flattened representations (as proposed below). I think there's a place for both, which is why we need to distinguish between data representation and the algorithms that work on them.
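For illustration, here is a rough sketch of why row permutation favours an array-of-rows layout over a flattened one (the function names are made up):

import std.algorithm.mutation : swap, swapRanges;

// Array-of-rows: permuting rows is just swapping two slice references, O(1).
void swapRowsNested(double[][] m, size_t i, size_t j)
{
    swap(m[i], m[j]);
}

// Flattened row-major storage: every element of both rows has to move, O(columns).
void swapRowsFlat(double[] data, size_t cols, size_t i, size_t j)
{
    swapRanges(data[i*cols .. (i+1)*cols], data[j*cols .. (j+1)*cols]);
}

unittest
{
    auto nested = [[1.0, 2.0], [3.0, 4.0]];
    swapRowsNested(nested, 0, 1);
    assert(nested == [[3.0, 4.0], [1.0, 2.0]]);

    auto flat = [1.0, 2.0, 3.0, 4.0];
    swapRowsFlat(flat, 2, 0, 1);
    assert(flat == [3.0, 4.0, 1.0, 2.0]);
}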
First of all, it should provide two templates for matrices. Let's call them StaticMatrix and DynamicMatrix. The first one has "templated" size and therefore may use static arrays and compile-time checks. It can be useful when the size is determined by our needs, for example in graphics. DynamicMatrix has variable size, i.e. it should be allocated on the heap. It can be useful in all other math areas.
I like this idea. Ideally, we should have many possible representations, but all conforming to a single API understood by all algorithms, so that you only have to write algorithms once, and they will work with any data structure. That's one key advantage of D, and we should make good use of it.
The problem is that algorithms still have to know the matrix template in order to provide compile-time checks where possible, or to throw exceptions at runtime if something goes wrong.
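Roughly what I mean, as a sketch (the names and layout here are just placeholders, not a proposal):

// Dimensions are template parameters: mismatches are caught at compile time.
struct StaticMatrix(T, size_t rows, size_t cols)
{
    T[rows * cols] data;   // static array, no heap allocation
}

// Dimensions known only at run time: mismatches have to throw.
struct DynamicMatrix(T)
{
    T[] data;              // heap-allocated
    size_t rows, cols;
}

auto multiply(T, size_t n, size_t m, size_t p)(StaticMatrix!(T, n, m) a,
                                               StaticMatrix!(T, m, p) b)
{
    StaticMatrix!(T, n, p) result;
    // ... actual multiplication ...
    // a mismatch of the inner dimensions simply fails to compile
    return result;
}

DynamicMatrix!T multiply(T)(DynamicMatrix!T a, DynamicMatrix!T b)
{
    import std.exception : enforce;
    enforce(a.cols == b.rows, "inner matrix dimensions do not match");
    auto result = DynamicMatrix!T(new T[a.rows * b.cols], a.rows, b.cols);
    // ... actual multiplication ...
    return result;
}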
I do not want to see a repetition of the C++ situation where there are so many different matrix/multidimensional array libraries, and all of them use incompatible representations, so you cannot freely pass data from one to algorithms in the other. Then, when no library meets precisely what you need, you're forced to reinvent yet another matrix class, which is a waste of time.
Both templates should support all floating-point types and, moreover, user-defined ones (for example, wrappers for the GMP library and others).
Definitely, yes.
For efficiency, in both cases matrices should use a one-dimensional array for the inner representation. But actually I'm not sure whether matrices should support other container types besides standard D arrays. The good thing about one-dimensional arrays is that they can be easily exposed to foreign functions, for example to C libraries and OpenGL. So we should take care of the memory layout - at least row-major and column-major. I think it can be templated too.
We should not tie algorithms to specific data representations (concrete types). One key advantage of D is that you can write algorithms generically, such that they can work with *any* type as long as it conforms to a standard API. One excellent example is the range API: *anything* that conforms to the range API can be used with std.algorithm, not just a specific representation. In fact, std.range provides a whole bunch of different ranges and range wrappers, and all of them can automatically be used with std.algorithm, because the code in std.algorithm uses only the range API and never (at least in theory :P) depends on concrete types. We should take advantage of this feature.
It would be good, of course, to provide some standard, commonly-used representations, for example row-major (or column-major) matrix classes / structs, etc. But the algorithms should not directly depend on these concrete types. An algorithm that works with a matrix stored as a 1D array should also work with a matrix stored as a nested array of arrays, as well as a sparse matrix representation that uses some other kind of storage mechanism. As long as a type conforms to some standard matrix API, it should Just Work(tm) with any std.linalg algorithm.
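For illustration, an algorithm written against such an API might look something like this; the rows/cols properties and the two-argument indexing are only an assumed placeholder API, not a settled one:

import std.algorithm.comparison : min;

// Works with *any* type that exposes rows, cols and two-argument indexing,
// no matter how the elements are actually stored.
auto trace(M)(M m)
    if (is(typeof(M.init.rows) : size_t) &&
        is(typeof(M.init.cols) : size_t) &&
        is(typeof(M.init[0, 0])))
{
    typeof(m[0, 0]) sum = 0;
    foreach (i; 0 .. min(m.rows, m.cols))
        sum += m[i, i];
    return sum;
}

Any concrete representation - flat row-major, flat column-major, nested arrays, sparse - would work with it unchanged, as long as it exposes those members.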
But another question arises: which "majority" should we use in the interface? The interface should not depend on the inner representation. All functions need unambiguity, to avoid complicating and repeating the design. Well, actually we could deal with different majorities in the interface - we could provide something like an "asTransposed" adapter that functions apply when needed - but then we would force the user to check the majority of the matrix interface, which is not a very good approach.
Algorithms shouldn't even care what majority the data representation is in. They should only access data via the standardized matrix API (whatever it is we decide on). The input type should be templated so that *any* type that conforms to this API will work.

Of course, for performance-sensitive code, the user should be aware of which representations are best-performing, and make sure to pass in the appropriate type of representation; but we should not prematurely optimize here. Any linear algebra algorithm should be able to work with *any* type that conforms to a standard matrix API.
I'm not sure you understood the distinction between the inner representation's majority and the interface majority. I agree that the inner majority should be defined by the inner type. The interface majority is just the choice between

matrix[rowIndex, columnIndex]

and

matrix[columnIndex, rowIndex]

For the interface majority we simply have to choose one of them and use it consistently all over the library. It is not related to performance.
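To make that concrete, here is a sketch of a column-major storage that still presents the row-then-column interface (the names are placeholders):

// Column-major storage, but the *interface* is still matrix[rowIndex, columnIndex].
struct ColMajorMatrix(T)
{
    T[] data;            // flattened, column-major
    size_t rows, cols;

    ref T opIndex(size_t row, size_t col)
    {
        return data[col * rows + row];   // storage order is an internal detail
    }
}

unittest
{
    auto m = ColMajorMatrix!double(new double[6], 2, 3);
    m[1, 2] = 42.0;                      // always [rowIndex, columnIndex]
    assert(m.data[2 * 2 + 1] == 42.0);   // lands at the column-major position
}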
Sometimes the user takes data from some other source and wants to avoid copying it during Matrix construction, but she also wants to get the matrix functionality. So we should provide an "arrayAsMatrix" adapter that can adapt one-dimensional and two-dimensional arrays, making them feel like matrices. It definitely should not make a copy of a dynamic array, but I'm not sure about static ones.
If a function expects a 1xN matrix, we should be able to pass in an array and it should Just Work. Manually using adapters should not be needed. Of course, standard concrete matrix types provided by the library should have ctors / factory methods for initializing a matrix object that uses some input array as initial data -- if we design this correctly, it should be a cheap operation (the matrix type itself should just be a thin wrapper over the array to provide methods that conform to the standard matrix API). Then if some function F requires a matrix object, we should be able to just create a Matrix instance with our input array as initial data, and pass it to F.
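A cheap, non-copying wrapper along those lines could be as thin as this (arrayAsMatrix and the field names are just placeholders):

// Thin wrapper: just a slice plus dimensions, so no element is copied.
struct ArrayMatrix(T)
{
    T[] data;            // borrows the caller's array
    size_t rows, cols;

    ref T opIndex(size_t row, size_t col)
    {
        return data[row * cols + col];
    }
}

// Hypothetical adapter: wraps an existing 1D array as a rows x cols matrix.
auto arrayAsMatrix(T)(T[] data, size_t rows, size_t cols)
{
    assert(data.length == rows * cols);
    return ArrayMatrix!T(data, rows, cols);
}

unittest
{
    auto raw = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0];  // data from elsewhere
    auto m = arrayAsMatrix(raw, 2, 3);
    m[0, 1] = 20.0;
    assert(raw[1] == 20.0);   // same storage, nothing was copied
}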
About operator overloading. It's quite clear for the 'add' and 'subtract' operations, but what about the product? Here I think all 'op'-functions should be element-by-element operations, so we can use all the other operations too without ambiguity. For the actual matrix multiplication the module can provide a 'multiply' or 'product' function. This is similar to the Maxima approach, except that Maxima uses dot notation for these needs.
Here is where we see the advantage of separating representation from algorithm. Technically, a matrix is not the same thing as a 2D array, because a matrix has a specific interpretation in linear algebra, whereas a 2D array is just a 2D container of some elements. My suggestion would be to write a Matrix struct that wraps around a 2D array, and provides / overrides the overloaded operators to have a linear algebra interpretation.

So, a 2D array type should have per-element operations, but once wrapped in a Matrix struct, it will acquire special matrix algebra operations like matrix products, inversion, etc. In the most general case, a 2D array should be a specific instance of a multidimensional array, and a Matrix struct should be able to use any underlying representation that conforms to a 2D array API. For example:
// Example of a generic multidimensional array type
struct Array(int dimension, ElemType)
{
    ...

    Array opBinary(string op)(Array x)
    {
        // implement per-element operations here
    }
}

// A matrix wrapper around a 2D array type.
struct Matrix(T)
    if (is2DArray!T)
{
    T representation;

    Matrix opBinary(string op)(Matrix x)
        if (op == "*")
    {
        // implement matrix multiplication here
    }

    Matrix opBinary(string op)(Matrix x)
        if (op != "*")
    {
        // forward to representation.opBinary to default
        // to per-element operations
    }

    // Provide operations specific to matrices that don't
    // exist in general multidimensional arrays.
    Matrix invert()
    {
        ...
    }
}

Array!(2, float) myArray, myOtherArray;
auto arrayProd = myArray * myOtherArray;   // per-element multiplication

auto A = Matrix(myArray);                  // wrap array in Matrix wrapper
auto B = Matrix(myOtherArray);
auto C = A * B;                            // matrix product
The idea of the Matrix struct here is that the user should be free to choose any underlying matrix representation: a 1D array in row-major or column-major representation, or a nested array of arrays, or a sparse array with some other kind of representation. As long as they provide a standard way of accessing array elements, Matrix should be able to accept them, and provide matrix algebra semantics for them.
Transposition. I've already mentioned the "asTransposed" adapter. It should be useful to make a matrix feel like it is transposed without copying it. We can also implement 'transpose' and 'transposed' functions. The first one transposes a matrix in place; it's actually not allowed for a non-square StaticMatrix, since we can't change the size of this type of matrix at runtime. The second one returns a copy, so it's applicable in all cases. Actually, I'm not sure whether these functions should be member functions or not.
The most generic approach to transposition is simply a reordering of indices. This difference is important once you get to 3D arrays and beyond, because then there is no unique transpose, but any permutation of array indices should be permissible. Denis' multidimensional arrays have a method that does O(1) reordering of array indices: basically, you create a "view" of the original array that has its indices swapped around. So there is no data copying; it's just a different "view" into the same underlying data.
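A minimal sketch of such a transposed view, assuming the two-argument indexing API used in the earlier sketches:

// O(1) "view": indices are swapped on access, no data is copied.
struct TransposedView(M)
{
    M* source;

    auto rows() { return source.cols; }
    auto cols() { return source.rows; }

    auto opIndex(size_t row, size_t col)
    {
        return (*source)[col, row];              // read: just reorder the indices
    }

    void opIndexAssign(V)(V value, size_t row, size_t col)
    {
        (*source)[col, row] = value;             // write: same reordering
    }
}

auto asTransposed(M)(ref M m)
{
    return TransposedView!M(&m);
}

With this, asTransposed(m)[i, j] reads and writes m[j, i] without touching the underlying storage.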
This approach of using "views" rather than copying data allows
for O(1)
submatrix extraction: if you have a 50x50 matrix, then you can
take
arbitrary 10x10 submatrices of it without needing to copy any
of the
data, which would be very expensive. Avoiding unnecessary
copying
becomes very important when the dimension of the array
increases; if you
have a 3D or 5D array, copying subarrays become extremely
expensive very
quickly.
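The same trick gives O(1) submatrices; a sketch under the same assumed API:

// O(1) submatrix "view": an offset plus dimensions, sharing the parent's storage.
struct SubmatrixView(M)
{
    M* source;
    size_t rowOffset, colOffset;
    size_t rows, cols;

    auto opIndex(size_t row, size_t col)
    {
        return (*source)[rowOffset + row, colOffset + col];
    }

    void opIndexAssign(V)(V value, size_t row, size_t col)
    {
        (*source)[rowOffset + row, colOffset + col] = value;
    }
}

// e.g. an arbitrary 10x10 window into a 50x50 matrix, with no elements copied
auto submatrix(M)(ref M m, size_t rowOffset, size_t colOffset,
                  size_t rows, size_t cols)
{
    return SubmatrixView!M(&m, rowOffset, colOffset, rows, cols);
}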
A .dup method should be provided in the cases where you actually *want* to copy the data, of course.
Basically, subarrays / transpositions / index reordering should be regarded as generalizations of D's array slices. No data should be copied until necessary.
Invertible matrix. It must not be allowed for square
StaticMatrix.
You mean for non-square StaticMatrix?
Yes, non-square. My bad.
Well, ok. We want to abstract away from the inner representation to give users freedom. We end up in metaprogramming and generic programming here, so we need to define concepts, just like Boost/STL/std.range do. The good thing is that in D, types with different interfaces and syntax constraints can satisfy the same concept, which would be impossible or very difficult in C++, thanks to static if and is(typeof()). For example, the inner representation type can provide the [][] operator or the [,] operator, and the Matrix type will understand both cases.
Suppose:
template canBeMatrixRepresentation(T)
{
    import std.range : ElementType;

    enum bool canBeMatrixRepresentation = is(typeof(
    {
        T t;                        // default constructible
        const(T) ct;
        alias ElementType!T E;      // has ElementType
        E e;                        // element type is default constructible

        static if (/*has [,] operator*/)
        {
            t[0, 0] = e;            // can be assigned
            e = ct[0, 0];           // can retrieve element value from const(T)
        }
        else static if (/*has [][] operator*/)
        {
            t[0][0] = e;            // can be assigned
            e = ct[0][0];           // can retrieve element value from const(T)
        }
        else
        {
            static assert(false);
        }

        size_t rows = ct.rowNum;    // has row number
        size_t cols = ct.columnNum; // has column number
        t.rowNum = size_t.init;     // row and column numbers are assignable
        t.columnNum = size_t.init;
    }));
}
We see that a two-dimensional D array does not satisfy this concept, because it has no rowNum and columnNum, so it should be handled separately. This concept is not ideal, since not all types may provide assignable rowNum and columnNum. Also, the concept should expose whether the type is "static" or not, so algorithms will know whether they can use compile-time checks.

Types can also provide a copy constructor. If they do, then Matrix will use it; if they don't, then Matrix will do an element-by-element copy. It can also try the .dup property.

This is just an example of how it could work, but I hope the point is clear. We also need a Matrix concept (or separate concepts for StaticMatrix and DynamicMatrix).
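As a starting point, such a concept could look something like this (again just a sketch, reusing the rowNum/columnNum names from above):

template isMatrix(M)
{
    enum bool isMatrix = is(typeof(
    {
        M m;
        size_t rows = m.rowNum;      // has dimensions
        size_t cols = m.columnNum;
        auto e = m[0, 0];            // two-argument element access
        m[0, 0] = e;                 // element assignment
    }));
}

// A "static" matrix additionally exposes its dimensions at compile time,
// so algorithms can use static asserts instead of runtime checks.
template isStaticMatrix(M)
{
    enum bool isStaticMatrix = isMatrix!M &&
        is(typeof({ enum r = M.rowNum; enum c = M.columnNum; }));
}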