Re: GSoC-2011 project:: Containers

Steven Schveighoffer Mon, 28 Mar 2011 04:57:42 -0700

On Fri, 25 Mar 2011 18:37:26 -0400, Jonathan M Davis <[email protected]>wrote:

On 2011-03-25 05:41, spir wrote:
About D collections: aside std.container in Phobos, Steven Schweighoffer
has a fairly advanced project called dcollections:
http://www.dsource.org/projects/dcollections. As I understand it, it isabit of a concurrent for std.container, but there seems to be apossibility
for them to converge in the future. In any case, you should definitely
study it, if only to take inspiration and avoid double work.
dcollections is Steven Schweighoffer's project which has existed sinceD1.std.container is the container module for Phobos. So, they aren't,strictlyspeaking related. When designing std.container and planning out howcontainersshould be done in Phobos, Andrei took a different approach than Stevedid. So,nothing can be taken from dcollections and simply plopped intostd.container.However, dcollections 2.0 does use the Boost license, so the code fromthere
can be refactored to work in std.container. Steve already did that with
RedBlackTree. He ported std.RedBlackTree from whatever his red-black tree
implementation is in dcollections. So, if it makes sense, code can betakenfrom dcollections and ported to Phobos (and Steve would obviously be agoodguy to talk to about that). However, anyone doing that needs to be awareofthe differences in how dcollections works vs how std.container works(e.g.
dcollections has cursors whereas std.container uses ranges exclusively).

Any of the private implementations inside dcollections are usable instd.container, because they do not expose any public interface. In fact,dcollections' types can be separated into two categories -- interfaces andimplementations. The implementations have a very raw, unsafe, simple API(no range or cursor support there). The interface types (which BTW areimplemented via final classes) expose the common public interface that alldcollections classes have, including ranges and cursors.

It is this separation which allowed me to port red black tree tostd.container by changing nothing in the red black node implementation.If you compare RBNode in std.container and RBNode in dcollections, you'llfind them virtually identical (little cleanup here and there). In fact, Iplan to have dcollections' RBTree use std.container's RBNode to avoid codeduplication.

Unfortunately, red black tree is the most complex part of dcollections, sothere is not much else to gain by porting to std.container. I think Dequewould be a good one, even though it's implementation is not separate (theimplementation is based on builtin arrays), so the port would be moreinvolved. You could also take the Link implementation (dual-linked list),but that is simple enough to write from scratch ;) The Hash is extremelynaive and basic, so I'm not sure it's worth copying. I'm not analgorithms expert.

Aside from porting dcollections/implementing equivalent types, I thinkthere are some things that would be good to have in phobos:

* Conceptual types that use the implementations, such as a map type.These *should* be implementation agnostic as long as you use templateconstraints to identify the appropriate functions required. Doing thisshould test the completeness of the functions that the containers define.* Custom allocation. This has increased dcollections' performancesignificantly.

If you have any questions, do not hesitate to email me at this address. Iwould be a mentor for this, but 1) I don't have much time (not sure whatis required) and 2) I have a severe difference of opinion from Andrei onwhat is good in collections, I don't want to guide someone to designs/codethat won't be accepted.


-Steve

Re: GSoC-2011 project:: Containers

Reply via email to