Re: [rust-dev] Persistent data structures

Michael Woerister Wed, 04 Dec 2013 01:52:11 -0800

I've implemented a persistent HAMT [1] a while back:
https://github.com/michaelwoerister/rs-persistent-datastructures/blob/master/hamt.rs

It's the data structure used for persistent maps in Clojure (and Scala, I think). It's not too hard to implement and it's pretty nifty. I'm not sure about the performance with atomic reference counting being used for memory management. It will definitely be worse than with a stop-the-world garbage collector. Although, it's noteworthy that lookups in the data structure only have to copy one `Arc` for returning the result, so the high fan-out of the data structure should not hurt if you mostly read from it. I'd be very interested in a performance comparison to other persistent map implementations in Rust (e.g. a red-black tree or splay tree).
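To illustrate where the high fan-out comes from, here is a minimal sketch of the usual HAMT indexing trick, written in present-day Rust rather than the 2013 dialect. It assumes 32-way nodes that consume 5 hash bits per level and store only their occupied children in a compact array; the name `sparse_index` and the constants are hypothetical and are not taken from hamt.rs.

```rust
/// Assumed number of hash bits consumed per trie level (32-way fan-out).
const BITS_PER_LEVEL: u32 = 5;

/// Returns the position of the child for `hash` inside a node's compact
/// child array, or `None` if the node has no child in that slot.
///
/// `bitmap` has bit `i` set when logical slot `i` (0..32) is occupied.
fn sparse_index(bitmap: u32, hash: u64, level: u32) -> Option<usize> {
    // Pick the 5-bit chunk of the hash that belongs to this level.
    let slot = ((hash >> (level * BITS_PER_LEVEL)) & 0x1f) as u32;
    let bit = 1u32 << slot;
    if bitmap & bit == 0 {
        None
    } else {
        // Count the occupied slots below `slot`: that is the array index.
        Some((bitmap & (bit - 1)).count_ones() as usize)
    }
}
```

A lookup repeats this step once per level, so it only follows pointers on the way down and needs to clone a single `Arc` for the value it finally returns.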


Here are some things I came across while implementing this:

* I too discovered that I couldn't parametrize on the type of reference being used without higher kinded types. I did the implementation with regular @ pointers at first and later switched to `Arc`, since concurrent contexts are where persistent data structures really shine. Switching the implementation from @ to Arc was pretty straightforward.
* I found there is no standardized trait for persistent maps in libstd or libextra. It would be nice to have one!
* It's probably a very good idea to provide a non-persistent "builder" that avoids the excessive copying during the insertion phase. In Clojure one can switch between "transient" and persistent mode for a data structure instance which also allows for optimized batch modifications. An `insert_from(iterator)` method might also do the trick. There's quite a bit of design space here.
* I would have liked to avoid some allocations and pointer chasing by using fixed size vectors directly within nodes but I could not get that to work without a lot of unsafe code that I was not sure would be correct in all cases. So I just used owned vectors in the end.
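As a rough illustration of the trait and `insert_from` ideas above, here is a sketch in present-day Rust. The trait name `PersistentMap`, its method set, and the toy `AssocList` type are all hypothetical; nothing like this exists in the standard library, and the association list only stands in for a real HAMT to keep the example short.

```rust
use std::sync::Arc;

/// Hypothetical trait: every update returns a new map, `self` stays valid.
trait PersistentMap<K, V>: Clone {
    fn empty() -> Self;
    fn get(&self, key: &K) -> Option<Arc<V>>;
    fn insert(&self, key: K, value: V) -> Self;

    /// Naive default: one persistent insert per item. A real implementation
    /// could override this with a transient/builder-style fast path.
    fn insert_from<I: IntoIterator<Item = (K, V)>>(&self, items: I) -> Self {
        items
            .into_iter()
            .fold(self.clone(), |map, (k, v)| map.insert(k, v))
    }
}

/// Toy persistent map: an immutable linked list of key/value pairs.
struct Node<K, V> {
    key: K,
    value: Arc<V>,
    next: Option<Arc<Node<K, V>>>,
}

struct AssocList<K, V> {
    head: Option<Arc<Node<K, V>>>,
}

// Manual impl so that cloning only bumps a reference count and does not
// require `K: Clone` or `V: Clone`.
impl<K, V> Clone for AssocList<K, V> {
    fn clone(&self) -> Self {
        AssocList { head: self.head.clone() }
    }
}

impl<K: PartialEq, V> PersistentMap<K, V> for AssocList<K, V> {
    fn empty() -> Self {
        AssocList { head: None }
    }

    fn get(&self, key: &K) -> Option<Arc<V>> {
        let mut cur = self.head.as_ref();
        while let Some(node) = cur {
            if node.key == *key {
                // The only reference-count update a lookup performs.
                return Some(Arc::clone(&node.value));
            }
            cur = node.next.as_ref();
        }
        None
    }

    fn insert(&self, key: K, value: V) -> Self {
        // Prepend a node; the old list is shared as the tail, and a newer
        // entry shadows any older entry with the same key.
        AssocList {
            head: Some(Arc::new(Node {
                key,
                value: Arc::new(value),
                next: self.head.clone(),
            })),
        }
    }
}
```

The default `insert_from` still pays for one persistent insert per element; the point of a builder or transient mode would be to replace it with in-place mutation while the structure is not yet shared.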


Looking forward to seeing more in this area :)

-Michael

[1] https://en.wikipedia.org/wiki/Hash_array_mapped_trie

On 04.12.2013 08:28, Isaac Dupree wrote:

I'm interested in having persistent data structures[1] in Rust. To start, I implemented the simplest one, the cons/nil list (it looks like extra::list::List has another implementation of it). Am I using Rust conventions properly?
My persistent list code:
https://github.com/idupree/rust-code/blob/master/persistent.rs
My next goal is a persistent tree-map, probably cribbing from Haskell's Data.Map.
Is Rc the best smart pointer for persistent data structures? Is it possible for the structure to be parametrized on smart pointer?
Rc requires the contained data to be Freeze or Send or risk reference cycles. Gc requires T:'static (which means no borrowed pointers besides &'static ones within the type). Every Send type is 'static, but not every Freeze type is 'static, so neither Rc nor Gc is strictly more flexible. Arc is Send, unlike either Rc or Gc, but has more overhead and can only contain Freeze+Send data; someone wanting to share persistence between tasks (conceivably for the sake of memory-use or asymptotic time) would want it.
Is it possible to implement FromIterator<T> for List<T> without using O(n) temporary space or "unsafe" code? The problem is that the list comes out reversed in an obvious implementation. (O(n) stack space via recursion, or an O(n) intermediate data structure, or unsafely constructing a cons cell before constructing its tail.)
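For readers on present-day Rust (not the 2013 dialect discussed in this thread), one possible answer is sketched below. It assumes a hypothetical `List` enum with `Rc` tails, not the type from persistent.rs, and relies on `Rc::get_mut`, which hands out a mutable reference only while the `Rc` has a single owner. That is always the case for the freshly allocated tail, so the list can be built front to back with no `unsafe` and no intermediate collection.

```rust
use std::iter::FromIterator;
use std::rc::Rc;

/// Hypothetical cons/nil list with reference-counted tails.
#[derive(Debug, PartialEq)]
enum List<T> {
    Nil,
    Cons(T, Rc<List<T>>),
}

impl<T> FromIterator<T> for List<T> {
    fn from_iter<I: IntoIterator<Item = T>>(iter: I) -> Self {
        let mut head = List::Nil;
        // `cursor` always points at the trailing `Nil` of the list so far.
        let mut cursor: &mut List<T> = &mut head;
        for item in iter {
            // Overwrite the trailing `Nil` with a new cell and a fresh `Nil` tail.
            *cursor = List::Cons(item, Rc::new(List::Nil));
            // Advance into the new tail; the `Rc` was just created, so it is
            // uniquely owned and `get_mut` cannot fail here.
            cursor = match cursor {
                List::Cons(_, tail) => {
                    Rc::get_mut(tail).expect("freshly created Rc is uniquely owned")
                }
                List::Nil => unreachable!(),
            };
        }
        head
    }
}
```

With this impl, `vec![1, 2, 3].into_iter().collect::<List<i32>>()` yields the elements in their original order using O(1) extra space beyond the list itself.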
[1] https://en.wikipedia.org/wiki/Persistent_data_structure , like every data structure in Haskell
-Isaac
_______________________________________________
Rust-dev mailing list
[email protected]
https://mail.mozilla.org/listinfo/rust-dev

