I started working on supporting ndarray inside Postgres before I came across MADlib. My motivation was the idea that it should be easier to perform computation local to the data where appropriate.

Currently the only type I've been formally working on is a simple ndarray, though I've seen at least simple uses of a DataFrame work. I'm part-way through adding SQL-level versions of basic ndarray functions. I've got casts between ndarray and relevant Postgres base data types. The storage is currently rather brain-dead (simply storing the version 0 pickle), but the Postgres ndarray type does have it's own version byte, so this could be significantly more sophisticated in the future.

Even if this wouldn't be useful to MADlib, I'd love any feedback on what it could be useful for. I suspect there's a lot that it could be useful for, but I'm more data architect than data scientist.
--
Jim Nasby, Data Architect, Blue Treble Consulting, Austin TX
Experts in Analytics, Data Architecture and PostgreSQL
Data in Trouble? Get it in Treble! http://BlueTreble.com

Reply via email to