Hello all. I think it's time to put some effort into deciding where to
go next with ActiveWarehouse, so here goes. First, let's look at where
we are today:

* The ActiveWarehouse Rails plugin does a decent job of defining
classes that can be used to model a dimensional data warehouse. This
includes classes such as Fact, Dimension and Bridge. These classes, in
conjunction with their migration generators, can be used to construct
a basic dimensional model. Additionally, we have mix-ins for slowly
changing dimensions (SCDs) and hierarchical dimensions, as well as
classes for dimensional views (role playing) and date dimensions.

* The ActiveWarehouse ETL library is fairly full-featured. It supports
file source data (delimited, fixed-width and XML), inline data
produced by a Ruby Enumerable, data from relational databases and data
from Rails model objects. It supports a variety of built-in transforms
as well as custom transforms and inline block transforms. Finally, it
supports output to flat files as well as databases, and supports bulk
upload through the adapter extensions library for MySQL, PostgreSQL
and SQL Server.

These are the bright spots. The areas which need improvement are:

* ActiveWarehouse Rails plugin reporting tools. Right now AW doesn't
give you much in terms of a reporting and query front end. The
report_helper.rb code is horrendous and is basically all there is for
reporting.

* Aggregation in AW is all over the place right now, and none of the
aggregation implementations delivers much in the way of
high-performance aggregation. You can still execute queries through
the NoAggregate implementation; however, for anything substantial
you're talking about potentially slow queries.
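
To make the aggregation trade-off concrete, here is a rough sketch in
plain Ruby. It does not use the AW API; the fact rows and the method
names (sum_on_the_fly, build_rollup) are made up for illustration. It
contrasts scanning the facts at query time, which is roughly what
running without aggregates amounts to, with building a rollup once and
answering queries by lookup.

```ruby
# Hypothetical fact rows; in a real warehouse these would be rows in a
# fact table, not an in-memory array.
FACTS = [
  { year: 2006, month: 12, store: "A", amount: 100 },
  { year: 2007, month: 1,  store: "A", amount: 250 },
  { year: 2007, month: 1,  store: "B", amount: 75 },
  { year: 2007, month: 2,  store: "A", amount: 50 }
].freeze

# No aggregation: every query scans all of the fact rows.
def sum_on_the_fly(facts, year)
  facts.select { |f| f[:year] == year }.sum { |f| f[:amount] }
end

# Pre-computed rollup: group the facts once (for example at load time),
# keyed by the chosen dimension attributes, then answer by lookup.
def build_rollup(facts, *keys)
  facts.each_with_object(Hash.new(0)) do |f, rollup|
    rollup[f.values_at(*keys)] += f[:amount]
  end
end

ROLLUP_BY_YEAR = build_rollup(FACTS, :year)

puts sum_on_the_fly(FACTS, 2007) # scans every row on each call
puts ROLLUP_BY_YEAR[[2007]]      # single hash lookup, same total
```

The rollup costs extra work and storage up front and has to be rebuilt
when facts change, which is the usual price of faster queries.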

So what's next, then?

* Release ActiveWarehouse ETL 1.0. I think it's time to freeze this
thing. It has documentation, a reasonable number of tests and is
essentially usable right now so I think it's time for a release.

* Focus on the AW front end. AW needs to come ready to go with at
least some basic reporting tools on the front.

* Clean out all of the old aggregation techniques and settle on
something specific, perhaps based on aggregation techniques in other
open source data warehouse libraries?

What other items do you think should be part of upcoming AW
development? If you have time and are interested in contributing,
please let me know, as AW can always benefit from more developers with
a passion for data warehousing.

V/r
Anthony Eden
