DataStage was originally created by VMARK, using UniVerse 9.4 as its underlying engine and UniVerse Objects for communication between its client and "server" components. It also made use of BCI for communicating with other databases using ODBC protocols, and GCI for communication with C-callable database client software. (It also leveraged VMARK's experience at migrating from one database to another (to UniVerse!), for example in its understanding that the process needs not just data but, even more important, metadata. It (DataStage) also made use of the BASIC programming language in UniVerse to allow users to create their own transformation functions and callable subroutines.
As VMARK merged with UniData to become Ardent, which was then acquired by Informix, DataStage remained a UniVerse application, with DataStage 5.2 being the last such version, being built on UniVerse 9.6. Some of the DW pieces from the UniData side of things became adjoint products (Quality Manger, PRISM Warehouse Executive to name two), while some of that technology was incorporated into DataStage. When Informix, by that time controlled by the ex-Ardent board (and that's an interesting story in itself!), sold their databases to IBM, arrangements were put in place that both companies would begin from the same base, but development would occur separately as the separate products' requirements dictated. In particular, it became necessary that the DataStage Engine and UniVerse could run on the same machine. To achieve this, the "home" directory for DataStage is now recorded in /.dshome on UNIX platforms, rather than /.uvhome, the RPC daemon runs as dsprcd, the deadlock daemon runs as dsdlockd, and so on. Shared memory identifiers use a different key; where UniVerse keys have the form 0xace..., DataStage keys have the form 0xdae... There are a few other changes at this "magic number" level, one implication of which is that BASIC code compiled on the one will not run on the other. No less significant (for ETL performance) are some things that were removed from the DataStage Engine as unnecessary. For example, it does not have a spooler subsystem, it does not have networking software like UV/Net. Some of the configuration parameters have different default values, and there are some configuration parameters in DataStage that simply do not exist in UniVerse. Probably the major change in DataStage is the ability to cache hashed files in memory. One of the most heavy-duty tasks in ETL processing is reference lookups; if the source can be guaranteed to be in memory these can be very fast. Since the split, it has been possible to cache hashed files in memory for writing, too; when populating the hashed file writes to random groups occur into memory then, when they are all done, the file is flushed to disk one group at a time, in group order. Very efficient. Currently (version 7.0 of DataStage), there is still support for the query langugages, paragraphs, even PROCs. If you were to move to a DataStage site, you could find your way around. Although the preferred design approach is to use the GUI and not to code things - this approach is seen as being more easy to maintain, plus there are benefits in ease of documentation. There have been a lot of non-UniVerse things done to DataStage since the split with UniVerse; in particular there is a second engine, called Orchestrate, which Ascential got when it acquired Torrent Systems. This is a mechanism for automatically generating parallel processing in an SMP or MPP environment. In a fairly busy burst of acquisition, using some of the funds from IBM, they also acquired MetaRecon data profiling software from MetaGenix, INTEGRITY data cleansing software from Vality, and Mercator most recently of all. These technologies are integrated into the overall scheme of what Ascential calls "Enterprise Data Integration" in which DataStage is only one pillar. Underpinning it all is MetaStage, a technology for managing metadata in a semantic fashion so that metadata can be shared with many consumers and providers of metadata, such as CASE tools, database management systems, and business intelligence tools. There have been two major releases of DataStage since UniVerse was sold to IBM, versions 6.0 and 7.0. Version 7.1 is due out in a couple of months. It is still recognizably UniVerse under the covers, though there are rumours that a new Engine is under development. We have already seen the beginnings of the likely direction for this in version 7.0, in which there is a component (interlude?) in Orchestrate that allows BASIC code to be executed, by loading the run machine. I suspect (personal opinion/educated guess only) that the new DataStage Engine will not be obviously UniVerse-based, but will continue to be able to do many of the things that UniVerse can do, but not all, and quite a few that UniVerse can't do. In summary I would disagree that DataStage Engine is a stripped down or simplified version of UniVerse. Certainly a couple of unnecessary things have been removed (or not put in, such as triggers on base files). On the other hand, quite a few things have been put in, that aren't in - aren't required in - UniVerse. They're different products and, as development of each progresses, I would expect to see greater divergence in functionality. After all, they do quite different jobs. It is to be hoped that the DataStage engineers (some of whom are ex UniVerse) and the IBM U2 engineers continue to exchange ideas. Ultimately, however, it is not engineers who decide product directions. Recall the Golgafrinchan "B" Ark. ----- Original Message ----- From: [EMAIL PROTECTED] Date: Sat, 31 Jan 2004 21:02:16 EST To: [EMAIL PROTECTED] Subject: Re: [ot] Peoplesoft migrates to Ascential Could anyone elucidate a bit on what exactly was meant by saying that Datastage is a stipped-down, or more simplified, or more purified or whatever (!) version of Universe? Like maybe an example, or some details about how exactly Universe was modded into Datastage? Thanks Will _______________________________________________ u2-users mailing list [EMAIL PROTECTED] http://www.oliver.com/mailman/listinfo/u2-users