Ab Initio International
Ab Initio Software Corporation was founded in the mid-1990s by Sheryl Handler, the former CEO of Thinking Machines Corporation, and several other former employees after that company's bankruptcy. The Ab Initio software is a fourth-generation, graphical user interface (GUI)-based parallel processing tool for data analysis, batch processing, and data manipulation; it is commonly used to extract, transform, and load (ETL) data. The software is a suite of products that together provide a platform for data processing applications. The core Ab Initio products are:

The Co>Operating System
The Co>Operating System runs on a variety of operating systems and hardware platforms, including OS/390 and z/OS on mainframes, Unix, Linux, and Windows. It supports distributed and parallel execution, and can scale in proportion to the hardware resources provided. It supports platform-independent data transport using the Ab Initio data manipulation language. The Co>Operating System underlies every part of the product suite, allowing all parts to communicate and integrate into one platform. It also lets the native operating system interpret Ab Initio commands. It achieves high-performance data processing through parallel data manipulation.

Ab Initio implements parallelism in three main ways:
Data parallelism - data is divided among many partitions, known as multi-files, and each partition is processed in parallel.
Component parallelism - multiple components run simultaneously on different branches of a graph.
Pipeline parallelism - one component processes a record while another component is still processing a previous record. Operations such as sorting and aggregation break pipeline parallelism because they must read their entire input before emitting any output.

The Component Library
Reusable software modules for sorting, joining, data transformation, database loading, and so on. The components adapt at runtime to the record formats and business rules that control their behavior. Components also include various system "connectors" that give access to various storage engines.

The Graphical Development Environment
The Graphical Development Environment (GDE) provides a graphical interface for editing and executing Ab Initio computer programs. It uses the components available in the library to carry out various ETL activities. The Co>Operating System can execute these programs directly. The GDE also allows running applications to be monitored, so that data volumes and execution times can be quantified for performance estimation.
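
The forms of parallelism listed under the Co>Operating System can be illustrated with a small sketch in ordinary Python. This is not Ab Initio code and uses none of its APIs: the function names (partition_by_key, pipeline, run_partitions) and the record fields (id, amount) are hypothetical, chosen only for illustration. Records are split into partitions by key (data parallelism), each partition is handled by its own worker, and within a partition two record-at-a-time stages are chained so that a record reaches the second stage before the first stage has seen the whole input (a single-threaded approximation of pipeline flow).

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterable, Iterator, List


def partition_by_key(records: List[dict], key: str, n: int) -> List[List[dict]]:
    """Split records into n partitions using an integer key (data parallelism)."""
    partitions: List[List[dict]] = [[] for _ in range(n)]
    for rec in records:
        partitions[rec[key] % n].append(rec)
    return partitions


def pipeline(records: Iterable[dict]) -> Iterator[dict]:
    """Chain two lazy stages; each record flows through both before the next is read."""
    filtered = (r for r in records if r["amount"] > 0)  # stage 1: filter
    return ({**r, "amount_cents": r["amount"] * 100} for r in filtered)  # stage 2: reformat


def run_partitions(
    partitions: List[List[dict]],
    stage: Callable[[Iterable[dict]], Iterator[dict]],
) -> List[List[dict]]:
    """Run the same stage over every partition, one worker thread per partition."""
    with ThreadPoolExecutor(max_workers=len(partitions)) as pool:
        return list(pool.map(lambda part: list(stage(part)), partitions))


if __name__ == "__main__":
    records = [{"id": i, "amount": i - 2} for i in range(6)]
    parts = partition_by_key(records, key="id", n=2)
    for i, out in enumerate(run_partitions(parts, pipeline)):
        print(f"partition {i}: {out}")
```

Adding a step such as sorted() between the two stages would force the whole partition to be read before anything is passed on, which is the sense in which sorting and aggregation break pipelining.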

An Ab Initio computer program is called a graph because it resembles its mathematical counterpart. A graph contains one or more components (vertices), joined by flows (edges) through which data moves. Data flows in only one direction, which allows the graph to run in a parallel processing environment. The GDE compiles each graph into a Korn shell script (or a batch file in a Windows environment) that the Co>Operating System can run. It is also possible to write Ab Initio programs without the GDE using an ordinary text editor, but this is extremely cumbersome and rarely done in practice.

Enterprise Meta>Environment
Alongside the development environment and the Co>Operating System is a datastore, the Enterprise Meta>Environment (EME), which tracks changes to graphs during development, as well as metadata about the development, how data is used, and other possible means of classifying data. Storing graph-related metadata enables data impact analysis, giving the user a visual sense of how data changes within a graph and how those changes affect other graphs. The EME also serves as a configuration and change management system, keeping the latest version of a graph available across subsequent code changes. It performs the following operations:

Data Profiler
The Data Profiler is a graphical data analysis tool that runs on top of the Co>Operating System. It can be used to characterize data range, scope, distribution, variance, and quality.

Conduct>It
Ab Initio Conduct>It (formerly Plan>It) helps developers create and manage large-scale data processing systems. Conduct>It provides a framework for building a complete production-ready system consisting of Ab Initio graphs, custom scripts, and third-party programs.

Major Competitors
The Gartner Magic Quadrant for Data Integration Tools 2009 lists IBM, Informatica, Oracle Corporation, and SAP Business Objects as the leading vendors, followed by Microsoft as a challenger. iWay Software, SAS Institute, Sybase, Pervasive Software, and Talend are listed as visionaries. The 2009 report also includes newcomers such as expressor, CloverETL, and Pentaho. Gartner considers that Ab Initio does not meet its analysis criteria. Ab Initio has fared poorly in recent Gartner Magic Quadrants, dropping from Visionary (2005) to Niche (2006) before falling off the grid entirely (2007). [1]

Ab Initio has also been criticised for extreme secrecy about its products. Anyone working with its products, including employees of organizations that use Ab Initio, must sign a non-disclosure agreement that prevents them from revealing Ab Initio technical information to the public.