"Outside of a dog, a book is man's best friend. Inside of a dog it's too dark to read."

Groucho Marx

“Knowledge has become the key economic resource and the dominant, if not the only, source of competitive advantage.”

Peter F. Drucker

"Any fool can make things bigger, more complex, and more violent. It takes a touch of genius - and a lot of courage - to move in the opposite direction."

Albert Einstein

“We should not only use the brains we have, but all that we can borrow.”

Woodrow Wilson

“If you don’t have a competitive advantage, don’t compete!”

Jack Welch, CEO, General Electric

“Above all else show the data”

Edward Tufte in “The Visual Display of Quantitative Information”

"Nobody ever washes a rental car. Without established data ownership, don't expect clean data."


IBM PureData System for Analytics

(formerly IBM Netezza Data Warehouse Appliances)

The Simple Data Warehouse Appliance for Serious Analytics

The PureData System for Analytics, powered by Netezza technology, is a simple data appliance for serious analytics. It simplifies and optimises performance of data services for analytic applications, enabling very complex algorithms to run in minutes rather than hours.

IBM PureData System for Analytics is a high-performance, scalable and massively parallel system that enables organisations to perform analytics on enormous data volumes. The system meets business intelligence and advanced analytics requirements that were previously impossible or impractical to achieve.

IBM PureData System for Analytics is a purpose-built, standards-based data warehouse and analytic appliance that architecturally integrates database, server, storage and advanced analytic capabilities into a single, easy-to-manage system. It is designed for rapid and deep analysis of data volumes that scale into petabytes.

Why use IBM PureData System for Analytics?


The PureData System offers an orders-of-magnitude performance advantage over alternative analytic options. This advantage comes from its unique asymmetric massively parallel processing (AMPP™) architecture, which combines open IBM blade servers and disk storage with IBM’s patented data filtering using field-programmable gate arrays (FPGAs). This combination delivers impressive query performance on analytic workloads that support tens of thousands of BI users, as well as sophisticated analytics at the speed of thought.


With the PureData System, organisations can deploy appropriately sized environments for their data volumes and workloads. In addition, they can be confident that, as their data volumes grow, additional IBM PureData Systems can be deployed quickly and easily.

The PureData System for Analytics starts with a raw data capacity of 8 TB for a quarter-rack system, and can grow to well over 700 TB. With built-in hardware compression, the usable capacity exceeds 1.2 petabytes for the largest systems.


PureData System for Analytics dramatically simplifies analytics by consolidating all analytic activity at the point where the data resides. Users can obtain their predicted scores in near real time, which helps to operationalise advanced analytics and make it available throughout the enterprise.

The built-in analytical infrastructure and extensive library of statistical and mathematical functions support a breadth of analytic tools and programming languages. The system is delivered with a library of more than 200 prebuilt, scalable in-database analytic functions that execute analytics in parallel, while removing the complexity of parallel programming from developers, users, and DBAs.

Simplicity & Ease of Administration

The entire lifecycle of the PureData System for Analytics has been simplified, from how the system is procured and deployed to how it is managed and maintained. This results in a low cost of ownership and minimal maintenance effort. The system is up and running in hours and requires minimal up-front design and tuning and minimal ongoing administration. It provides standard interfaces to best-of-breed analytics, business intelligence and data integration tools, as well as easy connectivity to other big data platform components.

The Netezza technology also eliminates the need for complex database management tasks such as defining and optimizing indexes and manually administering storage. Our new enhancements provide increased system resilience with more spare drives and improved workload management tools. This means you can do more with the same number of people and reduce the workload on IT administrators.

IBM PureData System for Hadoop


IBM® PureData™ System for Hadoop is the newest member of the IBM PureSystems family. It delivers a smarter way to reduce complexity, accelerate time to value and improve IT economics. It is a purpose-built, standards-based, expert integrated system that architecturally integrates IBM InfoSphere BigInsights Hadoop-based software, server, and storage into a single, easy-to-manage system.

IBM PureData System for Hadoop is built to optimize Hadoop data services for big data analytics and online archive with appliance simplicity. It delivers enterprise Hadoop capabilities with easy-to-use analytic tools and visualization for business analysts and data scientists. It comes with rich developer tools, powerful analytic functions, and exceptional administration and management capabilities, as well as the latest versions of Hadoop and associated projects. In addition, IBM PureData System for Hadoop provides extensive capabilities with enhanced big data tools for monitoring, development, and integration with many more enterprise systems.

IBM PureData System for Hadoop offers simplicity, flexibility, and consumability in a single integrated system.

Built-in Expertise
• Speed to insight with built-in social data, machine data and text analytics accelerators
• Speed to value with accelerated deployment

Simplified Experience
• No assembly required, data load ready in hours
• Single system console for full system administration
• Rapid maintenance updates with automation

Integration by Design
• Hadoop system with built-in archiving tools
• Delivered with more robust security than open source software
• Architected for high availability

What is Hadoop?

About Hadoop®
Apache™ Hadoop® is an open source software project that enables the distributed processing of large data sets across clusters of commodity servers. It is designed to scale up from a single server to thousands of machines, with a very high degree of fault tolerance. Rather than relying on high-end hardware, the resiliency of these clusters comes from the software’s ability to detect and handle failures at the application layer.

So what’s the big deal?
Hadoop changes the economics and the dynamics of large scale computing. Its impact can be boiled down to four salient characteristics.

Hadoop enables a computing solution that is:
• Scalable – New nodes can be added as needed, and added without needing to change data formats, how data is loaded, how jobs are written, or the applications on top.
• Cost effective – Hadoop brings massively parallel computing to commodity servers. The result is a sizeable decrease in the cost per terabyte of storage, which in turn makes it affordable to model all your data.
• Flexible – Hadoop is schema-less, and can absorb any type of data, structured or not, from any number of sources. Data from multiple sources can be joined and aggregated in arbitrary ways enabling deeper analyses than any one system can provide.
• Fault tolerant – When you lose a node, the system redirects work to another location of the data and continues processing without missing a beat.
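
To make the fault-tolerance point concrete, here is a toy Python sketch of the idea that work is redirected to another location of the data when a node is lost. It is not Hadoop code and does not use any Hadoop API; the node names, block names, and the `pick_node` helper are all hypothetical and exist only for illustration.

```python
# Toy illustration only: this is not Hadoop's API. All names are hypothetical.

# Each block of data is assumed to be stored on several nodes (replicas).
BLOCK_LOCATIONS = {
    "block-1": ["node-a", "node-b", "node-c"],
    "block-2": ["node-b", "node-c", "node-d"],
}


def pick_node(block, live_nodes):
    """Return the first live node that holds a copy of `block`."""
    for node in BLOCK_LOCATIONS[block]:
        if node in live_nodes:
            return node
    raise RuntimeError(f"no live replica for {block}")


live = {"node-a", "node-b", "node-c", "node-d"}
before = {block: pick_node(block, live) for block in BLOCK_LOCATIONS}

live.discard("node-b")  # simulate losing a node
after = {block: pick_node(block, live) for block in BLOCK_LOCATIONS}

print(before)
print(after)
```

In this sketch the work on `block-2` moves from `node-b` to `node-c` after the simulated failure, while the work on `block-1` is unaffected because its first replica is still available.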

Think Hadoop is right for you?
Eighty percent of the world’s data is unstructured, and most businesses don’t even attempt to use this data to their advantage. Imagine if you could afford to keep all the data generated by your business. Imagine if you had a way to analyze that data.
IBM InfoSphere BigInsights brings the power of Hadoop to the enterprise. With built-in analytics, extensive integration capabilities and the reliability, security and support that you require, IBM can help put your big data to work for you.
InfoSphere BigInsights Quick Start Edition, the latest addition to the InfoSphere BigInsights family, is a free, downloadable, non-production version.
With InfoSphere BigInsights Quick Start Edition, you get access to hands-on learning through a set of tutorials designed to guide you through your Hadoop experience. Plus, there is no data capacity or time limitation, so you can experiment with large data sets and explore different use cases at your own pace.

Take a Test Drive

4 out of 5 companies that try the PureData System buy the PureData System

The Test Drive makes it easy to sample the simplicity and high performance of the appliance on your site and with your production data.

Take the Test Drive in 3 easy steps:

  1. We ship an appliance to you for trial.
  2. The PureData System is up and running in 24 hours (really).
  3. 80% of trial customers keep the System for good.

Contact us for more information

The Netezza Technology powering the PureData System

IBM PureData System for Analytics adheres to IBM’s basic principle of moving processing to the data, not moving the data to the processing. Each IBM PureData System for Analytics contains multiple Snippet Blades or S-Blades, where SQL query code segments (or “snippets”) and complex analytic processes are executed. The S-Blades are intelligent processing nodes that make up the massively parallel processing engine of the appliance. Each S-Blade is an independent server that contains powerful multi-core Intel CPUs, IBM’s unique multi-engine FPGAs and gigabytes of RAM as well as dedicated storage devices.
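
The principle of moving processing to the data can be illustrated with a small conceptual sketch in Python. This is not Netezza code and says nothing about how the appliance is actually implemented; the data slices, the `snippet` function, and the `query_total` helper are hypothetical. It only shows the general pattern: the same small piece of work runs against each slice of data in parallel, and only the small partial results travel back to be combined.

```python
# Conceptual sketch only: this is not Netezza code. Names and data are hypothetical.
from concurrent.futures import ThreadPoolExecutor

# Rows are assumed to be spread across processing nodes; each node owns one slice.
SLICES = [
    [("north", 120), ("south", 80)],
    [("north", 50), ("east", 200)],
    [("south", 30), ("north", 10)],
]


def snippet(rows, region):
    """Runs where the data lives: filter locally and return a small partial sum."""
    return sum(amount for r, amount in rows if r == region)


def query_total(region):
    """Host side: run the same snippet on every slice, then combine the partials."""
    with ThreadPoolExecutor(max_workers=len(SLICES)) as pool:
        partials = list(pool.map(lambda rows: snippet(rows, region), SLICES))
    return sum(partials)


total_north = query_total("north")
print(total_north)
```

The design point is that the raw rows never leave their slice; only one number per slice is sent back, which is why this pattern scales well as data volumes grow.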

IBM PureData System for Analytics is architected for high availability from the ground up. All components are internally redundant, and the failure of a processing node (S-Blade) causes no significant performance degradation, providing a robust, production-ready environment.

Included with every system, Netezza Performance Portal provides a web-based GUI that helps administrators to monitor and manage hardware, administer database objects, configure workload management, view active sessions and monitor system resource utilization for capacity planning. The portal provides a consolidated administrative interface that supports one or many PureData Systems for Analytics from a single, easy-to-use access point.

IBM have introduced a mini appliance specifically designed for small to mid-sized organisations.

Please contact us to take a test drive.