fbpx

IBM PureData System for Analytics

IBM PureData System for Analytics, based on Netezza technology, is a simple, dedicated device capable of carrying out serious analytical tasks. Simplifies and optimizes data handling for analytics applications, allowing you to perform highly complex algorithms in minutes rather than hours.

IBM PureData System for Analytics is designed specifically for running complex analytics on very large data volumes with faster execution times than competing solutions. It delivers the proven performance: scalability, intelligence, and simplicity that organizations need to leverage their data.

IBM

Faster

IBM PureData System for Analytics N3001 delivers a performance advantage over other analytic options. This comes from its unique asymmetric massively parallel processing (AMPP)™ architecture that combines open IBM blade servers and disk storage with IBM’s patented, hardware-accelerated data filtering, using field programmable gate arrays (FPGAs). This combination delivers fast query performance on analytic workloads supporting thousands of business intelligence and data warehouse users, providing sophisticated analytics for satisfying business requirements.

Smart

IBM PureData System for Analytics dramatically simplifies analytics by consolidating all analytic activity to one place, where the data resides. Moving analytics to the IBM PureData System is straightforward with IBM’s embedded analytic platform. With support for PMML 4.0 models, data modelers and quantitative teams can operate on the data directly inside the appliance instead of having to off load massive data volumes to a separate infrastructure, and then have to deal with the associated data preprocessing, transformation, and movement.

Data scientists can build their models using all the enterprise data, and then iterate through different models much faster to arrive at the best solution. Once the model is developed, it can be seamlessly executed against the relevant data in the appliance. Prediction and scoring can be done where the data resides. Users can get their predictive scores in near real-time, helping operationalize advanced analytics and making it available throughout the enterprise. Included with every PureData System for Analytics system is IBM Netezza Analytics software.

IBM Netezza Analytics offers a built-in analytical infrastructure and extensive library of statistical and mathematical functions, supporting a breadth of analytic tools and programming languages, including Open Source R. It is delivered with a library of more than 200 prebuilt, scalable, in-database analytic functions that execute analytics in parallel while abstracting away the complexity of parallel programming from the developers, users and DBAs.

The Netezza Analytics functionality also includes in-database geospatial analytics that are compatible with the industrystandard ESRI GIS formats. This enables easy integration with existing geospatial analytic environments. In addition, if models are developed using SPSS Modeler or SAS, IBM Netezza Analytics will accelerate the development and scoring of these models. The IBM PureData System for Analytics N3001 brings advanced security to your data in this insecure world. Building on the appliance simplicity model, all data is stored on selfencrypting disk (SED) drives, providing security while not impacting performance. The protection provided by the SED implementation supports the leading industries in security compliance—health care, government, and the financial sectors. This system utilizes strong authentication that prevents threats due to unauthorized access, based on the industry-standard Kerberos protocol.

Simple and completely integrated

The IBM PureData System for Analytics N3001 also offers a great value bundle as complementary software licenses to use in conjunction with the appliance. Data movement, reporting, analytic tools, and Hadoop licenses make for a full service offering.

Included software entitlements:

  • IBM Cognos® Business Intelligence—five Analytics User licenses, one Analytics Administrator license.
  • IBM DataStage (280 PVUs)—2 concurrent Designer Client licenses and IBM InfoSphere Data Click (with PureData System for Analytics as a source or target).
  • IBM BigInsights for Apache Hadoop, software licenses to manage around 100 TB of Hadoop data.
  • Two non-production user licenses for the IBM InfoSphere Streams Developer Edition.

All of these new features are delivered with the same simplicity and ease-of-use that distinguish all IBM PureSystems® family offerings and what sets the IBM PureData System for Analytics apart. As an appliance, the integration of hardware, software and storage is done for you, leading to shorter deployment cycles and industry leading time-to-value for business intelligence and analytic initiatives. The appliance is delivered ready-to-go for immediate data loading and query execution. The appliance integrates with leading ETL, BI and analytic applications through standard ODBC, JDBC and OLE DB interfaces. Included with every system, the PureData System for Analytics Performance Portal provides a web-based GUI that helps administrators monitor and manage hardware, administer database objects, configure workload management, view active sessions and monitor system resource utilization for capacity planning.

The portal provides a consolidated administrative interface supporting PureData Systems for Analytics from one, easy-to-use access point. IBM PureData System for Analytics is architected for high availability. All components are internally redundant, and the failure of a processing node (S-Blade) causes no significant performance degradation for a robust, production-ready environment from the moment the appliance is installed in your data center. IBM eliminates complexity at every step so you can redirect valuable resources to initiatives that will positively impact the bottom line.

Scalable

With the IBM PureData System for Analytics solution, organizations can deploy the right-sized environments for their data volumes and workloads, and be confident that as data volumes grow, larger systems can be deployed quickly and easily. The IBM PureData System for Analytics N3001 family of seven different configurations, starts with a data capacity of 16 TB (new N3001-001) and can grow to well over a petabyte for an eight-rack system (new N3001-080), assuming a 4X compression rate. IBM PureData System for Analytics provides near linear performance scalability as the size of the appliance grows, which means that organizations can pick the appropriate sized appliance to meet both their data volume and performance requirements. This is accomplished with predictable, scalable performance with no need to add significant resources to manage and maintain the appliance as data volumes grow.