Hortonworks Data Platform with IBM Spectrum Scale: Reference Guide for Building an Integrated Solution

An IBM Redpaper publication

Published on 26 June 2018

ISBN-10: 0738456969
ISBN-13: 9780738456966
IBM Form #: REDP-5448-01


Authors: R. Sandeep Patil, Wei G. Gong, Pallavi Galgali, Piyush Chaudhary, Muthu Muthiah, Yong ZY Zheng and Larry Coyne

    Abstract

    This IBM® Redpaper™ publication provides guidance on building an enterprise-grade data lake by using IBM Spectrum Scale™ and Hortonworks Data Platform to perform in-place Hadoop or Spark-based analytics. It covers the benefits of the integrated solution, describes the available deployment models, and outlines the considerations for implementing them.

    Hortonworks Data Platform (HDP) is a leading Hadoop and Spark distribution. HDP addresses the complete needs of data-at-rest, powers real-time customer applications, and delivers robust analytics that accelerate decision making and innovation.

    IBM Spectrum Scale™ is flexible and scalable software-defined file storage for analytics workloads. Enterprises around the globe have deployed IBM Spectrum Scale to form large data lakes and content repositories for high-performance computing (HPC) and analytics workloads. It can scale both performance and capacity without bottlenecks.

    Table of Contents

    Hortonworks Data Platform

    IBM Spectrum Scale

    Integrated solution overview

    Component diagram

    Deployment diagram

    Deployment models

    Shared Storage model

    Shared Nothing Storage model

    System configuration

    HDP and IBM Spectrum Scale frequently asked questions

     
