Skip to main content

Apache Spark Implementation on IBM z/OS

An IBM Redbooks publication


Published on 13 August 2016

  1. .EPUB (2.1 MB)
  2. .PDF (4.5 MB)

Apple BooksGoogle Play Books

Share this page:   

ISBN-10: 0738414961
ISBN-13: 9780738414966
IBM Form #: SG24-8325-00

Authors: Lydia Parziale, Joe Bostian, Ravi Kumar, Ulrich Seelbach and Zhong Yu Ye

    menu icon


    The term big data refers to extremely large sets of data that are analyzed to reveal insights, such as patterns, trends, and associations. The algorithms that analyze this data to provide these insights must extract value from a wide range of data sources, including business data and live, streaming, social media data.

    However, the real value of these insights comes from their timeliness. Rapid delivery of insights enables anyone (not only data scientists) to make effective decisions, applying deep intelligence to every enterprise application.

    Apache Spark is an integrated analytics framework and runtime to accelerate and simplify algorithm development, depoyment, and realization of business insight from analytics. Apache Spark on IBM® z/OS® puts the open source engine, augmented with unique differentiated features, built specifically for data science, where big data resides.

    This IBM Redbooks® publication describes the installation and configuration of IBM z/OS Platform for Apache Spark for field teams and clients. Additionally, it includes examples of business analytics scenarios.

    Table of Contents

    Chapter 1. Architectural overview

    Chapter 2. Components and extensions

    Chapter 3. Installation and configuration

    Chapter 4. Spark application development on z/OS

    Chapter 5. Production integration

    Chapter 6. IBM z/OS Platform for Apache Spark and the ecosystem

    Chapter 7. Use case patterns

    Appendix A. Sample code to run on Apache Spark cluster on z/OS

    Appendix B. FAQ: Frequently asked questions, and answers


    Others who read this also read