Apache Spark Implementation on IBM z/OS

An IBM Redbooks publication

Abstract

The term big data refers to extremely large sets of data that are analyzed to reveal insights, such as patterns, trends, and associations. The algorithms that analyze this data to provide these insights must extract value from a wide range of data sources, including business data and live, streaming, social media data.

However, the real value of these insights comes from their timeliness. Rapid delivery of insights enables anyone (not only data scientists) to make effective decisions, applying deep intelligence to every enterprise application.

Apache Spark is an integrated analytics framework and runtime to accelerate and simplify algorithm development, depoyment, and realization of business insight from analytics. Apache Spark on IBM® z/OS® puts the open source engine, augmented with unique differentiated features, built specifically for data science, where big data resides.

This IBM Redbooks® publication describes the installation and configuration of IBM z/OS Platform for Apache Spark for field teams and clients. Additionally, it includes examples of business analytics scenarios.

Table of contents

Chapter 1. Architectural overview
Chapter 2. Components and extensions
Chapter 3. Installation and configuration
Chapter 4. Spark application development on z/OS
Chapter 5. Production integration
Chapter 6. IBM z/OS Platform for Apache Spark and the ecosystem
Chapter 7. Use case patterns
Appendix A. Sample code to run on Apache Spark cluster on z/OS
Appendix B. FAQ: Frequently asked questions, and answers

Profile

Publish Date
13 August 2016


Rating: Not yet rated


Author(s)

ISBN-10
0738414961

ISBN-13
9780738414966

IBM Form Number
SG24-8325-00

Number of pages
142