This IBM® Redpaper publication provides guidance on building an enterprise-grade data lake by using IBM Spectrum® Scale and Cloudera Data Platform (CDP) Private Cloud Base for performing in-place Cloudera Hadoop or Cloudera Spark-based analytics. It also covers the benefits of the integrated solution and gives guidance about the types of deployment models and considerations during the implementation of these models.
August 2021 update added CES protocol support in Hadoop environment
Cloudera Data Platform Private Cloud Base
IBM Spectrum Scale and Elastic Storage System
Integrated solution overview
Component relationship
Integration with Cloudera Data Platform Private Cloud Base
CES HDFS
Deployment architecture
System configuration
Additional references