Hadoop can store big data and unlock answers by analyzing it. IBM® InfoSphere® BigInsights™ is built on top of open source Hadoop and extends it with advanced analytic tools and other value-added capabilities. InfoSphere BigInsights helps organizations of all sizes manage the vast amounts of data that consumers and businesses create every day more efficiently.

At its core, Hadoop is a distributed computing environment that manages the execution of distributed jobs and tasks on a cluster. As with any distributed computing environment, the Hadoop software needs to provide facilities for resource management, scheduling, remote execution, and exception handling. Although Hadoop provides basic capabilities in these areas, IBM Platform Computing has been working on these problems and refining its solutions to them for twenty years.
This IBM Redpaper™ publication describes the integration of IBM Platform Symphony® 5.2 and IBM InfoSphere BigInsights 1.4 in an IBM System x® cluster. IBM Platform Symphony is a low-latency scheduling solution that supports true multitenancy and sophisticated workload management capabilities.

Table of contents
IBM Platform Symphony
Environment
Configuring InfoSphere BigInsights
Installing IBM Platform Symphony Advanced Edition
Integrating IBM Platform Symphony and InfoSphere BigInsights
Additional configuration for IBM Platform Symphony
Benchmark tests
Adding users
Adding nodes
Troubleshooting