IBM Spectrum Discover: Metadata Management for Deep Insight of Unstructured Storage
A draft IBM Redpaper publication
Updated 05 July 2019
IBM Form #: REDP-5550-00
Rate and comment
Authors: Joseph Dain, Norman Bogard, Isom Crawford Jr., Mathias Defiebre, Larry Coyne
This IBM® Redpaper™ publication provides a comprehensive overview of the IBM Spectrum™ Discover metadata management software platform. We give a detailed explanation of how the product creates, collects, and analyzes metadata. There are some in-depth use cases that show examples in the areas of analytics, governance, and optimization. We also provide step by step information to install and set-up the IBM Spectrum Discover trial environment.
More than 80% of all data collected by organizations does not reside in a standard relational database. Instead, it is trapped in unstructured documents, social media posts, machine logs, and the like. Many organizations face significant challenges to manage this deluge of unstructured data such as:
- Pinpointing and activating relevant data for large-scale analytics
- Lacking the fine-grained visibility needed to map data to business priorities
- Removing redundant, obsolete, and trivial (ROT) data
- Identifying and classifying sensitive data
IBM Spectrum Discover is a modern metadata management software that provides data insight for petabyte-scale file and object storage, storage on premises and in the cloud, enabling organizations to make better business decisions and gain and maintain a competitive advantage.
IBM Spectrum Discover provides a rich metadata layer that enables storage administrators, data stewards, and data scientists to efficiently manage, classify, and gain insights from massive amounts of unstructured data. It improves storage economics, helps mitigate risk, and accelerates large-scale analytics to create competitive advantage and speed critical research.
Table of contents
Chapter 1. IBM Spectrum Discover Overview
Chapter 2. Metadata Essentials
Chapter 3. Sample Use Cases
Chapter 4. Deep Inspection and the AI Pipeline
Appendix A. IBM Spectrum Discover Installation and Set-up
These pages are Web versions of IBM Redbooks- and Redpapers-in-progress. They are published here for those who need the information now and may contain spelling, layout and grammatical errors. This material has not been submitted to any formal IBM test and is published AS IS. It has not been the subject of rigorous review. Your feedback is welcomed to improve the usefulness of the material to others.
Follow IBM Redbooks
Follow IBM Redbooks