LEADER 00000cam a22006737a 4500
003 OCoLC
005 20240129213017.0
006 m o d
007 cr cnu||||||||
008 120820s2020 xx o 000 0 eng
019 1302275428
024 8 9780738459028
029 0 AU@|b000067830083
035 (OCoLC)1192526752|z(OCoLC)1302275428
040 AU@|beng|epn|cAU@|dUAB|dOCLCO|dOCLCF|dLVT|dOCLCO|dOCLCQ|dOCLCO|dOCLCL
049 INap
099 eBook O'Reilly for Public Libraries
100 1 Dain, Joseph,|eauthor.
245 10 Cataloging Unstructured Data in IBM Watson Knowledge Catalog with IBM Spectrum Discover /|cDain, Joseph.|h[O'Reilly electronic resource]
250 1st edition.
264 1 |bIBM Redbooks,|c2020.
300 1 online resource (108 pages)
336 text|btxt|2rdacontent
337 computer|bc|2rdamedia
338 online resource|bcr|2rdacarrier
347 text file
520 This IBM® Redpaper publication explains how IBM Spectrum® Discover integrates with the IBM Watson® Knowledge Catalog (WKC) component of IBM Cloud® Pak for Data (IBM CP4D) to make the enriched catalog content in IBM Spectrum Discover along with the associated data available in WKC and IBM CP4D. From an end-to-end IBM solution point of view, IBM CP4D and WKC provide state-of-the-art data governance, collaboration, and artificial intelligence (AI) and analytics tools, and IBM Spectrum Discover complements these features by adding support for unstructured data on large-scale file and object storage systems on premises and in the cloud. Many organizations face challenges to manage unstructured data. Some challenges that companies face include: Pinpointing and activating relevant data for large-scale analytics, machine learning (ML) and deep learning (DL) workloads. Lacking the fine-grained visibility that is needed to map data to business priorities. Removing redundant, obsolete, and trivial (ROT) data and identifying data that can be moved to a lower-cost storage tier.
Identifying and classifying sensitive data as it relates to various compliance mandates, such as the General Data Protection Regulation (GDPR), the Payment Card Industry Data Security Standard (PCI DSS), and the Health Insurance Portability and Accountability Act (HIPAA). This paper describes how IBM Spectrum Discover provides seamless integration of data in IBM Storage with IBM Watson Knowledge Catalog (WKC). Features include: Event-based cataloging and tagging of unstructured data across the enterprise. Automatically inspecting and classifying over 1000 unstructured data types, including genomics and imaging specific file formats. Automatically registering assets with WKC based on IBM Spectrum Discover search and filter criteria, and by using assets in IBM CP4D. Enforcing data governance policies in WKC in IBM CP4D based on insights from IBM Spectrum Discover, and using assets in IBM CP4D. Several in-depth use cases show examples from healthcare, life sciences, and financial services. IBM Spectrum Discover integration with WKC enables storage administrators, data stewards, and data scientists to efficiently manage, classify, and gain insights from massive amounts of data. The integration improves storage economics, helps mitigate risk, and accelerates large-scale analytics to create competitive advantage and speed critical research.
542 |fCopyright 2020 © IBM|g2020
550 Made available through: Safari, an O'Reilly Media Company.
588 Online resource; Title from title page (viewed August 11, 2020)
590 O'Reilly|bO'Reilly Online Learning: Academic/Public Library Edition
650 0 Database management.
650 0 IBM computers.
650 0 Information retrieval|xComputer programs.
650 0 Information storage and retrieval systems.
650 2 Information Systems
650 6 Bases de données|xGestion.
650 6 IBM (Ordinateurs)
650 6 Systèmes d'information.
650 7 Database management|2fast
650 7 IBM computers|2fast
650 7 Information retrieval|xComputer programs|2fast
650 7 Information storage and retrieval systems|2fast
700 1 Selim, Abeer,|eauthor.
700 1 Patil, Anil,|eauthor.
700 1 Vollmar, Christopher,|eauthor.
700 1 De Rezende, Flavio,|eauthor.
700 1 Greco, Frank,|eauthor.
700 1 Lee, Frank,|eauthor.
700 1 Crawford, Isom,|eauthor.
700 1 Bozhinov, Ivaylo,|eauthor.
700 1 Wong, Joanna,|eauthor.
700 1 Blumert, Joshua,|eauthor.
700 1 Coyne, Larry,|eauthor.
710 2 Safari, an O'Reilly Media Company.
856 40 |uhttps://ezproxy.naperville-lib.org/login?url=https://learning.oreilly.com/library/view/~/9780738459028/?ar|zAvailable on O'Reilly for Public Libraries
936 BATCHLOAD
994 92|bJFN