Library Hours
Monday to Friday: 9 a.m. to 9 p.m.
Saturday: 9 a.m. to 5 p.m.
Sunday: 1 p.m. to 9 p.m.
Naper Blvd. 1 p.m. to 5 p.m.
     
Limit search to available items
Results Page:  Previous Next
Author Singh, Gurmukh, author.

Title Hadoop 2.x administration cookbook : administer and maintain large Apache Hadoop clusters / Gurmukh Singh. [O'Reilly electronic resource]

Publication Info. Birmingham, UK : Packt Publishing, 2017.
QR Code
Description 1 online resource (1 volume) : illustrations
Note Includes index.
Summary Over 100 practical recipes for becoming an expert Hadoop Admininstrator About This Book Become an expert Hadoop administrator and perform tasks for optimizing your Hadoop Cluster Import and export data into Hive and use Oozie to manage workflow. Practical recipes to help you plan and secure your Hadoop cluster, and make it highly available Who This Book Is For If you are a system administrator with a basic understanding of Hadoop who wants to get into Hadoop administration, this book is for you. If you are a Hadoop administrator who wants a quick reference guide to all the Hadoop administration-related tasks and solutions to commonly occuring problems, this book will also help you What you will learn Set up hadoop architecture to run a Hadoop cluster smoothly. Maintain Hadoop cluster on HDFS, YARN and MapReduce. Understand High Availability with Zookeeper and Journal Node. Configure Flume for data ingestion and Oozie to run various workflows. Tune the Hadoop cluster for optimal performance. Schedule jobs on Hadoop cluster using Fair and Capacity scheduler. Secure your cluster and troubleshoot it for various common pain points. In Detail Hadoop allows distributed storage and processing of large data sets across clusters of computers. Learning to administer Hadoop is crucial for exploiting its unique features. With this book, you will be able to overcome common problems encountered in Hadoop Administration. This book begins with laying the foundation by showing the steps to set up the Hadoop cluster and its various nodes. You will get a better understanding of how to maintain Hadoop cluster, especially on the HDFS layer and using YARN and MapReduce. Further you will explore durabiltiy and high availability of Hadoop cluster. Get a better understanding of the schedulers in Hadoop and how to configure and use them for your tasks. You will also get a hands-on experience with the back up and recovery options and also performance tuning aspects of Hadoop. Finally, you will get a better understanding of troubleshooting, diagnostics and best practices in Hadoop administration. By the end of this book, you will get a proper understanding of working with Hadoop clusters and will also be able to secure, encrypt it and configure auditing for your Hadoop clusters.
Subject Apache Hadoop.
Apache Hadoop
Electronic data processing -- Distributed processing.
Big data.
Traitement réparti.
Données volumineuses.
Big data
Electronic data processing -- Distributed processing
ISBN 1787126870
9781787126879 (electronic bk.)
Patron reviews: add a review
Click for more information
EBOOK
No one has rated this material

You can...
Also...
- Find similar reads
- Add a review
- Sign-up for Newsletter
- Suggest a purchase
- Can't find what you want?
More Information