Hadoop 2.x Administration Cookbook

Hadoop 2.x Administration Cookbook

Hadoop 2.x Administration Cookbook

Gurmukh Singh

79,67 €
IVA incluido
Disponible
Editorial:
Packt Publishing
Año de edición:
2017
ISBN:
9781787126732
79,67 €
IVA incluido
Disponible

Selecciona una librería:

  • Librería Samer Atenea
  • Librería Aciertas (Toledo)
  • Kálamo Books
  • Librería Perelló (Valencia)
  • Librería Elías (Asturias)
  • Donde los libros
  • Librería Kolima (Madrid)
  • Librería Proteo (Málaga)

Over 100 practical recipes to help you become an expert Hadoop administratorKey Features:- Become an expert Hadoop administrator and perform tasks to optimize your Hadoop Cluster- Import and export data into Hive and use Oozie to manage workflow.- Practical recipes will help you plan and secure your Hadoop cluster, and make it highly availableBook Description:Hadoop enables the distributed storage and processing of large datasets across clusters of computers. Learning how to administer Hadoop is crucial to exploit its unique features. With this book, you will be able to overcome common problems encountered in Hadoop administration.The book begins with laying the foundation by showing you the steps needed to set up a Hadoop cluster and its various nodes. You will get a better understanding of how to maintain Hadoop cluster, especially on the HDFS layer and using YARN and MapReduce. Further on, you will explore durability and high availability of a Hadoop cluster.You’ll get a better understanding of the schedulers in Hadoop and how to configure and use them for your tasks. You will also get hands-on experience with the backup and recovery options and the performance tuning aspects of Hadoop. Finally, you will get a better understanding of troubleshooting, diagnostics, and best practices in Hadoop administration.By the end of this book, you will have a proper understanding of working with Hadoop clusters and will also be able to secure, encrypt it, and configure auditing for your Hadoop clusters.What You Will Learn:- Set up the Hadoop architecture to run a Hadoop cluster smoothly- Maintain a Hadoop cluster on HDFS, YARN, and MapReduce- Understand high availability with Zookeeper and Journal Node- Configure Flume for data ingestion and Oozie to run various workflows- Tune the Hadoop cluster for optimal performance- Schedule jobs on a Hadoop cluster using the Fair and Capacity scheduler- Secure your cluster and troubleshoot it for various common pain pointsWho this book is for:If you are a system administrator with a basic understanding of Hadoop and you want to get into Hadoop administration, this book is for you. It’s also ideal if you are a Hadoop administrator who wants a quick reference guide to all the Hadoop administration-related tasks and solutions to commonly occurring problems

Artículos relacionados

  • Exploring Advances in Interdisciplinary Data Mining and Analytics
    Data mining is still a relatively young field, expanding at the rate of technology while advancing tools and techniques for gaining knowledge, finding patterns, and managing databases. Exploring Advances in Interdisciplinary Data Mining and Analytics: New Trends is an updated look at the state of technology in the field of data mining and analytics. As processor speeds, databas...
  • Knowledge Discovery Practices and Emerging Applications of Data Mining
    Recent developments have drastically increased the volume and complexity of data available to be mined, leading researchers to explore new ways to glean non-trivial data automatically. Knowledge Discovery Practices and Emerging Applications of Data Mining: Trends and New Domains introduces the reader to recent research activities in the field of data mining. This book covers as...
  • Research and Trends in Data Mining Technologies and Applications
    David Taniar
    ...
  • Developing Metadata Application Profiles
    The prevalence of data science has grown exponentially in recent years. Increases in data exchange have created the need for standards and formats on handling data from different sources. Developing Metadata Application Profiles is an innovative reference source that discusses the latest trends and techniques for effectively managing and exchanging metadata. Including a range o...
  • Modern Technologies for Big Data Classification and Clustering
    Data has increased due to the growing use of web applications and communication devices. It is necessary to develop new techniques of managing data in order to ensure adequate usage. Modern Technologies for Big Data Classification and Clustering is an essential reference source for the latest scholarly research on handling large data sets with conventional data mining and provi...
  • 90 Gelöste Fälle zu Zeitintelligenz in der DAX-Sprache
    Ramón Javier Castro Amador
    Dieser Ratgeber ist rein praktisch ausgerichtet, so dass Sie den gesamten DAX-Code in dieser Publikation anhand einer zum Download verfügbaren .pbix-Datei testen können.'90 gelöste Fälle zu Zeitintelligenz in DAX' ist ein Ratgeber für Benutzer von Microsoft Power BI, der Lösungen für sehr häufige praktische Fälle in Zeitintelligenzmodellen in der Sprache DAX bietet.Um das Verst...
    Disponible

    16,15 €

Otros libros del autor

  • Protokół routingu wielościeżkowego oparty na DHT dla sieci MANET
    Gurmukh Singh
    Pomimo swoich atrakcyjnych cech, przejście od „tradycyjnych' sieci do mobilnych sieci ad-hoc rodzi kilka trudnych problemów. Mobilne sieci ad-hoc (MANET) dziedziczą wszystkie tradycyjne problemy komunikacji bezprzewodowej i mobilnej, takie jak optymalizacja przepustowości, kontrola mocy i poprawa jakości transmisji. Dodatkowo, wielopasmowa natura i brak stałej infrastruktury wp...
    Disponible

    81,16 €

  • Protocole de routage à trajets multiples basé sur le DHT pour les MANETs
    Gurmukh Singh
    Malgré ses caractéristiques attrayantes, le passage des réseaux traditionnels aux réseaux mobiles ad hoc soulève plusieurs problèmes difficiles. Les réseaux mobiles ad hoc (MANET) héritent de tous les problèmes traditionnels des communications sans fil et mobiles, tels que l’optimisation de la bande passante, le contrôle de la puissance et l’amélioration de la qualité de la t...
    Disponible

    81,16 €

  • Monitoring Hadoop
    Gurmukh Singh
    ...
    Disponible

    46,73 €