The Azure Data Lakehouse Toolkit

Ron L’Esteve

68,71 €
VAT included
Available
Publisher:
Springer Nature B.V.
Year of publication:
2022
Subject:
Computing: general topics
ISBN:
9781484282328

Select a bookstore:

  • Librería Samer Atenea
  • Librería Aciertas (Toledo)
  • Kálamo Books
  • Librería Perelló (Valencia)
  • Librería Elías (Asturias)
  • Donde los libros
  • Librería Kolima (Madrid)
  • Librería Proteo (Málaga)

Design and implement a modern data lakehouse on the Azure Data Platform using Delta Lake, Apache Spark, Azure Databricks, Azure Synapse Analytics, and Snowflake. This book teaches you the intricate details of the Data Lakehouse Paradigm and how to efficiently design a cloud-based data lakehouse using highly performant and cutting-edge Apache Spark capabilities using Azure Databricks, Azure Synapse Analytics, and Snowflake. You will learn to write efficient PySpark code for batch and streaming ELT jobs on Azure. And you will follow along with practical, scenario-based examples showing how to apply the capabilities of Delta Lake and Apache Spark to optimize performance, and secure, share, and manage a high volume, high velocity, and high variety of data in your lakehouse with ease.

The patterns of success that you acquire from reading this book will help you hone your skills to build high-performing and scalable ACID-compliant lakehouses using flexible and cost-efficient decoupled storage and compute capabilities. Extensive coverage of Delta Lake ensures that you are aware of and can benefit from all that this new, open source storage layer can offer.
In addition to the deep examples on Databricks in the book, there is coverage of alternative platforms such as Synapse Analytics and Snowflake so that you can make the right platform choice for your needs.

After reading this book, you will be able to implement Delta Lake capabilities, including Schema Evolution, Change Feed, Live Tables, Sharing, and Clones to enable better business intelligence and advanced analytics on your data within the Azure Data Platform.

What You Will Learn

  • Implement the Data Lakehouse Paradigm on Microsoft’s Azure cloud platform
  • Benefit from the new Delta Lake open-source storage layer for data lakehouses
  • Take advantage of schema evolution, change feeds, live tables, and more
  • Write functional PySpark code for data lakehouse ELT jobs
  • Optimize Apache Spark performance through partitioning, indexing, and other tuning options
  • Choose between alternatives such as Databricks, Synapse Analytics, and Snowflake

Who This Book Is For

Data, analytics, and AI professionals at all levels, including data architect and data engineer practitioners. Also for data professionals seeking patterns of success by which to remain relevant as they learn to build scalable data lakehouses for their organizations and customers who are migrating into the modern Azure Data Platform.

Related items

  • Interview with Jeffery Khoury, Bringing Telemedicine to the People
    Richard G Lowe Jr
    Did you know you can consult with a medical specialist over your smartphone from the comfort of your own home? Imagine speaking to a highly-trained and accredited doctor about whatever is ailing you from virtually anywhere in the world. Thanks to a young entrepreneur named Jeffery Khoury, you can get the advice you need from a pool of medical specialists without waiting in a doc...
  • IT Consulting Secrets
    Carl A Katz
    This book is for IT consultants of all experience levels, and the content is relevant to any IT support business model, from managed services (MSP) to break/fix. The author has methodically compiled these strategies and this information from over sixteen years of experience working in the IT support field at the small and medium-sized business and enterprise levels. ...
    Available

    29,41 €

  • Modeling, Analysis, and Applications in Metaheuristic Computing
    Peng-Yeng Yin
    The engineering and business problems the world faces today have become more impenetrable and unstructured, making the design of a satisfactory problem-specific algorithm nontrivial. Modeling, Analysis, and Applications in Metaheuristic Computing: Advancements and Trends is a collection of the latest developments, models, and applications within the transdisciplinary fields rel...
  • Knowledge Management and Drivers of Innovation in Services Industries
    Knowledge Management is concerned with all aspects of eliciting, acquiring, modelling, and managing knowledge. Application of knowledge resources successfully helps the organization to deliver creative products and services. Especially in service business, service job experience and information about the customer, as well as the installed site equipment, are key factors to deli...
  • Current Trends and Future Practices for Digital Literacy and Competence
    Antonio Cartelli
    Being a digital citizen has transformed from a process of familiarizing oneself with terminology and techniques to a full-time responsibility in the hands of any who want to stay abreast of the latest technological change in their respective field. Current Trends and Future Practices for Digital Literacy and Competence offers a look at the latest research within digital lite...
  • Human Rights and Risks in the Digital Era
    Globalization, along with its digital and information communication technology counterparts, including the Internet and cyberspace, may signify a whole new era for human rights, characterized by new tensions, challenges, and risks for human rights, as well as new opportunities. Human Rights and Risks in the Digital Era: Globalization and the Effects of Information Technologies ...