Inicio > Tecnología, ingeniería, agricultura > Tecnología: cuestiones generales > Quantization Methods for Large Language Models From Theory to Real-World Implementations
Quantization Methods for Large Language Models From Theory to Real-World Implementations

Quantization Methods for Large Language Models From Theory to Real-World Implementations

Anand Vemula

19,84 €
IVA incluido
Consulta disponibilidad
Editorial:
Anand Vemula
Año de edición:
2024
Materia
Tecnología: cuestiones generales
ISBN:
9798227328335

Selecciona una librería:

  • Librería Desdémona
  • Librería Samer Atenea
  • Librería Aciertas (Toledo)
  • Kálamo Books
  • Librería Perelló (Valencia)
  • Librería Elías (Asturias)
  • Donde los libros
  • Librería Kolima (Madrid)
  • Librería Proteo (Málaga)

The book provides an in-depth understanding of quantization techniques and their impact on model efficiency, performance, and deployment.The book starts with a foundational overview of quantization, explaining its significance in reducing the computational and memory requirements of LLMs. It delves into various quantization methods, including uniform and non-uniform quantization, per-layer and per-channel quantization, and hybrid approaches. Each technique is examined for its applicability and trade-offs, helping readers select the best method for their specific needs.The guide further explores advanced topics such as quantization for edge devices and multi-lingual models. It contrasts dynamic and static quantization strategies and discusses emerging trends in the field. Practical examples, use cases, and case studies are provided to illustrate how these techniques are applied in real-world scenarios, including the quantization of popular models like GPT and BERT.

Artículos relacionados

  • Science and the Big Issues of Our Time
    Martin Gellender
    Within the last few generations, our world has been shaped by technological change enabled by scientific advances. This is particularly evident to the 'baby boomer' generation, who have lived through and witnessed huge changes in society over the course of their lifetimes. Although many have little education in science, or have forgotten what they learned in high school, they u...
    Disponible

    26,58 €

  • Statics+++
    James W Dally / Robert J Bonenberger
    This textbook has been prepared to support a course offering for Statics at the University of Nevada at Reno.  Statics provides the first exposure of engineering students to the study of mechanics.  While Statics is a relatively simple subject, many students find it difficult, and they often perform far below our expectations.  In an effort to improve the curriculum, several me...
    Disponible

    101,22 €

  • Technical Writing, Presentational Skills, and Online Communication
    Raymond Greenlaw
    This book addresses four main topics: professional ethics, technical writing, presentation skills, and online writing. These topics are woven throughout the book and some of them are the main subjects of one or more chapters. The overarching theme of this book is to provide well-tested, best-practice techniques and strategies for main topic areas while focusing on information t...
    Disponible

    229,44 €

  • Project Management Techniques and Innovations in Information Technology
    John Wang
    Managing cost, time, and quality of a project can be a challenging task for any project manager, but especially in times of an ever-changing and burgeoning field of IT. Project Management Techniques and Innovations in Information Technology offers a vital compendium of the latest research, case studies, best practices, and methodologies within the field of IT project management...
    Disponible

    229,98 €

  • Phenomenology, Organizational Politics, and IT Design
    Information systems are researched, published on, and utilized as an extremely broad and vital sector of current technology development, usually studied from the scientific or technological viewpoints therein. Phenomenology, Organizational Politics, and IT Design: The Social Study of Information Systems offers a new look at the latest research and critical issues within the fie...
    Disponible

    230,06 €

  • Geotechnical Applications for Earthquake Engineering
    Disaster preparedness and response management is a burgeoning field of technological research, and staying abreast of the latest developments within the field is a difficult task. Geotechnical Applications for Earthquake Engineering: Research Advancements has collected chapters from experts from around the world in a variety of applications, frameworks, and methodologies, and p...
    Disponible

    236,13 €

Otros libros del autor

  • Designing Multi-Agent Architecture for Advanced Generative AI Applications
    Anand Vemula
    This book explores the intricate world of multi-agent systems (MAS) within the context of advanced generative AI applications. It begins with an introduction to multi-agent systems, detailing their evolution, key concepts, and the significance of multi-agent architecture in the realm of generative AI. The foundation is laid by examining generative AI models, including Variation...
  • Vector Embeddings and Data Representation
    Anand Vemula
    This book explores the critical role of vector representations in generative AI and large language models (LLMs), detailing how data transforms into vectors and embeds into high-dimensional spaces for advanced AI applications. Beginning with the fundamentals of vector embeddings, the text outlines the mathematical foundations, including key linear algebra concepts, before delvi...