Verstärkungslernen

Dr Satyanarayana S / Dr Thayyaba Khatoon MD / N V Madhu Bindu

96,15 €
VAT included
Available
Publisher: KS OmniScriptum Publishing
Year of publication: 2023
Subject: Computer networks and communications
ISBN: 9786206403128

This book is organized into five units and offers a holistic learning experience. It begins with an introduction to bandit algorithms, exploring core concepts such as the Upper Confidence Bound (UCB) and Probably Approximately Correct (PAC) algorithms. The next unit introduces the full reinforcement learning (RL) framework, moving beyond bandit algorithms to consider interactions between agent and environment over multiple time steps. Markov decision processes (MDPs) are introduced as the fundamental framework for modeling sequential decision tasks. The fourth unit covers dynamic programming methods, temporal differences (TD), and the Bellman optimality equation in RL. These concepts enable agents to plan, learn, and optimize their actions effectively. The final unit deals with advanced RL techniques such as eligibility traces, function approximation, least-squares methods, fitted Q-learning, Deep Q-Network (DQN), and policy gradient algorithms.

Related items

  • Next Generation Search Engines
    Recent technological progress in computer science, Web technologies, and the constantly evolving information available on the Internet has drastically changed the landscape of search and access to information. Current search engines employ advanced techniques involving machine learning, social networks, and semantic analysis. Next Generation Search Engines: Advanced Models for ...
    Available

    256,63 €

  • Collaboration and the Semantic Web
    Collaborative working has been increasingly viewed as a good practice for organizations to achieve efficiency. Organizations that work well in collaboration may have access to new sources of funding, deliver new, improved, and more integrated services, make savings on shared costs, and exchange knowledge, information and expertise. Collaboration and the Semantic Web: Social Net...
    Available

    229,92 €

  • Resource Allocation in Next-Generation Broadband Wireless Access Networks
    With the growing popularity of wireless networks in recent years, the need to increase network capacity and efficiency has become more prominent in society. This has led to the development and implementation of heterogeneous networks. Resource Allocation in Next-Generation Broadband Wireless Access Networks is a comprehensive reference source for the latest scholarly research o...
    Available

    249,42 €

  • Advanced Topics in Information Technology Standards and Standardization Research, Volume 1
    Kai Jakobs
    ...
    Available

    118,72 €

  • Data Warehouses and OLAP
    ...
    Available

    118,72 €

  • Selected Readings on Database Technologies and Applications
    Terry Halpin
    Education and research in the field of database technology can prove problematic without the proper resources and tools on the most relevant issues, trends, and advancements. Selected Readings on Database Technologies and Applications supplements course instruction and student research with quality chapters focused on key issues concerning the development, design, and analysis ...
    Available

    256,64 €

Other books by the author

  • Apprendimento per rinforzo
    Dr Satyanarayana S / Dr Thayyaba Khatoon MD / N V Madhu Bindu
    This book is structured in five units, offering a holistic learning experience. The journey begins with an introduction to bandit algorithms, exploring fundamental concepts such as the Upper Confidence Bound (UCB) and Probably Approximately Correct (PAC) algorithms. The next unit introduces the full Reinforcement Learning (RL) framework, going beyond the...
    Available

    96,17 €

  • Aprendizagem por reforço
    Dr Satyanarayana S / Dr Thayyaba Khatoon MD / N V Madhu Bindu
    This book is structured in five units, offering a holistic learning experience. The journey begins with an introduction to bandit algorithms, exploring fundamental concepts such as the Upper Confidence Bound (UCB) and Probably Approximately Correct (PAC) algorithms. The next unit introduces the full Reinforcement Learning (RL) framework, going beyond the...
    Available

    96,16 €