Inicio > Matemáticas y ciencia > Matemáticas > Probabilidad y estadística > Minimum Divergence Methods in Statistical Machine Learning
Minimum Divergence Methods in Statistical Machine Learning

Minimum Divergence Methods in Statistical Machine Learning

Osamu Komori / Shinto Eguchi

157,42 €
IVA incluido
Consulta disponibilidad
Editorial:
Springer Nature B.V.
Año de edición:
2022
Materia
Probabilidad y estadística
ISBN:
9784431569206

Selecciona una librería:

  • Librería Samer Atenea
  • Librería Aciertas (Toledo)
  • Kálamo Books
  • Librería Perelló (Valencia)
  • Librería Elías (Asturias)
  • Donde los libros
  • Librería Kolima (Madrid)
  • Librería Proteo (Málaga)

This book explores minimum divergence methods of statistical machine learning for estimation,  regression, prediction, and so forth,  in which we engage in information geometry to elucidate their intrinsic properties of the corresponding loss functions, learning algorithms, and statistical models. One of the most elementary  examples is Gauss’s least squares estimator in a linear regression model, in which the estimator is given by minimization of the sum of squares between a response vector and a vector of the linear subspace hulled by explanatory vectors.  This is extended to Fisher’s maximum likelihood estimator (MLE) for an exponential model, in which the estimator is provided by minimization of the Kullback-Leibler (KL) divergence between a data distribution and a parametric distribution of the exponential model in an empirical analogue. Thus, we envisage a geometric interpretation of such  minimization procedures such that a right triangle is kept with Pythagorean identity in the sense of the KL divergence.  This understanding sublimates  a dualistic interplay between a statistical estimation and model, which requires dual geodesic paths, called m-geodesic and e-geodesic paths, in a framework of information geometry. We extend such a dualistic structure of the MLE and exponential model to that of the minimum divergence estimator and the maximum entropy model, which is applied to robust statistics, maximum entropy, density estimation, principal component analysis, independent component analysis, regression analysis, manifold learning, boosting algorithm,  clustering, dynamic treatment regimes, and so forth. We consider a variety of information divergence measures typically including KL divergence to express departure from one probability distribution to another. An information divergence is decomposed into the cross-entropy and the (diagonal) entropy in which the entropy associates with a generative model as a family of maximum entropy distributions; the cross entropy associates with a statistical estimation method via minimization of the empirical analogue based on given data. Thus any statistical divergence includes an intrinsic object between the generative model and the estimation method. Typically, KL divergence leads to the exponential model and the maximum likelihood estimation. It is shown that any information divergence leads to a Riemannian metric and a pair of the linear connections in the framework of information geometry. We focus on a class of information divergence generated by an increasing and convex function U, called U-divergence. It is shown that any generator function U generates the U-entropy and U-divergence, in which there is a dualistic structure between the U-divergence method and the maximum U-entropy model. We observe that a specific choice of  U leads to a robust statistical procedure via the minimum U-divergence method. If U is selected as an exponential function, then the corresponding  U-entropy and U-divergence are reduced to the Boltzmann-Shanon entropy and the KL divergence; the minimum U-divergence estimator is equivalent to the MLE. For robust supervised learning to predict a class label we observe that the U-boosting algorithm performs well for contamination of mislabel examples if U is appropriately selected. We present such maximal U-entropy and minimum U-divergence methods, in particular, selecting a power function as U to provide flexible performance in statistical machine learning. 

Artículos relacionados

  • ENGINEERING UNCERTAINTY AND RISK ANALYSIS
    Sergio E. Serrano
    An integrated coverage of probability, statistics, Monte Carlo simulation, inferential statistics, design of experiments, systems reliability, fitting random data to models, analysis of variance, stochastic processes, and stochastic differential equations for engineers and scientists. The author for first time presents an introduction to the broad field of applied engineering u...
    Disponible

    134,56 €

  • UNDERSTANDING AND CALCULATING THE ODDS
    Catalin Barboianu
    Man’s daily life is full of decisional situations. Whether we have math skills or not, we frequently estimate and compare probabilities, sometimes without realizing it, especially when making decisions. But probabilities are not just simple numbers attached objectively or subjectively to events, as they perhaps look, and their calculus and usage is highly predisposed to qualita...
    Disponible

    31,61 €

  • Random Graphs and Complex Networks
    Remco van der Hofstad
    ...
  • Introduction to Malliavin Calculus
    David Nualart / Eulalia Nualart
    ...
    Disponible

    60,35 €

  • Probability, Markov Chains, Queues, and Simulation
    William J. Stewart
    Probability, Markov Chains, Queues, and Simulation provides a modern and authoritative treatment of the mathematical processes that underlie performance modeling. The detailed explanations of mathematical derivations and numerous illustrative examples make this textbook readily accessible to graduate and advanced undergraduate students taking courses in which stochastic process...
  • SPSS for you
    A. Rajathi / P. Chandran
    In an era where statistical analysis underpins breakthroughs across all fields, the importance of mastering statistical software cannot be overstated. 'SPSS for you' emerges as a pivotal resource for anyone keen to navigate the complexities of statistical analysis with ease and precision. Drawing from over 25 years of teaching experience, practical guidance in statistical analy...
    Disponible

    29,30 €

Otros libros del autor

  • Statistical Methods for Imbalanced Data in Ecological and Biological Studies
    Osamu Komori / Shinto Eguchi
    This book presents a fresh, new approach in that it provides a comprehensive recent review of challenging problems caused by imbalanced data in prediction and classification, and also in that it introduces several of the latest statistical methods of dealing with these problems. The book discusses the property of the imbalance of data from two points of view. The first is quant...
    Disponible

    67,89 €