Inicio > Lenguas > Lingüistica > Statistical Methods for Annotation Analysis
Statistical Methods for Annotation Analysis

Statistical Methods for Annotation Analysis

Massimo Poesio / Ron Artstein / Silviu Paun

96,17 €
IVA incluido
Disponible
Editorial:
Springer Nature B.V.
Año de edición:
2022
Materia
Lingüistica
ISBN:
9783031037535
96,17 €
IVA incluido
Disponible

Selecciona una librería:

  • Librería Samer Atenea
  • Librería Aciertas (Toledo)
  • Kálamo Books
  • Librería Perelló (Valencia)
  • Librería Elías (Asturias)
  • Donde los libros
  • Librería Kolima (Madrid)
  • Librería Proteo (Málaga)

Labelling data is one of the most fundamental activities in science, and has underpinned practice, particularly in medicine, for decades, as well as research in corpus linguistics since at least the development of the Brown corpus. With the shift towards Machine Learning in Artificial Intelligence (AI), the creation of datasets to be used for training and evaluating AI systems, also known in AI as corpora, has become a central activity in the field as well. Early AI datasets were created on an ad-hoc basis to tackle specific problems. As larger and more reusable datasets were created, requiring greater investment, the need for a more systematic approach to dataset creation arose to ensure increased quality. A range of statistical methods were adopted, often but not exclusively from the medical sciences, to ensure that the labels used were not subjective, or to choose among different labels provided by the coders. A wide variety of such methods is now in regular use. This book is meant to provide a survey of the most widely used among these statistical methods supporting annotation practice. As far as the authors know, this is the first book attempting to cover the two families of methods in wider use. The first family of methods is concerned with the development of labelling schemes and, in particular, ensuring that such schemes are such that sufficient agreement can be observed among the coders. The second family includes methods developed to analyze the output of coders once the scheme has been agreed upon, particularly although not exclusively to identify the most likely label for an item among those provided by the coders. The focus of this book is primarily on Natural Language Processing, the area of AI devoted to the development of models of language interpretation and production, but many if not most of the methods discussed here are also applicable to other areas of AI, or indeed, to other areas of Data Science.

Artículos relacionados

  • User-Centered Computer Aided Language Learning
    Giorgos Zacharia / Panayiotis Zaphiris
    ...
  • Deep Learning for Natural Language Processing
    Marco Antonio Valenzuela-Escárcega / Mihai Surdeanu
    ...
    Disponible

    47,60 €

  • Deep Learning for Natural Language Processing
    Marco Antonio Valenzuela-Escárcega / Mihai Surdeanu
    ...
  • Lecciones sobre espinosa medrano
    Luis Jaime Cisneros Vizquerra
    La obra de Juan de Espinosa Medrano, apodado en su tiempo «El Lunarejo» (c. 1629-1688), fue uno de los mayores focos de interés académico de Luis Jaime Cisneros (1921-2011). En 1980 aparecieron sus primeros trabajos dedicados a estudiar los textos capitales de Espinosa Medrano (el Apologético en favor de don Luis de Góngora, la Panegírica declamación por la protección de las ci...
    Disponible

    17,63 €

  • Lyre Book
    Matthew Kilbane
    Redefines modern lyric poetry at the intersection of literary and media studies.In The Lyre Book, Matthew Kilbane urges literary scholars to consider lyric not as a genre or a reading practice but as a media condition: the generative tension between writing and sound. In addition to clarifying issues central to the study of modern poetry--including its proximity to popular song...
    Disponible

    50,84 €

  • Translation-mediated Communication in a Digital World
    David Ashworth / Minako O’Hagan
    The Internet is accelerating globalization by exposing organizations and individuals to global audiences. This in turn is driving teletranslation and teleinterpretation, new types of multilingual support, which are functional in digital communications environments. The book describes teletranslation and teleinterpretation by exploring a number of key emerging contexts for langu...
    Disponible

    45,19 €