Inicio > Matemáticas y ciencia > Matemáticas > Probabilidad y estadística > Multidimensional Mining of Massive Text Data
Multidimensional Mining of Massive Text Data

Multidimensional Mining of Massive Text Data

Chao Zhang / Jiawei Han

82,95 €
IVA incluido
Disponible
Editorial:
Springer Nature B.V.
Año de edición:
2019
Materia
Probabilidad y estadística
ISBN:
9783031007866
82,95 €
IVA incluido
Disponible

Selecciona una librería:

  • Librería Samer Atenea
  • Librería Aciertas (Toledo)
  • Kálamo Books
  • Librería Perelló (Valencia)
  • Librería Elías (Asturias)
  • Donde los libros
  • Librería Kolima (Madrid)
  • Librería Proteo (Málaga)

Unstructured text, as one of the most important data forms, plays a crucial role in data-driven decision making in domains ranging from social networking and information retrieval to scientific research and healthcare informatics. In many emerging applications, people’s information need from text data is becoming multidimensional-they demand useful insights along multiple aspects from a text corpus. However, acquiring such multidimensional knowledge from massive text data remains a challenging task.This book presents data mining techniques that turn unstructured text data into multidimensional knowledge. We investigate two core questions. (1) How does one identify task-relevant text data with declarative queries in multiple dimensions? (2) How does one distill knowledge from text data in a multidimensional space? To address the above questions, we develop a text cube framework. First, we develop a cube construction module that organizes unstructured data into a cube structure, by discovering latent multidimensional and multi-granular structure from the unstructured text corpus and allocating documents into the structure. Second, we develop a cube exploitation module that models multiple dimensions in the cube space, thereby distilling from user-selected data multidimensional knowledge. Together, these two modules constitute an integrated pipeline: leveraging the cube structure, users can perform multidimensional, multigranular data selection with declarative queries; and with cube exploitation algorithms, users can extract multidimensional patterns from the selected data for decision making.The proposed framework has two distinctive advantages when turning text data into multidimensional knowledge: flexibility and label-efficiency. First, it enables acquiring multidimensional knowledge flexibly, as the cube structure allows users to easily identify task-relevant data along multiple dimensions at varied granularities and further distill multidimensional knowledge. Second, the algorithms for cube construction and exploitation require little supervision; this makes the framework appealing for many applications where labeled data are expensive to obtain.

Artículos relacionados

  • ENGINEERING UNCERTAINTY AND RISK ANALYSIS
    Sergio E. Serrano
    An integrated coverage of probability, statistics, Monte Carlo simulation, inferential statistics, design of experiments, systems reliability, fitting random data to models, analysis of variance, stochastic processes, and stochastic differential equations for engineers and scientists. The author for first time presents an introduction to the broad field of applied engineering u...
    Disponible

    134,56 €

  • UNDERSTANDING AND CALCULATING THE ODDS
    Catalin Barboianu
    Man’s daily life is full of decisional situations. Whether we have math skills or not, we frequently estimate and compare probabilities, sometimes without realizing it, especially when making decisions. But probabilities are not just simple numbers attached objectively or subjectively to events, as they perhaps look, and their calculus and usage is highly predisposed to qualita...
    Disponible

    31,61 €

  • Random Graphs and Complex Networks
    Remco van der Hofstad
    ...
    Disponible

    112,33 €

  • Introduction to Malliavin Calculus
    David Nualart / Eulalia Nualart
    ...
    Disponible

    60,35 €

  • Probability, Markov Chains, Queues, and Simulation
    William J. Stewart
    Probability, Markov Chains, Queues, and Simulation provides a modern and authoritative treatment of the mathematical processes that underlie performance modeling. The detailed explanations of mathematical derivations and numerous illustrative examples make this textbook readily accessible to graduate and advanced undergraduate students taking courses in which stochastic process...
    Disponible

    185,21 €

  • SPSS for you
    A. Rajathi / P. Chandran
    In an era where statistical analysis underpins breakthroughs across all fields, the importance of mastering statistical software cannot be overstated. 'SPSS for you' emerges as a pivotal resource for anyone keen to navigate the complexities of statistical analysis with ease and precision. Drawing from over 25 years of teaching experience, practical guidance in statistical analy...
    Disponible

    29,30 €