Big Data Science & Analytics

Big Data Science & Analytics

Big Data Science & Analytics

Arshdeep Bahga / Vijay Madisetti

73,67 €
IVA incluido
Consulta disponibilidad
Editorial:
Vijay Madisetti
Año de edición:
2016
ISBN:
9780996025546

Selecciona una librería:

  • Librería Samer Atenea
  • Librería Aciertas (Toledo)
  • Kálamo Books
  • Librería Perelló (Valencia)
  • Librería Elías (Asturias)
  • Donde los libros
  • Librería Kolima (Madrid)
  • Librería Proteo (Málaga)

Data and information are fuel of this new age where powerful analytics algorithms burn this fuel to generate decisions that are expected to create a smarter and more efficient world for all of us to live in. This new area of technology has been defined as Big Data Science and Analytics, and the industrial and academic communities are realizing this as a competitive technology that can generate significant new wealth and opportunity. Big data is defined as collections of datasets whose volume, velocity or variety is so large that it is difficult to store, manage, process and analyze the data using traditional databases and data processing tools. Big data science and analytics deals with collection, storage, processing and analysis of massive-scale data. Industry surveys, by Gartner and e-Skills, for instance, predict that there will be over 2 million job openings for engineers and scientists trained in the area of data science and analytics alone, and that the job market is in this area is growing at a 150 percent year-over-year growth rate.   We have written this textbook, as part of our expanding 'A Hands-On Approach'(TM) series, to meet this need at colleges and universities, and also for big data service providers who may be interested in offering a broader perspective of this emerging field to accompany their customer and developer training programs. The typical reader is expected to have completed a couple of courses in programming using traditional high-level languages at the college-level, and is either a senior or a beginning graduate student in one of the science, technology, engineering or mathematics (STEM) fields. An accompanying website for this book contains additional support for instruction and learning (www.big-data-analytics-book.com)  The book is organized into three main parts, comprising a total of twelve chapters. Part I provides an introduction to big data, applications of big data, and big data science and analytics patterns and architectures. A novel data science and analytics application system design methodology is proposed and its realization through use of open-source big data frameworks is described. This methodology describes big data analytics applications as realization of the proposed Alpha, Beta, Gamma and Delta models, that comprise tools and frameworks for collecting and ingesting data from various sources into the big data analytics infrastructure, incorporating distributed filesystems and non-relational (NoSQL) databases for data storage, and processing frameworks for batch and real-time analytics. This new methodology forms the pedagogical foundation of this book.  Part II introduces the reader to various tools and frameworks for big data analytics, and the architectural and programming aspects of these frameworks, with examples in Python. We describe Publish-Subscribe messaging frameworks (Kafka & Kinesis), Source-Sink connectors (Flume), Database Connectors (Sqoop), Messaging Queues (RabbitMQ, ZeroMQ, RestMQ, Amazon SQS) and custom REST, WebSocket and MQTT-based connectors. The reader is introduced to data storage, batch and real-time analysis, and interactive querying frameworks including HDFS, Hadoop, MapReduce, YARN, Pig, Oozie, Spark, Solr, HBase, Storm, Spark Streaming, Spark SQL, Hive, Amazon Redshift and Google BigQuery. Also described are serving databases (MySQL, Amazon DynamoDB, Cassandra, MongoDB) and the Django Python web framework.   Part III introduces the reader to various machine learning algorithms with examples using the Spark MLlib and H2O frameworks, and visualizations using frameworks such as Lightning, Pygal and Seaborn. 3

Artículos relacionados

  • Amazon API Gateway Developer Guide
    Documentation Team
    Amazon API Gateway is an AWS service that enables developers to create, publish, maintain, monitor, and secure APIs at any scale. You can create APIs that access AWS or other web services, as well as data stored in the AWS Cloud.Topics Gateway to AWS Cloud and Beyond Developer Experiences Benefits of API Gateway Amazon API Gateway Concepts ...
    Disponible

    171,14 €

  • AWS OpsWorks User Guide
    Documentation Team
    AWS OpsWorks is a configuration management service that helps you configure and operate applications in a cloud enterprise by using Puppet or Chef. AWS OpsWorks Stacks and AWS OpsWorks for Chef Automate let you use Chef cookbooks and solutions for configuration management, while AWS OpsWorks for Puppet Enterprise lets you configure a Puppet Enterprise master server in AWS. Pupp...
    Disponible

    153,05 €

  • Amazon Simple Storage Service Developer Guide
    Documentation Team
    Amazon Simple Storage Service is storage for the Internet. It is designed to make web-scale computing easier for developers. Amazon S3 has a simple web services interface that you can use to store and retrieve any amount of data, at any time, from anywhere on the web. It gives any developer access to the same highly scalable, reliable, fast, inexpensive data storage infrastruct...
    Disponible

    150,68 €

  • Amazon CloudFront Developer Guide
    Documentation Team
    Amazon CloudFront is a web service that speeds up distribution of your static and dynamic web content, such as .html, .css, .js, and image files, to your users. CloudFront delivers your content through a worldwide network of data centers called edge locations. When a user requests content that you’re serving with CloudFront, the user is routed to the edge location that provides...
    Disponible

    93,22 €

  • AWS Elemental MediaStore User Guide
    Development Team
    AWS Elemental MediaStore is a video origination and storage service that offers the high performance and immediate consistency required for live origination. With AWS Elemental MediaStore, you can manage video assets as objects in containers to build dependable, cloud-based media workflows.To use the service, you upload your objects from a source, such as an encoder or data fee...
    Disponible

    50,46 €

  • Amazon Route 53 Developer Guide
    Development Team
    You can use Amazon Route 53 to help you get a website or web application up and running. Route 53 performs three main functions: Register domain names – Your website needs a name, such as example.com. Route 53 lets you registera name for your website or web application, known as a domain name. For an overview, see How Domain Registration Works. Route internet traffic to the ...
    Disponible

    120,53 €

Otros libros del autor

  • Cloud Computing Solutions Architect
    Arshdeep Bahga / Vijay Madisetti
    A recent industry report from Gartner points out that choices related to cloud computing at enterprises have changed from "if" to "how" to build, deploy, consume, manage, secure and integrate cloud services into their operations.  The cloud solutions architect is the person who defines the enterprise cloud strategy from a technical point of view and must take responsibility for...
  • Blockchain Applications
    Arshdeep Bahga / Vijay Madisetti
    In the US, the services sector provides employment to about 100 million, while the manufacturing sector provides employment to about 20 million. These sectors are highly automated, and driven by sophisticated business processes forming an integral part of the digital economy. While the applications themselves may be distributed over the Internet in time and space, the core busi...
  • Internet of Things
    Arshdeep Bahga / Vijay Madisetti
    Internet of Things (IoT) refers to physical and virtual objects that have unique identities and are connected to the internet to facilitate intelligent applications that make energy, logistics, industrial control, retail, agriculture and many other domains 'smarter'. Internet of Things is a new revolution of the Internet that is rapidly gathering momentum driven by the advancem...
    Disponible

    51,73 €

  • Cloud Computing
    Arshdeep Bahga / Vijay Madisetti
    Recent industry surveys expect the cloud computing services market to be in excess of $20 billion and cloud computing jobs to be in excess of 10 million worldwide in 2014 alone. In addition, since a majority of existing information technology (IT) jobs is focused on maintaining legacy in-house systems, the demand for these kinds of jobs is likely to drop rapidly if cloud comput...
    Disponible

    51,76 €