Inicio > > Informática: cuestiones generales > Effiziente Datendeduplizierung in Hadoop
Effiziente Datendeduplizierung in Hadoop

Effiziente Datendeduplizierung in Hadoop

Parth Shah / Priteshkumar Prajapati

60,52 €
IVA incluido
Disponible
Editorial:
KS OmniScriptum Publishing
Año de edición:
2025
Materia
Informática: cuestiones generales
ISBN:
9786202087261
60,52 €
IVA incluido
Disponible

Selecciona una librería:

  • Librería Samer Atenea
  • Librería Aciertas (Toledo)
  • Kálamo Books
  • Librería Perelló (Valencia)
  • Librería Elías (Asturias)
  • Donde los libros
  • Librería Kolima (Madrid)
  • Librería Proteo (Málaga)

Hadoop wird häufig für die Speicherung massiv verteilter Daten verwendet. Obwohl es sehr fehlertolerant und skalierbar ist und auf handelsüblicher Hardware läuft, bietet es keine effiziente und optimierte Datenspeicherlösung. Wenn Benutzer Dateien mit identischem Inhalt in Hadoop hochladen, werden alle Dateien im HDFS (Hadoop Distributed File System) gespeichert, auch wenn der Inhalt identisch ist, was zu einer Duplizierung des Inhalts und damit zu einer Verschwendung von Speicherplatz führt. Datendeduplizierung ist ein Prozess zur Reduzierung der erforderlichen Speicherkapazität, da nur die eindeutigen Dateninstanzen gespeichert werden. Der Prozess der Datendeduplizierung wird häufig in Dateiservern, Datenbankmanagementsystemen, Backup-Speichern und vielen anderen Speicherlösungen eingesetzt. Eine geeignete Deduplizierungsstrategie nutzt den Speicherplatz auf den begrenzten Speichergeräten ausreichend aus. Hadoop bietet keine Lösung zur Datendeduplizierung. In dieser Arbeit wurde das Modul zur Deduplizierung in das Hadoop-Framework integriert, um eine optimierte Datenspeicherung zu erreichen.

Artículos relacionados

  • Interview with Jeffery Khoury, Bringing Telemedicine to the People
    Richard G Lowe Jr
    Did you know you can consult with a medical specialist over your smartphone from the comfort of your own home? Imagine speaking to a highly-trained and accredited doctor about whatever is ailing you from virtually anywhere in the world.Thanks to a young entrepreneur named Jeffery Khoury, you can get the advice you need from a pool of medical specialists without waiting in a doc...
  • IT Consulting Secrets
    Carl A Katz
    This book is for IT consultants of all experience levels and the content is relevant to any IT support business model from managed services (MSP) to break/fix. The author has methodically compiled these strategies and this information from over sixteen years of experience working in the IT support field at the small and medium sized business and enterprise levels. ...
    Disponible

    29,41 €

  • Modeling, Analysis, and Applications in Metaheuristic Computing
    Peng-Yeng Yin
    The engineering and business problems the world faces today have become more impenetrable and unstructured, making the design of a satisfactory problem-specific algorithm nontrivial. Modeling, Analysis, and Applications in Metaheuristic Computing: Advancements and Trends is a collection of the latest developments, models, and applications within the transdisciplinary fields rel...
  • Knowledge Management and Drivers of Innovation in Services Industries
    Knowledge Management is concerned with all aspects of eliciting, acquiring, modelling, and managing knowledge. Application of knowledge resources successfully helps the organization to deliver creative products and services. Especially in service business, service job experience and information about the customer, as well as the installed site equipment, are key factors to deli...
  • Current Trends and Future Practices for Digital Literacy and Competence
    Antonio Cartelli
    Being a digital citizen has transformed from a process of familiarizing ones’ self with terminology and techniques to a full-time responsibility in the hands of any who want to stay abreast of the latest technological change in their respective field. Current Trends and Future Practices for Digital Literacy and Competence offers a look at the latest research within digital lite...
  • Human Rights and Risks in the Digital Era
    Globalization, along with its digital and information communication technology counterparts, including the Internet and cyberspace, may signify a whole new era for human rights, characterized by new tensions, challenges, and risks for human rights, as well as new opportunities. Human Rights and Risks in the Digital Era: Globalization and the Effects of Information Technologies ...

Otros libros del autor

  • Efektywna deduplikacja danych w Hadoop
    Parth Shah / Priteshkumar Prajapati
    Hadoop jest szeroko stosowany do masowego przechowywania danych. Mimo że jest bardzo odporny na awarie, skalowalny i działa na standardowym sprzęcie, nie zapewnia wydajnego i zoptymalizowanego rozwiązania do przechowywania danych. Gdy użytkownik przesyła pliki o tej samej zawartości do Hadoop, wszystkie pliki są przechowywane w HDFS (Hadoop Distributed File System), nawet jeśli...
    Disponible

    60,52 €

  • Desduplicação eficiente de dados no Hadoop
    Parth Shah / Priteshkumar Prajapati
    O Hadoop é amplamente utilizado para armazenamento de dados massivamente distribuído. Embora seja altamente tolerante a falhas, escalável e funcione em hardware comum, ele não oferece uma solução de armazenamento de dados eficiente e otimizada. Quando o utilizador carrega ficheiros com o mesmo conteúdo no Hadoop, ele armazena todos os ficheiros no HDFS (Hadoop Distributed File ...
    Disponible

    60,52 €

  • Déduplication efficace des données dans Hadoop
    Parth Shah / Priteshkumar Prajapati
    Hadoop est largement utilisé pour le stockage massif de données distribuées. Même s’il est hautement tolérant aux pannes, évolutif et fonctionne sur du matériel standard, il ne fournit pas de solution de stockage de données efficace et optimisée. Lorsque l’utilisateur télécharge des fichiers avec le même contenu dans Hadoop, celui-ci stocke tous les fichiers dans HDFS (Hadoop D...
    Disponible

    60,52 €

  • Deduplicazione efficiente dei dati in Hadoop
    Parth Shah / Priteshkumar Prajapati
    Hadoop è ampiamente utilizzato per l’archiviazione di dati distribuiti su larga scala. Sebbene sia altamente tollerante ai guasti, scalabile e funzionante su hardware standard, non fornisce una soluzione di archiviazione dati efficiente e ottimizzata. Quando un utente carica file con lo stesso contenuto su Hadoop, tutti i file vengono archiviati su HDFS (Hadoop Distributed File...
    Disponible

    60,52 €