
作者:JanKunigk[等]著
页数:605
出版社:东南大学出版社
出版日期:2020
ISBN:9787564188245
电子书格式:pdf/epub/txt
内容简介
关于大数据技术的信息很好丰富,但将所有这些技术无缝拼接成端到端的企业数据平台却是一项艰巨的任务,一直以来没有引起广泛的讨论。通过这本实用的指导书,你将了解如何在企业内部和云计算平台中构建大数据基础设施,并成功地构建出现代数据平台。对于企业架构师、IT经理、应用程序架构师和数据工程师来说,这是一本理想读物,它为你展示了如何克服Hadoop项目中出现的诸多挑战。
本书特色
可以通过这本由币和比特币编程领域一流教师编写的实践指导书深入了解比特币技术。 作者向Python程序员和开发人员展示了如何从零开始编写比特币库。你将学习如何使用这个流行的加密货币及区块链支付系统背后的基础知识,包括数学,密码学,区块和交易规则。
目录
Foreword
Preface
1.Big DataTechnologyPrimer
A Tour of the Landscape
Core Components
Computational Frameworks
Analytical SQL Engines
Storage Engines
Ingestion
Orchestration
Summary
Part Ⅰ.Infrastructure
2.Clusters
Reasons for Multiple Clusters
Multiple Clusters for Resiliency
Multiple Clusters for Software Development
Multiple Clusters for Workload Isolation
Multiple Clusters for Legal Separation
Multiple Clusters and Independent Storage and Compute
Multitenancy
Requirements for Multitenancy
Sizing Clusters
Sizing by Storage
Sizing by Ingest Rate
Sizing by Woddoad
Cluster Growth
The Drivers of Cluster Growth
Implementing Cluster Growth
Data Replication
Replication for Software Development
Replication and Workload Isolation
Summary
3.Computeand Storage
Computer Architecture for Hadoop
Commodity Servers
Server CPUs and RAM
Nonuniform Memory Access
CPU Specifications
RAM
Commoditized Storage Meets the Enterprise
Modularity of Compute and Storage
Everything Is Java
Replication or Erasure Coding?
Alternatives
Hadoop and the Linux Storage Stack
User Space
Important System CalIs
The Linux Page Cache
Short-Circuit and Zero-Copy Reads
Filesystems
Erasure Coding Versus Replication
Discussion
Guidance
Low-Level Storage
Storage Controllers
Disk Layer
Server Form Factors
Form Factor Comparison
Guidance
Workload Profiles
Cluster Configurations and Node Types
Master Nodes
Worker Nodes
Utility Nodes
……
Part Ⅱ Platform
Part Ⅲ Taking Hadoop to the Cloud
A.Backup Onboarding Checklist.
Index
Preface
1.Big DataTechnologyPrimer
A Tour of the Landscape
Core Components
Computational Frameworks
Analytical SQL Engines
Storage Engines
Ingestion
Orchestration
Summary
Part Ⅰ.Infrastructure
2.Clusters
Reasons for Multiple Clusters
Multiple Clusters for Resiliency
Multiple Clusters for Software Development
Multiple Clusters for Workload Isolation
Multiple Clusters for Legal Separation
Multiple Clusters and Independent Storage and Compute
Multitenancy
Requirements for Multitenancy
Sizing Clusters
Sizing by Storage
Sizing by Ingest Rate
Sizing by Woddoad
Cluster Growth
The Drivers of Cluster Growth
Implementing Cluster Growth
Data Replication
Replication for Software Development
Replication and Workload Isolation
Summary
3.Computeand Storage
Computer Architecture for Hadoop
Commodity Servers
Server CPUs and RAM
Nonuniform Memory Access
CPU Specifications
RAM
Commoditized Storage Meets the Enterprise
Modularity of Compute and Storage
Everything Is Java
Replication or Erasure Coding?
Alternatives
Hadoop and the Linux Storage Stack
User Space
Important System CalIs
The Linux Page Cache
Short-Circuit and Zero-Copy Reads
Filesystems
Erasure Coding Versus Replication
Discussion
Guidance
Low-Level Storage
Storage Controllers
Disk Layer
Server Form Factors
Form Factor Comparison
Guidance
Workload Profiles
Cluster Configurations and Node Types
Master Nodes
Worker Nodes
Utility Nodes
……
Part Ⅱ Platform
Part Ⅲ Taking Hadoop to the Cloud
A.Backup Onboarding Checklist.
Index














