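Book Information
Hadoop Operations (English edition)
- Author: Eric Sammer
- Publisher: Nanjing: Southeast University Press
- ISBN: 9787564142582
- Publication year: 2013
- Listed page count: 283
- File size: 65 MB
- File page count: 295
- Subject terms: Data processing software, Technical manuals, English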
Table of Contents
1. Introduction
2. HDFS
Goals and Motivation
Design
Daemons
Reading and Writing Data
The Read Path
The Write Path
Managing Filesystem Metadata
Namenode High Availability
Namenode Federation
Access and Integration
Command-Line Tools
FUSE
REST Support
3. MapReduce
The Stages of MapReduce
Introducing Hadoop MapReduce
Daemons
When It All Goes Wrong
YARN
4. Planning a Hadoop Cluster
Picking a Distribution and Version of Hadoop
Apache Hadoop
Cloudera's Distribution Including Apache Hadoop
Versions and Features
What Should I Use?
Hardware Selection
Master Hardware Selection
Worker Hardware Selection
Cluster Sizing
Blades, SANs, and Virtualization
Operating System Selection and Preparation
Deployment Layout
Software
Hostnames, DNS, and Identification
Users, Groups, and Privileges
Kernel Tuning
vm.swappiness
vm.overcommit_memory
Disk Configuration
Choosing a Filesystem
Mount Options
Network Design
Network Usage in Hadoop: A Review
1 Gb versus 10 Gb Networks
Typical Network Topologies
5. Installation and Configuration
Installing Hadoop
Apache Hadoop
CDH
Configuration: An Overview
The Hadoop XML Configuration Files
Environment Variables and Shell Scripts
Logging Configuration
HDFS
Identification and Location
Optimization and Tuning
Formatting the Namenode
Creating a /tmp Directory
Namenode High Availability
Fencing Options
Basic Configuration
Automatic Failover Configuration
Format and Bootstrap the Namenodes
Namenode Federation
MapReduce
Identification and Location
Optimization and Tuning
Rack Topology
Security
6. Identity, Authentication, and Authorization
Identity
Kerberos and Hadoop
Kerberos: A Refresher
Kerberos Support in Hadoop
Authorization
HDFS
MapReduce
Other Tools and Systems
Tying It Together
7. Resource Management
What Is Resource Management?
HDFS Quotas
MapReduce Schedulers
The FIFO Scheduler
The Fair Scheduler
The Capacity Scheduler
The Future
8. Cluster Maintenance
Managing Hadoop Processes
Starting and Stopping Processes with Init Scripts
Starting and Stopping Processes Manually
HDFS Maintenance Tasks
Adding a Datanode
Decommissioning a Datanode
Checking Filesystem Integrity with fsck
Balancing HDFS Block Data
Dealing with a Failed Disk
MapReduce Maintenance Tasks
Adding a Tasktracker
Decommissioning a Tasktracker
Killing a MapReduce Job
Killing a MapReduce Task
Dealing with a Blacklisted Tasktracker
9. Troubleshooting
Differential Diagnosis Applied to Systems
Common Failures and Problems
Humans (You)
Misconfiguration
Hardware Failure
Resource Exhaustion
Host Identification and Naming
Network Partitions
"Is the Computer Plugged In?"
E-SPORE
Treatment and Care
War Stories
A Mystery Bottleneck
There's No Place Like 127.0.0.1
10. Monitoring
An Overview
Hadoop Metrics
Apache Hadoop 0.20.0 and CDH3 (metrics 1)
Apache Hadoop 0.20.203 and Later, and CDH4 (metrics 2)
What about SNMP?
Health Monitoring
Host-Level Checks
All Hadoop Processes
HDFS Checks
MapReduce Checks
11. Backup and Recovery
Data Backup
Distributed Copy (distcp)
Parallel Data Ingestion
Namenode Metadata
Appendix: Deprecated Configuration Properties
Index