Hadoop搭建和使用
2018-08-01 本文已影响0人
Claire_cc
1.搭建Hadoop
安装环境:虚拟机(VMware)Ubuntu 16.04.3
Step 1:安装JDK
(1)从Oracle官网
http://www.oracle.com/technetwork/cn/java/javase/downloads/jdk8-downloads-2133151-zhs.html下载JDK包
(2)解压 :tar -zxvf jdk-8u117-linux-x64.tar.gz
(3)修改全局配置文件vim /etc/profile并激活:source /etc/profile
data:image/s3,"s3://crabby-images/4e5f5/4e5f5fd2da2616d275ec51a58f30b02a9dc09cfb" alt=""
data:image/s3,"s3://crabby-images/24c52/24c52b172380b49eaf1339d492332150d52fc582" alt=""
Step2:安装ssh
Apt-get install openssh-server
报错Unable to fetch some archives, maybe run apt-get update or try with --fix-missing,则输入:apt-get upgrade
报错Unable to correct problems, you have held broken packages.输入
apt-get autoremove openssh-client openssh-server
sudo apt-get install openssh-client openssh-server
安装成功后,输入ssh localhost测试
data:image/s3,"s3://crabby-images/3c814/3c8143a844cd70210a6cc32c4f30cebac10bc826" alt=""
Step3:安装Hadoop
(1)下载解压和Step2类似不再重复
(2)创建hadoop用户和组,并授予执行权限
sudo addgroup hadoop
sudo usermod -a -G hadoop juchang
sudo gedit etc/sudoers
在root ALL=(ALL:ALL) ALL下添加hadoop ALL=(ALL:ALL) ALL.
sudo chmod -R 755 /opt/hadoop
sudo chown -R xxx:hadoop /opt/hadoop
(3)和Step2一样,修改/etc/profile并激活
data:image/s3,"s3://crabby-images/f56b0/f56b0a34b9f54b0af0531c65bbd49ec73eedd982" alt=""
data:image/s3,"s3://crabby-images/93eba/93eba70cdb2c0b88ea7f658194e504322960439f" alt=""
2.运行WordCount实例(非分布式)
hadoop默认是非分布式模式且自带wordcount的demo
cd /opt/hadoop
mkdir input
cp README.txt input
bin/hadoop jar share/hadoop/mapreduce/sources/hadoop-mapreduce-examples-2.7.6-sources.jar org.apache.hadoop.examples.WordCount input output
data:image/s3,"s3://crabby-images/fd6cf/fd6cfe2f6bc5c552845c6e2da085baf38f961cf9" alt=""
3.运行WordCount实例(伪分布式)
(1)修改hadoop的核心配置文件core-site.xml,主要配置hdfs的地址和端口号。data:image/s3,"s3://crabby-images/5490a/5490a3dff9c7cf4fc17056b702cd993c1cf8677c" alt=""
data:image/s3,"s3://crabby-images/2e115/2e115f6e18ec5b8c7b67692101c868869a0b701a" alt=""
data:image/s3,"s3://crabby-images/ebedc/ebedccb6413720940971dd9a913848e9d137db45" alt=""
data:image/s3,"s3://crabby-images/dd467/dd467839f8f6ab43bd6342e61ccc716963d8e87e" alt=""
(5)格式化namenode
hdfs namenode -format
./sbin/start-yarn.sh
data:image/s3,"s3://crabby-images/098f3/098f30ea96be2d8ece46df7b4a30f201e910228a" alt=""
参考资料
https://blog.csdn.net/henni_719/article/details/77732815
https://www.cnblogs.com/lighten/p/6106891.html
http://www.cnblogs.com/lighten/p/6105463.html
https://blog.csdn.net/hearbeat/article/details/50042547