Skip to content

MSRSSP/docker-hadoop-3

 
 

Repository files navigation

Docker file for Hadoop 3

Most of the work is coming from : http://bigdatums.net/2017/11/04/creating-hadoop-docker-image/

Just added a few adaptations for Hadoop 3.

For some details about Hadoop 3 (such as new ports), see: https://fr.slideshare.net/HadoopSummit/hadoop-3-in-a-nutshell

Please, read the content of Dockerfile, because it may be possible that you have to update it. See the comments about the tgz of hadoop3.

After starting the container, you can access the web UI:

Warning: hue is not fully functional... Its integration is a work in progess (file browsing is ok) !

How-to

  • Build the image
sudo docker build -t hadoop3 .
  • Run the container
sudo docker run --hostname=hadoop3 -p 8088:8088 -p 9870:9870 -p 9864:9864 -p 19888:19888 \
  -p 8042:8042 -p 8888:8888 --name hadoop3 -d hadoop3
  • Access the container
sudo docker exec -it hadoop3 bash
  • Test a job
yarn jar $HADOOP_HOME/share/hadoop/mapreduce/hadoop-mapreduce-examples-3.0.0.jar pi 10 100
  • Clean
sudo docker stop hadoop3 
sudo docker rm hadoop3 

Next steps

Product/Framework/Env. Version (R)equired/((O)ptional
Hue 4.1 R
Hive 2.3.2 R
Minifi ? O
Druid ? O
Kafka ? O
Storm ? O
Spark 2.2.0 O
Ambari 2.6.1 O
Ambari-metrics 2.6.1 O
HBase ? O

Some notes

About

Docker file for Hadoop 3

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Shell 100.0%