hadoop-single-node-cluster

This installs a single-node Hadoop cluster on one machine with a single command. It is intended for dev setups and simple testing.

This is part of NeverwinterDP, the Data Pipeline for Hadoop.

Help WANTED - Looking for help with

Installation

Two Methods
- Remote - One liner from GitHub
- Local - From local copy of the repo

Remote - One liner

  curl -sSL https://raw.githubusercontent.com/DemandCube/hadoop-single-node-cluster/master/INSTALL-HADOOP | bash -s -- -r
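
In the one-liner above, `bash -s` reads the script from standard input, and everything after `--` is handed to that script as its arguments. The sketch below shows only that mechanism, using a hypothetical stand-in script instead of the real installer, so it is safe to run anywhere. Combining `-r` with `-y` is an assumption based on the usage line, not something tested here.

```shell
# Stand-in for the installer: a tiny script that only prints its arguments.
demo_script='echo "received: $*"'

# Same shape as the one-liner: the script arrives on stdin, flags follow `--`.
received=$(printf '%s\n' "$demo_script" | bash -s -- -r -y)

echo "$received"   # prints: received: -r -y
```

If that holds for the real installer, appending `-y` after `-r` in the one-liner should request the minimal installation remotely.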

Local - Run from local repo

git clone https://github.com/DemandCube/hadoop-single-node-cluster.git
cd hadoop-single-node-cluster
sudo ./INSTALL-HADOOP

NOTE: Meant to be run as root/super user

#Minimal Installation (Installs just HDFS, YARN, Zookeeper, MapReduce2, and Tez)

git clone https://github.com/DemandCube/hadoop-single-node-cluster.git
cd hadoop-single-node-cluster
sudo ./INSTALL-HADOOP -y

#To specify your own custom blueprint file (Not recommended)

git clone https://github.com/DemandCube/hadoop-single-node-cluster.git
cd hadoop-single-node-cluster
sudo ./INSTALL-HADOOP -b my-custom-blueprint.json

Services with their components.

Available components to configure blueprint.json

What is a blueprint?

A blueprint defines the logical structure of a cluster without needing any information about the actual infrastructure. You can therefore use the same blueprint for different numbers of nodes, different IPs, and different domain names.

The component names are Ambari-specific. For convenience, the HDP-2.1 services and their components are listed below.

HDFS - DATANODE, HDFS_CLIENT, JOURNALNODE, NAMENODE, SECONDARY_NAMENODE, ZKFC
YARN - APP_TIMELINE_SERVER, NODEMANAGER, RESOURCEMANAGER, YARN_CLIENT
MAPREDUCE2 - HISTORYSERVER, MAPREDUCE2_CLIENT
GANGLIA - GANGLIA_MONITOR, GANGLIA_SERVER
HBASE - HBASE_CLIENT, HBASE_MASTER, HBASE_REGIONSERVER
HIVE - HIVE_CLIENT, HIVE_METASTORE, HIVE_SERVER, MYSQL_SERVER
HCATALOG - HCAT
WEBHCAT - WEBHCAT_SERVER
NAGIOS - NAGIOS_SERVER
OOZIE - OOZIE_CLIENT, OOZIE_SERVER
PIG - PIG
SQOOP - SQOOP
STORM - DRPC_SERVER, NIMBUS, STORM_REST_API, STORM_UI_SERVER, SUPERVISOR
TEZ - TEZ_CLIENT
FALCON - FALCON_CLIENT, FALCON_SERVER
ZOOKEEPER - ZOOKEEPER_CLIENT, ZOOKEEPER_SERVER
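
As a rough illustration of how these component names fit into a blueprint, the sketch below writes a small file in the general Ambari blueprint layout (a `Blueprints` section plus `host_groups`) and lists the components it requests. The blueprint name, host group name, and component selection are hypothetical. The `blueprint-raw-*.json` templates in this repo may use a different layout, so check them before passing a file like this to `-b`.

```shell
# Write a hypothetical minimal blueprint; all names here are illustrative.
cat > my-custom-blueprint.json <<'EOF'
{
  "Blueprints": {
    "blueprint_name": "single-node-minimal",
    "stack_name": "HDP",
    "stack_version": "2.1"
  },
  "host_groups": [
    {
      "name": "host_group_1",
      "cardinality": "1",
      "components": [
        { "name": "NAMENODE" },
        { "name": "SECONDARY_NAMENODE" },
        { "name": "DATANODE" },
        { "name": "HDFS_CLIENT" },
        { "name": "RESOURCEMANAGER" },
        { "name": "NODEMANAGER" },
        { "name": "YARN_CLIENT" },
        { "name": "HISTORYSERVER" },
        { "name": "MAPREDUCE2_CLIENT" },
        { "name": "TEZ_CLIENT" },
        { "name": "ZOOKEEPER_SERVER" },
        { "name": "ZOOKEEPER_CLIENT" }
      ]
    }
  ]
}
EOF

# List the requested component names, one per line.
# The pattern only matches upper-case names, so the host group name is skipped.
components=$(grep -o '"name": "[A-Z0-9_]*"' my-custom-blueprint.json \
  | sed 's/.*"\([A-Z0-9_]*\)"/\1/')

echo "$components"
```

Because the host group carries no hostnames or IPs, the same file could in principle describe a cluster on any machine, which is the point of a blueprint.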

Compatibility

OS

CentOS 6.5

Should also work on other distros in the Red Hat family.

ENVIRONMENTS (tested on)

vagrant (centos 6.4,6.5 x86_64)
ec2 (centos 6.4,6.5 x86_64)
digitalocean (centos 6.4,6.5 x86_64)

Usage

INSTALL-HADOOP

Usage: INSTALL-HADOOP [-rfyb]
This will
    
    -y,                              (Optional) MINIMAL   - Installs a minimal installation - just HDFS, Yarn, MapReduce, Tez, and Zookeeper
    -b [blueprint file],             (Optional) BLUEPRINT - Specify your own custom blueprint file
    -r,                              (Optional) REMOTE    - Pulls all templates as remote 
    -f,                              (Optional) FORCE     - Forces the install if there is less than 4 GB of RAM
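
Since `-f` exists to override the memory requirement, it can help to check available RAM before running the installer. The following is a minimal sketch, not the installer's own check: the exact cutoff (taken here as 4 GB expressed in kB) and the helper name `has_enough_memory` are assumptions.

```shell
# Assumed threshold: 4 GB expressed in kB; the installer's real cutoff may differ.
required_kb=$((4 * 1024 * 1024))

# Succeeds when the given amount of memory (in kB) meets the threshold.
has_enough_memory() {
  [ "$1" -ge "$required_kb" ]
}

# On Linux, MemTotal in /proc/meminfo is reported in kB.
total_kb=$(awk '/^MemTotal:/ {print $2}' /proc/meminfo 2>/dev/null)

if [ -z "$total_kb" ]; then
  echo "Could not read /proc/meminfo; skipping memory check"
elif has_enough_memory "$total_kb"; then
  echo "Memory OK: run sudo ./INSTALL-HADOOP"
else
  echo "Less than 4 GB of RAM: the installer would need -f"
fi
```

Note that a machine sold as "4 GB" usually reports a MemTotal slightly below this threshold, because part of the memory is reserved by the kernel and firmware.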

Contributing

See the [NeverwinterDP Guide to Contributing](https://github.com/DemandCube/NeverwinterDP#how-to-contribute)
