- EDIT (2015-09-12): This example also includes Cassandra now!
- EDIT (2017-04-28): Defined a license (Apache 2), using Zeppelin binary files, upgraded to Java 8.
A step by step guide is available with the blog post: Vagrant + Spark + Zeppelin a Toolbox to the Data Analyst
This is an installation of Apache Spark and Apache Zeppelin based on Debian debian/jessie64
bootstrap.sh
was inspired by gettyimages/docker-spark
There are a few datasets available here.
I'm looking for more datasets, if you want to donate yours please reach out
Apache License Version 2.0